Search | arXiv e-print repository

arXiv:2312.08634 [pdf]

Unlocking High Performance, Ultra-Low Power Van der Waals Transistors: Towards Back-End-of-Line In-Sensor Machine Vision Applications

Authors: Olaiyan Alolaiyan, Shahad Albwardi, Sarah Alsaggaf, Thamer Tabbakh, Frank W. DelRio, Moh. R. Amer

Abstract: Recent reports on machine learning (ML) and machine vision (MV) devices have demonstrated the potentials of 2D materials and devices. Yet, scalable 2D devices are being challenged by contact resistance and Fermi Level Pinning (FLP), power consumption, and low-cost CMOS compatible lithography processes. To enable CMOS+2D, it is essential to find a proper lithography strategy that can fulfill these… ▽ More Recent reports on machine learning (ML) and machine vision (MV) devices have demonstrated the potentials of 2D materials and devices. Yet, scalable 2D devices are being challenged by contact resistance and Fermi Level Pinning (FLP), power consumption, and low-cost CMOS compatible lithography processes. To enable CMOS+2D, it is essential to find a proper lithography strategy that can fulfill these requirements. Here, we explore modified van der Waals (vdW) deposition lithography and demonstrate a relatively new class of van-der-Waals-Field-Effect-Transistors (vdW-FETs) based on 2D materials. This lithography strategy enables us to unlock high performance devices evident by high current on-off ratio (Ion/Ioff), high turn-on current density (Ion), and weak Fermi Level Pinning (FLP). We utilize this approach to demonstrate a gate-tunable near-ideal diode using MoS2/WSe2 heterojunction with an ideality factor of ~1.65 and current rectification of 102. We finally demonstrate a highly sensitive, scalable, and ultra-low power phototransistor using MoS2/ WSe2 vdW-FET for Back-End-of-Line (BEOL) integration. Our phototransistor exhibits the highest gate-tunable photoresponsivity achieved to date for white light detection with ultra-low power dissipation, enabling ultra-sensitive, ultra-fast, and efficient optoelectronic applications such as in-sensor neuromorphic machine vision. Our approach shows the great potential of modified vdW deposition lithography for back-end-of-line CMOS+2D applications. △ Less

Submitted 13 December, 2023; originally announced December 2023.

arXiv:2302.14394 [pdf]

Exploring Layer Thinning of Exfoliated \b{eta}-Tellurene and Room Temperature Photoluminescence with Large Exciton Binding Energy Revealed in TeO2

Authors: Ghadeer Aljalham, Sarah Alsaggaf, Shahad Albawardi, Thamer Tabbakh, Frank W. DelRio, Moh. R. Amer

Abstract: Due to its tunable band gap, anisotropic behavior, and superior thermoelectric properties, device applications using layered tellurene (Te) are becoming attractive. Here, we report a thinning technique for exfoliated tellurene nanosheets using thermal annealing in an oxygen environment. We characterize different thinning parameters including temperature and annealing time. Based on our measurement… ▽ More Due to its tunable band gap, anisotropic behavior, and superior thermoelectric properties, device applications using layered tellurene (Te) are becoming attractive. Here, we report a thinning technique for exfoliated tellurene nanosheets using thermal annealing in an oxygen environment. We characterize different thinning parameters including temperature and annealing time. Based on our measurements, we show that controlled layer thinning occurs in the narrow temperature range of 325 oC to 350 oC. We also show a reliable method to form \b{eta}-tellurene oxide (\b{eta}- TeO2), which is an emerging wide band gap semiconductor with promising electronic and optoelectronic properties. This wide band gap semiconductor exhibits a broad photoluminescence (PL) spectrum with multiple peaks covering the range 1.76 eV to 2.08 eV. This PL emission coupled with Raman spectra are strong evidence of the formation of 2D \b{eta}- TeO2. We discuss the results obtained and the mechanisms of Te thinning and \b{eta}-TeO2 formation at different temperature regimes. We also discuss the optical band gap of \b{eta}-TeO2 and show the existence of pronounced excitonic effects evident by the large exciton binding energy in this 2D \b{eta}-TeO2 system that reach 1.54 eV to 1.62 eV for bulk to monolayer, respectively. Our work can be utilized to have better control over Te nanosheet thickness. It also sheds light on the formation of well-controlled \b{eta}-TeO2 layered semiconductor for electronic and optoelectronic applications. △ Less

Submitted 6 October, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

arXiv:2210.00631 [pdf]

doi 10.1002/smll.202205763

Lattice Transformation from 2-D to Quasi 1-D and Phonon Properties of Exfoliated ZrS2 and ZrSe2

Authors: Awsaf Alsulami, Majed Alharbi, Fadhel Alsaffar, Olaiyan Alolaiyan, Ghadeer Aljalham, Shahad Albawardi, Sarah Alsaggaf, Faisal Alamri, Thamer A. Tabbakh, Moh R. Amer

Abstract: We investigate the thermal properties of these Zirconium based materials using confocal Raman spectroscopy. We observed 2 different and distinctive Raman signatures for exfoliated ZrX2 (where X = S or Se). These Raman modes generally depend on the shape of the exfoliated nanosheets, regardless of the incident laser polarization. These 2 shapes are divided into 2D- ZrX2 and quasi 1D- ZrX2. For 2D-… ▽ More We investigate the thermal properties of these Zirconium based materials using confocal Raman spectroscopy. We observed 2 different and distinctive Raman signatures for exfoliated ZrX2 (where X = S or Se). These Raman modes generally depend on the shape of the exfoliated nanosheets, regardless of the incident laser polarization. These 2 shapes are divided into 2D- ZrX2 and quasi 1D- ZrX2. For 2D- ZrX2, Raman modes are in alignment with those reported in literature. However, for quasi 1D-ZrX2, we show that Raman modes are identical to exfoliated ZrX3 nanosheets, indicating a major lattice transformation from 2D to quasi-1D. We also measure thermal properties of each resonant Raman mode for each ZrX2 shape. Based on our measurements, most Raman modes exhibit a linear downshift dependence with temperature. However, for ZrS2, we see an upshift (blueshift) with temperature for A1g mode, which is attributed to non-harmonic effects caused by dipolar coupling with IR-active modes. Moreover, the observed temperature dependence coefficient for some phonon modes of quasi 1D-ZrX2 differ dramatically, which can be caused by the quasi 1D lattice. Finally, we measure phonon dynamics under optical heating for each of 2D-ZrX2 and quasi 1D-ZrX2 and show phonon confinement in quasi 1D-ZrX2 nanosheets. We extract the thermal conductivity and the interfacial thermal conductance for each of 2D-ZrX2 and quasi 1D-ZrX2 nanosheets. Our calculations indicate lower interfacial thermal conductance for quasi 1D-ZrX2 compared to 2D-ZrX2, which can be attributed to the phonon confinement in 1D. Based on our model, we show low thermal conductivity for all ZrX2 nanosheets. Our results demonstrate exceptional thermal properties for ZrX2 materials, making them ideal for future thermal management strategies and thermoelectric device applications. △ Less

Submitted 2 October, 2022; originally announced October 2022.

Journal ref: Small 2022

arXiv:1909.08703 [pdf, other]

Deep Complex Networks for Protocol-Agnostic Radio Frequency Device Fingerprinting in the Wild

Authors: Ioannis Agadakos, Nikolaos Agadakos, Jason Polakis, Mohamed R. Amer

Abstract: Researchers have demonstrated various techniques for fingerprinting and identifying devices. Previous approaches have identified devices from their network traffic or transmitted signals while relying on software or operating system specific artifacts (e.g., predictability of protocol header fields) or characteristics of the underlying protocol (e.g.,frequency offset). As these constraints can be… ▽ More Researchers have demonstrated various techniques for fingerprinting and identifying devices. Previous approaches have identified devices from their network traffic or transmitted signals while relying on software or operating system specific artifacts (e.g., predictability of protocol header fields) or characteristics of the underlying protocol (e.g.,frequency offset). As these constraints can be a hindrance in real-world settings, we introduce a practical, generalizable approach that offers significant operational value for a variety of scenarios, including as an additional factor of authentication for preventing impersonation attacks. Our goal is to identify artifacts in transmitted signals that are caused by a device's unique hardware "imperfections" without any knowledge about the nature of the signal. We develop RF-DCN, a novel Deep Complex-valued Neural Network (DCN) that operates on raw RF signals and is completely agnostic of the underlying applications and protocols. We present two DCN variations: (i) Convolutional DCN (CDCN) for modeling full signals, and (ii) Recurrent DCN (RDCN) for modeling time series. Our system handles raw I/Q data from open air captures within a given spectrum window, without knowledge of the modulation scheme or even the carrier frequencies. While our experiments demonstrate the effectiveness of our system, especially under challenging conditions where other neural network architectures break down, we identify additional challenges in signal-based fingerprinting and provide guidelines for future explorations. Our work lays the foundation for more research within this vast and challenging space by establishing fundamental directions for using raw RF I/Q data in novel complex-valued networks. △ Less

Submitted 18 September, 2019; originally announced September 2019.

arXiv:1907.09000 [pdf, other]

Image Classification with Hierarchical Multigraph Networks

Authors: Boris Knyazev, Xiao Lin, Mohamed R. Amer, Graham W. Taylor

Abstract: Graph Convolutional Networks (GCNs) are a class of general models that can learn from graph structured data. Despite being general, GCNs are admittedly inferior to convolutional neural networks (CNNs) when applied to vision tasks, mainly due to the lack of domain knowledge that is hardcoded into CNNs, such as spatially oriented translation invariant filters. However, a great advantage of GCNs is t… ▽ More Graph Convolutional Networks (GCNs) are a class of general models that can learn from graph structured data. Despite being general, GCNs are admittedly inferior to convolutional neural networks (CNNs) when applied to vision tasks, mainly due to the lack of domain knowledge that is hardcoded into CNNs, such as spatially oriented translation invariant filters. However, a great advantage of GCNs is the ability to work on irregular inputs, such as superpixels of images. This could significantly reduce the computational cost of image reasoning tasks. Another key advantage inherent to GCNs is the natural ability to model multirelational data. Building upon these two promising properties, in this work, we show best practices for designing GCNs for image classification; in some cases even outperforming CNNs on the MNIST, CIFAR-10 and PASCAL image datasets. △ Less

Submitted 21 July, 2019; originally announced July 2019.

Comments: 13 pages, BMVC 2019

arXiv:1905.03319 [pdf, other]

Data-Efficient Mutual Information Neural Estimator

Authors: Xiao Lin, Indranil Sur, Samuel A. Nastase, Ajay Divakaran, Uri Hasson, Mohamed R. Amer

Abstract: Measuring Mutual Information (MI) between high-dimensional, continuous, random variables from observed samples has wide theoretical and practical applications. Recent work, MINE (Belghazi et al. 2018), focused on estimating tight variational lower bounds of MI using neural networks, but assumed unlimited supply of samples to prevent overfitting. In real world applications, data is not always avail… ▽ More Measuring Mutual Information (MI) between high-dimensional, continuous, random variables from observed samples has wide theoretical and practical applications. Recent work, MINE (Belghazi et al. 2018), focused on estimating tight variational lower bounds of MI using neural networks, but assumed unlimited supply of samples to prevent overfitting. In real world applications, data is not always available at a surplus. In this work, we focus on improving data efficiency and propose a Data-Efficient MINE Estimator (DEMINE), by developing a relaxed predictive MI lower bound that can be estimated at higher data efficiency by orders of magnitudes. The predictive MI lower bound also enables us to develop a new meta-learning approach using task augmentation, Meta-DEMINE, to improve generalization of the network and further boost estimation accuracy empirically. With improved data-efficiency, our estimators enables statistical testing of dependency at practical dataset sizes. We demonstrate the effectiveness of our estimators on synthetic benchmarks and a real world fMRI data, with application of inter-subject correlation analysis. △ Less

Submitted 24 May, 2019; v1 submitted 8 May, 2019; originally announced May 2019.

arXiv:1905.02850 [pdf, other]

Understanding Attention and Generalization in Graph Neural Networks

Authors: Boris Knyazev, Graham W. Taylor, Mohamed R. Amer

Abstract: We aim to better understand attention over nodes in graph neural networks (GNNs) and identify factors influencing its effectiveness. We particularly focus on the ability of attention GNNs to generalize to larger, more complex or noisy graphs. Motivated by insights from the work on Graph Isomorphism Networks, we design simple graph reasoning tasks that allow us to study attention in a controlled en… ▽ More We aim to better understand attention over nodes in graph neural networks (GNNs) and identify factors influencing its effectiveness. We particularly focus on the ability of attention GNNs to generalize to larger, more complex or noisy graphs. Motivated by insights from the work on Graph Isomorphism Networks, we design simple graph reasoning tasks that allow us to study attention in a controlled environment. We find that under typical conditions the effect of attention is negligible or even harmful, but under certain conditions it provides an exceptional gain in performance of more than 60% in some of our classification tasks. Satisfying these conditions in practice is challenging and often requires optimal initialization or supervised training of attention. We propose an alternative recipe and train attention in a weakly-supervised fashion that approaches the performance of supervised models, and, compared to unsupervised models, improves results on several synthetic as well as real datasets. Source code and datasets are available at https://github.com/bknyaz/graph_attention_pool. △ Less

Submitted 28 October, 2019; v1 submitted 7 May, 2019; originally announced May 2019.

Comments: NeurIPS 2019, camera-ready and supplementary material

arXiv:1811.09595 [pdf, other]

Spectral Multigraph Networks for Discovering and Fusing Relationships in Molecules

Authors: Boris Knyazev, Xiao Lin, Mohamed R. Amer, Graham W. Taylor

Abstract: Spectral Graph Convolutional Networks (GCNs) are a generalization of convolutional networks to learning on graph-structured data. Applications of spectral GCNs have been successful, but limited to a few problems where the graph is fixed, such as shape correspondence and node classification. In this work, we address this limitation by revisiting a particular family of spectral graph networks, Cheby… ▽ More Spectral Graph Convolutional Networks (GCNs) are a generalization of convolutional networks to learning on graph-structured data. Applications of spectral GCNs have been successful, but limited to a few problems where the graph is fixed, such as shape correspondence and node classification. In this work, we address this limitation by revisiting a particular family of spectral graph networks, Chebyshev GCNs, showing its efficacy in solving graph classification tasks with a variable graph structure and size. Chebyshev GCNs restrict graphs to have at most one edge between any pair of nodes. To this end, we propose a novel multigraph network that learns from multi-relational graphs. We model learned edges with abstract meaning and experiment with different ways to fuse the representations extracted from annotated and learned edges, achieving competitive results on a variety of chemical classification benchmarks. △ Less

Submitted 23 November, 2018; originally announced November 2018.

Comments: 11 pages, 5 figures, NIPS 2018 Workshop on Machine Learning for Molecules and Materials

arXiv:1804.10652 [pdf, other]

Human Motion Modeling using DVGANs

Authors: Xiao Lin, Mohamed R. Amer

Abstract: We present a novel generative model for human motion modeling using Generative Adversarial Networks (GANs). We formulate the GAN discriminator using dense validation at each time-scale and perturb the discriminator input to make it translation invariant. Our model is capable of motion generation and completion. We show through our evaluations the resiliency to noise, generalization over actions, a… ▽ More We present a novel generative model for human motion modeling using Generative Adversarial Networks (GANs). We formulate the GAN discriminator using dense validation at each time-scale and perturb the discriminator input to make it translation invariant. Our model is capable of motion generation and completion. We show through our evaluations the resiliency to noise, generalization over actions, and generation of long diverse sequences. We evaluate our approach on Human 3.6M and CMU motion capture datasets using inception scores. △ Less

Submitted 18 May, 2018; v1 submitted 27 April, 2018; originally announced April 2018.

arXiv:1707.00750 [pdf, other]

Structure Optimization for Deep Multimodal Fusion Networks using Graph-Induced Kernels

Authors: Dhanesh Ramachandram, Michal Lisicki, Timothy J. Shields, Mohamed R. Amer, Graham W. Taylor

Abstract: A popular testbed for deep learning has been multimodal recognition of human activity or gesture involving diverse inputs such as video, audio, skeletal pose and depth images. Deep learning architectures have excelled on such problems due to their ability to combine modality representations at different levels of nonlinear feature extraction. However, designing an optimal architecture in which to… ▽ More A popular testbed for deep learning has been multimodal recognition of human activity or gesture involving diverse inputs such as video, audio, skeletal pose and depth images. Deep learning architectures have excelled on such problems due to their ability to combine modality representations at different levels of nonlinear feature extraction. However, designing an optimal architecture in which to fuse such learned representations has largely been a non-trivial human engineering effort. We treat fusion structure optimization as a hyper-parameter search and cast it as a discrete optimization problem under the Bayesian optimization framework. We propose a novel graph-induced kernel to compute structural similarities in the search space of tree-structured multimodal architectures and demonstrate its effectiveness using two challenging multimodal human activity recognition datasets. △ Less

Submitted 3 July, 2017; originally announced July 2017.

Comments: Proceedings of the 25th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, April 2017, Bruges, Belgium

arXiv:1609.09146 [pdf]

Photo-thermal Self-oscillations in Cavity-Coupled Carbon Nanotube pn-Devices

Authors: Moh. R. Amer, Tony Levi, Stephen B. Cronin

Abstract: We observe photothermal self-oscillations in individual, suspended, quasi-metallic carbon nanotube (CNT) pn devices irradiated with focused CW 633nm light. Here, the bottom of the trench forms an optical cavity with an anti-node at lambda/4. Oscillations arise from the optical heating of the nanotube, which causes thermal contraction of the nanotube (negative thermal expansion coefficient). This,… ▽ More We observe photothermal self-oscillations in individual, suspended, quasi-metallic carbon nanotube (CNT) pn devices irradiated with focused CW 633nm light. Here, the bottom of the trench forms an optical cavity with an anti-node at lambda/4. Oscillations arise from the optical heating of the nanotube, which causes thermal contraction of the nanotube (negative thermal expansion coefficient). This, in turn, moves the CNT out of the anti-node (maximum field intensity), where the nanotube cools to a lower temperature. It then expands and returns to the maximum field intensity anti-node where it is optically heated once again. The oscillations are observed through a change of the tunneling current in the CNT device. A pn-junction, established by two electrostatic gates positioned beneath the nanotube, results in Zener tunneling, which depends strongly on temperature. A Zener tunneling model with oscillating temperature shows good agreement with our measured I-V curves, providing further evidence that these oscillations are photothermal in nature. △ Less

Submitted 28 September, 2016; originally announced September 2016.

arXiv:1603.06554 [pdf, other]

Action-Affect Classification and Morphing using Multi-Task Representation Learning

Authors: Timothy J. Shields, Mohamed R. Amer, Max Ehrlich, Amir Tamrakar

Abstract: Most recent work focused on affect from facial expressions, and not as much on body. This work focuses on body affect analysis. Affect does not occur in isolation. Humans usually couple affect with an action in natural interactions; for example, a person could be talking and smiling. Recognizing body affect in sequences requires efficient algorithms to capture both the micro movements that differe… ▽ More Most recent work focused on affect from facial expressions, and not as much on body. This work focuses on body affect analysis. Affect does not occur in isolation. Humans usually couple affect with an action in natural interactions; for example, a person could be talking and smiling. Recognizing body affect in sequences requires efficient algorithms to capture both the micro movements that differentiate between happy and sad and the macro variations between different actions. We depart from traditional approaches for time-series data analytics by proposing a multi-task learning model that learns a shared representation that is well-suited for action-affect classification as well as generation. For this paper we choose Conditional Restricted Boltzmann Machines to be our building block. We propose a new model that enhances the CRBM model with a factored multi-task component to become Multi-Task Conditional Restricted Boltzmann Machines (MTCRBMs). We evaluate our approach on two publicly available datasets, the Body Affect dataset and the Tower Game dataset, and show superior classification performance improvement over the state-of-the-art, as well as the generative abilities of our model. △ Less

Submitted 21 March, 2016; originally announced March 2016.

arXiv:1505.02137 [pdf, other]

Human Social Interaction Modeling Using Temporal Deep Networks

Authors: Mohamed R. Amer, Behjat Siddiquie, Amir Tamrakar, David A. Salter, Brian Lande, Darius Mehri, Ajay Divakaran

Abstract: We present a novel approach to computational modeling of social interactions based on modeling of essential social interaction predicates (ESIPs) such as joint attention and entrainment. Based on sound social psychological theory and methodology, we collect a new "Tower Game" dataset consisting of audio-visual capture of dyadic interactions labeled with the ESIPs. We expect this dataset to provide… ▽ More We present a novel approach to computational modeling of social interactions based on modeling of essential social interaction predicates (ESIPs) such as joint attention and entrainment. Based on sound social psychological theory and methodology, we collect a new "Tower Game" dataset consisting of audio-visual capture of dyadic interactions labeled with the ESIPs. We expect this dataset to provide a new avenue for research in computational social interaction modeling. We propose a novel joint Discriminative Conditional Restricted Boltzmann Machine (DCRBM) model that combines a discriminative component with the generative power of CRBMs. Such a combination enables us to uncover actionable constituents of the ESIPs in two steps. First, we train the DCRBM model on the labeled data and get accurate (76\%-49\% across various ESIPs) detection of the predicates. Second, we exploit the generative capability of DCRBMs to activate the trained model so as to generate the lower-level data corresponding to the specific ESIP that closely matches the actual training data (with mean square error 0.01-0.1 for generating 100 frames). We are thus able to decompose the ESIPs into their constituent actionable behaviors. Such a purely computational determination of how to establish an ESIP such as engagement is unprecedented. △ Less

Submitted 28 May, 2015; v1 submitted 6 May, 2015; originally announced May 2015.

Showing 1–13 of 13 results for author: Amer, M R