-
Molecular Absorption-Aware User Assignment, Spectrum, and Power Allocation in Dense THz Networks with Multi-Connectivity
Authors:
Mohammad Amin Saeidi,
Hina Tabassum,
Mehrazin Alizadeh
Abstract:
This paper develops a unified framework to maximize the network sum-rate in a multi-user, multi-BS downlink terahertz (THz) network by optimizing user associations, number and bandwidth of sub-bands in a THz transmission window (TW), bandwidth of leading and trailing edge-bands in a TW, sub-band assignment, and power allocations. The proposed framework incorporates multi-connectivity and captures…
▽ More
This paper develops a unified framework to maximize the network sum-rate in a multi-user, multi-BS downlink terahertz (THz) network by optimizing user associations, number and bandwidth of sub-bands in a THz transmission window (TW), bandwidth of leading and trailing edge-bands in a TW, sub-band assignment, and power allocations. The proposed framework incorporates multi-connectivity and captures the impact of molecular absorption coefficient variations in a TW, beam-squint, molecular absorption noise, and link blockages. To make the problem tractable, we first propose a convex approximation of the molecular absorption coefficient using curve fitting in a TW, determine the feasible bandwidths of the leading and trailing edge-bands, and then derive closed-form optimal solution for the number of sub-bands considering beam-squint constraints. We then decompose joint user associations, sub-band assignment, and power allocation problem into two sub-problems, i.e., \textbf{(i)} joint user association and sub-band assignment, and \textbf{(ii)} power allocation. To solve the former problem, we analytically prove the unimodularity of the constraint matrix which enables us to relax the integer constraint without loss of optimality. To solve power allocation sub-problem, a fractional programming (FP)-based centralized solution as well as an alternating direction method of multipliers (ADMM)-based light-weight distributed solution is proposed. The overall problem is then solved using alternating optimization until convergence. Complexity analysis of the algorithms and numerical convergence are presented. Numerical findings validate the effectiveness of the proposed algorithms and extract useful insights about the interplay of the density of base stations (BSs), Average order of multi-connectivity (AOM), molecular absorption, {hardware impairment}, {imperfect CSI}, and link blockages.
△ Less
Submitted 6 August, 2024;
originally announced August 2024.
-
Real time detection of C reactive protein in interstitial fluid using electrochemical impedance spectroscopy, towards wearable health monitoring
Authors:
Aristea Grammoustianou,
Ali Saeidi,
Johan Longo,
Felix Risch,
Adrian M. Ionescu
Abstract:
Traditional detection methods of C-reactive protein (CRP) inflammation biomarker, in blood are expensive, time-consuming and labor-intensive. Such existing point-of-care CRP detection devices remain invasive, since they need blood sampling (finger-pricking or venous puncture). Here, we propose an electrochemical impedance spectroscopy (EIS)-based sensor for the real-time, fast, specific, sensitive…
▽ More
Traditional detection methods of C-reactive protein (CRP) inflammation biomarker, in blood are expensive, time-consuming and labor-intensive. Such existing point-of-care CRP detection devices remain invasive, since they need blood sampling (finger-pricking or venous puncture). Here, we propose an electrochemical impedance spectroscopy (EIS)-based sensor for the real-time, fast, specific, sensitive, and label-free detection of C-reactive protein in the interstitial fluid (ISF) that can be accessed with minimally invasive microneedle arrays. The sensor has the potential to be integrated in a wearable device similar with the continuous glucose monitoring, that will detect CRP in interstitial fluid in a non-invasive, inexpensive and straightforward manner. The affinity based assay was tested in both buffer and ISF-like solution. The limit of detection achieved was 0.7 ug/mL of CRP in buffer, and 0.8 ug/mL of CRP in ISF-like solution and the sensor shows excellent linearity up to 10 ug/mL. It is worth noting that the proposed sensor operates in low sample volume (down to 5 uL), and has a response time of 100 seconds.
△ Less
Submitted 23 July, 2024;
originally announced July 2024.
-
UnSeenTimeQA: Time-Sensitive Question-Answering Beyond LLMs' Memorization
Authors:
Md Nayem Uddin,
Amir Saeidi,
Divij Handa,
Agastya Seth,
Tran Cao Son,
Eduardo Blanco,
Steven R. Corman,
Chitta Baral
Abstract:
This paper introduces UnSeenTimeQA, a novel time-sensitive question-answering (TSQA) benchmark that diverges from traditional TSQA benchmarks by avoiding factual and web-searchable queries. We present a series of time-sensitive event scenarios decoupled from real-world factual information. It requires large language models (LLMs) to engage in genuine temporal reasoning, disassociating from the kno…
▽ More
This paper introduces UnSeenTimeQA, a novel time-sensitive question-answering (TSQA) benchmark that diverges from traditional TSQA benchmarks by avoiding factual and web-searchable queries. We present a series of time-sensitive event scenarios decoupled from real-world factual information. It requires large language models (LLMs) to engage in genuine temporal reasoning, disassociating from the knowledge acquired during the pre-training phase. Our evaluation of six open-source LLMs (ranging from 2B to 70B in size) and three closed-source LLMs reveal that the questions from the UnSeenTimeQA present substantial challenges. This indicates the models' difficulties in handling complex temporal reasoning scenarios. Additionally, we present several analyses shedding light on the models' performance in answering time-sensitive questions.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Investigating and Addressing Hallucinations of LLMs in Tasks Involving Negation
Authors:
Neeraj Varshney,
Satyam Raj,
Venkatesh Mishra,
Agneet Chatterjee,
Ritika Sarkar,
Amir Saeidi,
Chitta Baral
Abstract:
Large Language Models (LLMs) have achieved remarkable performance across a wide variety of natural language tasks. However, they have been shown to suffer from a critical limitation pertinent to 'hallucination' in their output. Recent research has focused on investigating and addressing this problem for a variety of tasks such as biography generation, question answering, abstractive summarization,…
▽ More
Large Language Models (LLMs) have achieved remarkable performance across a wide variety of natural language tasks. However, they have been shown to suffer from a critical limitation pertinent to 'hallucination' in their output. Recent research has focused on investigating and addressing this problem for a variety of tasks such as biography generation, question answering, abstractive summarization, and dialogue generation. However, the crucial aspect pertaining to 'negation' has remained considerably underexplored. Negation is important because it adds depth and nuance to the understanding of language and is also crucial for logical reasoning and inference. In this work, we address the above limitation and particularly focus on studying the impact of negation in LLM hallucinations. Specifically, we study four tasks with negation: 'false premise completion', 'constrained fact generation', 'multiple choice question answering', and 'fact generation'. We show that open-source state-of-the-art LLMs such as LLaMA-2-chat, Vicuna, and Orca-2 hallucinate considerably on all these tasks involving negation which underlines a critical shortcoming of these models. Addressing this problem, we further study numerous strategies to mitigate these hallucinations and demonstrate their impact.
△ Less
Submitted 8 June, 2024;
originally announced June 2024.
-
Triple Preference Optimization: Achieving Better Alignment with Less Data in a Single Step Optimization
Authors:
Amir Saeidi,
Shivanshu Verma,
Aswin RRV,
Chitta Baral
Abstract:
Large Language Models (LLMs) perform well across diverse tasks, but aligning them with human demonstrations is challenging. Recently, Reinforcement Learning (RL)-free methods like Direct Preference Optimization (DPO) have emerged, offering improved stability and scalability while retaining competitive performance relative to RL-based methods. However, while RL-free methods deliver satisfactory per…
▽ More
Large Language Models (LLMs) perform well across diverse tasks, but aligning them with human demonstrations is challenging. Recently, Reinforcement Learning (RL)-free methods like Direct Preference Optimization (DPO) have emerged, offering improved stability and scalability while retaining competitive performance relative to RL-based methods. However, while RL-free methods deliver satisfactory performance, they require significant data to develop a robust Supervised Fine-Tuned (SFT) model and an additional step to fine-tune this model on a preference dataset, which constrains their utility and scalability. In this paper, we introduce Triple Preference Optimization (TPO), a new preference learning method designed to align an LLM with three preferences without requiring a separate SFT step and using considerably less data. Through a combination of practical experiments and theoretical analysis, we show the efficacy of TPO as a single-step alignment strategy. Specifically, we fine-tuned the Phi-2 (2.7B) and Mistral (7B) models using TPO directly on the UltraFeedback dataset, achieving superior results compared to models aligned through other methods such as SFT, DPO, KTO, IPO, CPO, and ORPO. Moreover, the performance of TPO without the SFT component led to notable improvements in the MT-Bench score, with increases of +1.27 and +0.63 over SFT and DPO, respectively. Additionally, TPO showed higher average accuracy, surpassing DPO and SFT by 4.2% and 4.97% on the Open LLM Leaderboard benchmarks. Our code is publicly available at https://github.com/sahsaeedi/triple-preference-optimization .
△ Less
Submitted 26 May, 2024;
originally announced May 2024.
-
Record Acceleration of the Two-Dimensional Ising Model Using High-Performance Wafer Scale Engine
Authors:
Dirk Van Essendelft,
Hayl Almolyki,
Wei Shi,
Terry Jordan,
Mei-Yu Wang,
Wissam A. Saidi
Abstract:
The versatility and wide-ranging applicability of the Ising model, originally introduced to study phase transitions in magnetic materials, have made it a cornerstone in statistical physics and a valuable tool for evaluating the performance of emerging computer hardware. Here, we present a novel implementation of the two-dimensional Ising model on a Cerebras Wafer-Scale Engine (WSE), a revolutionar…
▽ More
The versatility and wide-ranging applicability of the Ising model, originally introduced to study phase transitions in magnetic materials, have made it a cornerstone in statistical physics and a valuable tool for evaluating the performance of emerging computer hardware. Here, we present a novel implementation of the two-dimensional Ising model on a Cerebras Wafer-Scale Engine (WSE), a revolutionary processor that is opening new frontiers in computing. In our deployment of the checkerboard algorithm, we optimized the Ising model to take advantage of the unique WSE architecture. Specifically, we employed a compressed bit representation storing 16 spins on each int16 word, and efficiently distributed the spins over the processing units enabling seamless weak scaling and limiting communications to only immediate neighboring units. Our implementation can handle up to 754 simulations in parallel, achieving an aggregate of over 61.8 trillion flip attempts per second for Ising models with up to 200 million spins. This represents a gain of up to 148 times over previously reported single-device with a highly optimized implementation on NVIDIA V100 and up to 88 times in productivity compared to NVIDIA H100. Our findings highlight the significant potential of the WSE in scientific computing, particularly in the field of materials modeling.
△ Less
Submitted 1 May, 2024; v1 submitted 25 April, 2024;
originally announced April 2024.
-
Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks
Authors:
Amir Saeidi,
Shivanshu Verma,
Chitta Baral
Abstract:
Large Language Models (LLMs) have demonstrated remarkable performance across a spectrum of tasks. Recently, Direct Preference Optimization (DPO) has emerged as an RL-free approach to optimize the policy model on human preferences. However, several limitations hinder the widespread adoption of this method. To address these shortcomings, various versions of DPO have been introduced. Yet, a comprehen…
▽ More
Large Language Models (LLMs) have demonstrated remarkable performance across a spectrum of tasks. Recently, Direct Preference Optimization (DPO) has emerged as an RL-free approach to optimize the policy model on human preferences. However, several limitations hinder the widespread adoption of this method. To address these shortcomings, various versions of DPO have been introduced. Yet, a comprehensive evaluation of these variants across diverse tasks is still lacking. In this study, we aim to bridge this gap by investigating the performance of alignment methods across three distinct scenarios: (1) keeping the Supervised Fine-Tuning (SFT) part, (2) skipping the SFT part, and (3) skipping the SFT part and utilizing an instruction-tuned model. Furthermore, we explore the impact of different training sizes on their performance. Our evaluation spans a range of tasks including dialogue systems, reasoning, mathematical problem-solving, question answering, truthfulness, and multi-task understanding, encompassing 13 benchmarks such as MT-Bench, Big Bench, and Open LLM Leaderboard. Key observations reveal that alignment methods achieve optimal performance with smaller training data subsets, exhibit limited effectiveness in reasoning tasks yet significantly impact mathematical problem-solving, and employing an instruction-tuned model notably influences truthfulness. We anticipate that our findings will catalyze further research aimed at developing more robust models to address alignment challenges.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
Plasmon-driven creation of magnetic topological structures
Authors:
W. Al Saidi,
R. Sbiaa,
Y. Dusch,
N. Tiercelin
Abstract:
In the present research, we demonstrate the usage of plasmonic effects in thin film structures to control magnetic topological textures, specifically skyrmions and skyrmioniums. We investigate numerically the generation and alteration of these topological structures caused by hemisphere gold nanoparticle placed over a magnetic layer coated with a dielectric material. The electromagnetic and photot…
▽ More
In the present research, we demonstrate the usage of plasmonic effects in thin film structures to control magnetic topological textures, specifically skyrmions and skyrmioniums. We investigate numerically the generation and alteration of these topological structures caused by hemisphere gold nanoparticle placed over a magnetic layer coated with a dielectric material. The electromagnetic and photothermal models are used to clarify the processes of producing heat and absorption, and the results were implemented in micromagnetic formalism to reveal the dynamics of magnetization under various conditions. Our findings demonstrate the significance of the laser pulse duration and the contact area between nanoparticles and the underlying magnetic layer in forming topological textures. In particular, we show how to generate a single skyrmion, multiple skyrmions, and skyrmioniums, and how to dynamically transition between these states. These results highlight the possibility of manipulating magnetic textures by using plasmonic effects, which presents significant opportunities for spintronics and non-conventional computer applications.
△ Less
Submitted 12 March, 2024;
originally announced March 2024.
-
Exploring the formation of gold/silver nanoalloys with gas-phase synthesis and machine-learning assisted simulations
Authors:
Quentin Gromoff,
Patrizio Benzo,
Wissam A. Saidi,
Christopher M. Andolina,
Marie-José Casanove,
Teresa Hungria,
Sophie Barre,
Magali Benoit,
Julien Lam
Abstract:
While nanoalloys are of paramount scientific and practical interests, the main processes leading to their formation are still poorly understood. Key structural features in the alloy systems, including crystal phase, chemical ordering, and morphology, are challenging to control at the nanoscale, making it difficult to transfer their usage to industrial applications. In this contribution, we focus o…
▽ More
While nanoalloys are of paramount scientific and practical interests, the main processes leading to their formation are still poorly understood. Key structural features in the alloy systems, including crystal phase, chemical ordering, and morphology, are challenging to control at the nanoscale, making it difficult to transfer their usage to industrial applications. In this contribution, we focus on the gold/silver system that has two of the most prevalent noble metals, and combine experiments with simulations to uncover the formation mechanisms at the atomic-level. Nanoparticles are produced using state-of-the-art inert-gas aggregation source and analyzed using transmission electron microscopy and energy-dispersive x-ray spectroscopy. Machine-learning-assisted molecular dynamics simulations are employed to model the crystallization process from liquid droplets to nanocrystals. Our study finds a preponderance of nanoparticles with five-fold symmetric morphology, including icosahedron and decahedron which is consistent with previous results on mono-metallic nanoparticles. However, we observe that gold atoms, rather than silver atoms, segregate at the surface of the obtained nanoparticles for all the considered alloy compositions. These segregation tendencies are in contrast to previous studies and have consequences on the crystallization dynamics and the subsequent crystal ordering. We finally show that the underpinnings of this surprising segregation dynamics is due to charge transfer and electrostatic interactions rather than surface energy considerations.
△ Less
Submitted 10 January, 2024;
originally announced January 2024.
-
Atomic scale understanding of initial Cu-Ni oxidation from machine-learning accelerated first-principles simulations and in situ TEM experiments
Authors:
Pandu Wisesa,
Meng Li,
Matthew T. Curnan,
Jeong Woo Han,
Judith C. Yang,
Wissam A. Saidi
Abstract:
The development of accurate methods for determining how alloy surfaces spontaneously restructure under reactive and corrosive environments is a key, long-standing, grand challenge in materials science. Current oxidation models, such as Cabrera-Mott, are based on macroscopic empirical knowledge that lacks fundamental insight at the atomic level. Using machine learning-accelerated density functional…
▽ More
The development of accurate methods for determining how alloy surfaces spontaneously restructure under reactive and corrosive environments is a key, long-standing, grand challenge in materials science. Current oxidation models, such as Cabrera-Mott, are based on macroscopic empirical knowledge that lacks fundamental insight at the atomic level. Using machine learning-accelerated density functional theory with in situ environmental transmission electron microscopy (ETEM), we examine the interplay between surface reconstructions and preferential segregation tendencies of CuNi(100) surfaces under oxidation conditions. Our modeling approach based on molecular dynamics and grand canonical Monte Carlo simulations shows that oxygen-induced Ni segregation in CuNi alloy favors Cu(100)-O c(2x2) reconstruction and destabilizes the Cu(100)-O missing row reconstruction. The underpinnings of these stabilization tendencies are rationalized based on the similar atomic coordination and bond lengths in NiO rock salt and Cu(100)-O c(2x2) structures. In situ ETEM experiments show Ni segregation followed by NiO nucleation and growth in regions without MRR, with secondary nucleation and growth of Cu2O in MRR regions. This further corroborates the simulated surface oxidation and segregation modelling outcomes. Our findings are general and are expected to extend to other alloy systems.
△ Less
Submitted 22 August, 2023;
originally announced August 2023.
-
A Tractable Handoff-aware Rate Outage Approximation with Applications to THz-enabled Vehicular Network Optimization
Authors:
Mohammad Amin Saeidi,
Haider Shoaib,
Hina Tabassum
Abstract:
In this paper, we first develop a tractable mathematical model of the handoff (HO)-aware rate outage experienced by a typical connected and autonomous vehicle (CAV) in a given THz vehicular network. The derived model captures the impact of line-of-sight (LOS) Nakagami-m fading channels, interference, and molecular absorption effects. We first derive the statistics of the interference-plus-molecula…
▽ More
In this paper, we first develop a tractable mathematical model of the handoff (HO)-aware rate outage experienced by a typical connected and autonomous vehicle (CAV) in a given THz vehicular network. The derived model captures the impact of line-of-sight (LOS) Nakagami-m fading channels, interference, and molecular absorption effects. We first derive the statistics of the interference-plus-molecular absorption noise ratio and demonstrate that it can be approximated by Gamma distribution using Welch-Satterthwaite approximation. Then, we show that the distribution of signal-to-interference-plus-molecular absorption noise ratio (SINR) follows a generalized Beta prime distribution. Based on this, a closed-form HO-aware rate outage expression is derived. Finally, we formulate and solve a CAVs' traffic flow maximization problem to optimize the base-stations (BSs) density and speed of CAVs with collision avoidance, rate outage, and CAVs' minimum traffic flow constraint. The CAVs' traffic flow is modeled using Log-Normal distribution. Our numerical results validate the accuracy of the derived expressions using Monte-Carlo simulations and discuss useful insights related to optimal BS density and CAVs' speed as a function of crash intensity level, THz molecular absorption effects, minimum road-traffic flow and rate requirements, and maximum speed and rate outage limits.
△ Less
Submitted 25 August, 2023; v1 submitted 7 August, 2023;
originally announced August 2023.
-
Resource Allocation and Performance Analysis of Hybrid RSMA-NOMA in the Downlink
Authors:
Mohammad Amin Saeidi,
Hina Tabassum
Abstract:
Rate splitting multiple access (RSMA) and non-orthogonal multiple access (NOMA) are the key enabling multiple access techniques to enable massive connectivity. However, it is unclear whether RSMA would consistently outperform NOMA from a system sum-rate perspective, users' fairness, as well as convergence and feasibility of the resource allocation solutions. This paper investigates the weighted su…
▽ More
Rate splitting multiple access (RSMA) and non-orthogonal multiple access (NOMA) are the key enabling multiple access techniques to enable massive connectivity. However, it is unclear whether RSMA would consistently outperform NOMA from a system sum-rate perspective, users' fairness, as well as convergence and feasibility of the resource allocation solutions. This paper investigates the weighted sum-rate maximization problem to optimize power and rate allocations in a hybrid RSMA-NOMA network. In the hybrid RSMA-NOMA, by optimally allocating the maximum power budget to each scheme, the BS operates on NOMA and RSMA in two orthogonal channels, allowing users to simultaneously receive signals on both RSMA and NOMA. Based on the successive convex approximation (SCA) approach, we jointly optimize the power allocation of users in NOMA and RSMA, the rate allocation of users in RSMA, and the power budget allocation for NOMA and RSMA considering successive interference cancellation (SIC) constraints. Numerical results demonstrate the trade-offs that hybrid RSMA-NOMA access offers in terms of system sum rate, fairness, convergence, and feasibility of the solutions.
△ Less
Submitted 14 June, 2023;
originally announced June 2023.
-
Dynamics of interacting skyrmions in magnetic nano-track
Authors:
W. Al Saidi,
R. Sbiaa,
S. Bhatti,
S. N. Piramanayagam,
S. Al Risi,
O. Al Bahri
Abstract:
Controlling multiple skyrmions in nanowires is important for their implementation in racetrack memory or neuromorphic computing. Here, we report on the dynamical behavior of two interacting skyrmions in confined devices with a comparison to a single skyrmion case. Although the two skyrmions shrink near the edges and follow a helical path, their behavior is different. Because the leading skyrmion i…
▽ More
Controlling multiple skyrmions in nanowires is important for their implementation in racetrack memory or neuromorphic computing. Here, we report on the dynamical behavior of two interacting skyrmions in confined devices with a comparison to a single skyrmion case. Although the two skyrmions shrink near the edges and follow a helical path, their behavior is different. Because the leading skyrmion is between the edge and the trailing one, its size is reduced further and collapses at a lower current density compared to the single skyrmion case. For higher current density, both skyrmions are annihilated with a core-collapse mechanism for the leading one followed by a bubble-collapse mechanism for the trailing one.
△ Less
Submitted 9 February, 2023;
originally announced February 2023.
-
Stabilizing skyrmions in stepped magnetic devices for multistate memory
Authors:
W. Al Saidi,
R. Sbiaa,
S. Al Risi,
F. Al Shanfari,
N. Tiercelin,
Y. Dusch
Abstract:
The dynamics and stability of magnetic skyrmions within a nano-track with multiple confinements are investigated. Firstly, the motion of a single skyrmion under spin transfer torque (STT) is studied. By accurately adjusting the current pulse magnitude and width, the study reveals the possibility to pin and stabilize the skyrmion in each confinement. Due to the Hall angle, the depining of the skyrm…
▽ More
The dynamics and stability of magnetic skyrmions within a nano-track with multiple confinements are investigated. Firstly, the motion of a single skyrmion under spin transfer torque (STT) is studied. By accurately adjusting the current pulse magnitude and width, the study reveals the possibility to pin and stabilize the skyrmion in each confinement. Due to the Hall angle, the depining of the skyrmion from the top confinement requires two pulses with adjustable time delay while a single pulse is enough to depin it for the case of bottom confinement. In the case of two skyrmions, once one is pinned in one confinement, the second one stabilizes in the nearest available empty state and no more than one skyrmion could be seen in single confinement. Finally and for further confirmation of this behavior, the motion of a large number of skyrmions is investigated under the same conditions. The results show that a multistate device could be obtained with still the existence of only one skyrmion per state. The skyrmions could be displaced along the nano-track until their annihilation at the end of the device.
△ Less
Submitted 8 February, 2023;
originally announced February 2023.
-
Multi-band Wireless Networks: Architectures, Challenges, and Comparative Analysis
Authors:
Mohammad Amin Saeidi,
Hina Tabassum,
Mohamed-Slim Alouini
Abstract:
This paper presents the vision of multi-band communication networks (MBN) in 6G, where optical and TeraHertz (THz) transmissions will coexist with the conventional radio frequency (RF) spectrum. This paper will first pin-point the fundamental challenges in MBN architectures at the PHYsical (PHY) and Medium Access (MAC) layer, such as unique channel propagation and estimation issues, user offloadin…
▽ More
This paper presents the vision of multi-band communication networks (MBN) in 6G, where optical and TeraHertz (THz) transmissions will coexist with the conventional radio frequency (RF) spectrum. This paper will first pin-point the fundamental challenges in MBN architectures at the PHYsical (PHY) and Medium Access (MAC) layer, such as unique channel propagation and estimation issues, user offloading and resource allocation, multi-band transceiver design and antenna systems, mobility and handoff management, backhauling, etc. We then perform a quantitative performance assessment of the two fundamental MBN architectures, i.e., {stand-alone MBN} and {integrated MBN} considering critical factors like achievable rate, and capital/operational deployment cost. {Our results show that stand-alone deployment is prone to higher capital and operational expenses for a predefined data rate requirement. Stand-alone deployment, however, offers flexibility and enables controlling the number of access points in different transmission bands.} In addition, we propose a molecular absorption-aware user offloading metric for MBNs and demonstrate its performance gains over conventional user offloading schemes. Finally, open research directions are presented.
△ Less
Submitted 20 June, 2023; v1 submitted 14 December, 2022;
originally announced December 2022.
-
Investigation of dust grains by optical tweezers for space applications
Authors:
A. Magazzù,
D. Bronte Ciriza,
A. Musolino,
A. Saidi,
P. Polimeno,
M. G. Donato,
A. Foti,
P. G. Gucciardi,
M. A. Iatì R. Saija,
N. Perchiazzi,
A. Rotundi,
L. Folco,
O. M. Maragò
Abstract:
Cosmic dust plays a dominant role in the universe, especially in the formation of stars and planetary systems. Furthermore, the surface of cosmic dust grains is the bench-work where molecular hydrogen and simple organic compounds are formed. We manipulate individual dust particles in water solution by contactless and non-invasive techniques such as standard and Raman tweezers, to characterize thei…
▽ More
Cosmic dust plays a dominant role in the universe, especially in the formation of stars and planetary systems. Furthermore, the surface of cosmic dust grains is the bench-work where molecular hydrogen and simple organic compounds are formed. We manipulate individual dust particles in water solution by contactless and non-invasive techniques such as standard and Raman tweezers, to characterize their response to mechanical effects of light (optical forces and torques) and to determine their mineral compositions. Moreover, we show accurate optical force calculations in the T-matrix formalism highlighting the key role of composition and complex morphology in optical trapping of cosmic dust particles.This opens perspectives for future applications of optical tweezers in curation facilities for sample return missions or in extraterrestrial environments.
△ Less
Submitted 30 September, 2022;
originally announced October 2022.
-
Roadmap for Optical Tweezers
Authors:
Giovanni Volpe,
Onofrio M. Maragò,
Halina Rubinzstein-Dunlop,
Giuseppe Pesce,
Alexander B. Stilgoe,
Giorgio Volpe,
Georgiy Tkachenko,
Viet Giang Truong,
Síle Nic Chormaic,
Fatemeh Kalantarifard,
Parviz Elahi,
Mikael Käll,
Agnese Callegari,
Manuel I. Marqués,
Antonio A. R. Neves,
Wendel L. Moreira,
Adriana Fontes,
Carlos L. Cesar,
Rosalba Saija,
Abir Saidi,
Paul Beck,
Jörg S. Eismann,
Peter Banzer,
Thales F. D. Fernandes,
Francesco Pedaci
, et al. (58 additional authors not shown)
Abstract:
Optical tweezers are tools made of light that enable contactless pushing, trapping, and manipulation of objects ranging from atoms to space light sails. Since the pioneering work by Arthur Ashkin in the 1970s, optical tweezers have evolved into sophisticated instruments and have been employed in a broad range of applications in life sciences, physics, and engineering. These include accurate force…
▽ More
Optical tweezers are tools made of light that enable contactless pushing, trapping, and manipulation of objects ranging from atoms to space light sails. Since the pioneering work by Arthur Ashkin in the 1970s, optical tweezers have evolved into sophisticated instruments and have been employed in a broad range of applications in life sciences, physics, and engineering. These include accurate force and torque measurement at the femtonewton level, microrheology of complex fluids, single micro- and nanoparticle spectroscopy, single-cell analysis, and statistical-physics experiments. This roadmap provides insights into current investigations involving optical forces and optical tweezers from their theoretical foundations to designs and setups. It also offers perspectives for applications to a wide range of research fields, from biophysics to space exploration.
△ Less
Submitted 28 June, 2022;
originally announced June 2022.
-
A Novel Neuromorphic Processors Realization of Spiking Deep Reinforcement Learning for Portfolio Management
Authors:
Seyyed Amirhossein Saeidi,
Forouzan Fallah,
Soroush Barmaki,
Hamed Farbeh
Abstract:
The process of continuously reallocating funds into financial assets, aiming to increase the expected return of investment and minimizing the risk, is known as portfolio management. Processing speed and energy consumption of portfolio management have become crucial as the complexity of their real-world applications increasingly involves high-dimensional observation and action spaces and environmen…
▽ More
The process of continuously reallocating funds into financial assets, aiming to increase the expected return of investment and minimizing the risk, is known as portfolio management. Processing speed and energy consumption of portfolio management have become crucial as the complexity of their real-world applications increasingly involves high-dimensional observation and action spaces and environment uncertainty, which their limited onboard resources cannot offset. Emerging neuromorphic chips inspired by the human brain increase processing speed by up to 1000 times and reduce power consumption by several orders of magnitude. This paper proposes a spiking deep reinforcement learning (SDRL) algorithm that can predict financial markets based on unpredictable environments and achieve the defined portfolio management goal of profitability and risk reduction. This algorithm is optimized forIntel's Loihi neuromorphic processor and provides 186x and 516x energy consumption reduction is observed compared to the competitors, respectively. In addition, a 1.3x and 2.0x speed-up over the high-end processors and GPUs, respectively. The evaluations are performed on cryptocurrency market between 2016 and 2021 the benchmark.
△ Less
Submitted 26 March, 2022;
originally announced March 2022.
-
Convergence Acceleration in Machine Learning Potentials for Atomistic Simulations
Authors:
Dylan Bayerl,
Christopher M. Andolina,
Shyam Dwaraknath,
Wissam A. Saidi
Abstract:
Machine learning potentials (MLPs) for atomistic simulations have an enormous prospective impact on materials modeling, offering orders of magnitude speedup over density functional theory (DFT) calculations without appreciably sacrificing accuracy in the prediction of material properties. However, the generation of large datasets needed for training MLPs is daunting. Herein, we show that MLP-based…
▽ More
Machine learning potentials (MLPs) for atomistic simulations have an enormous prospective impact on materials modeling, offering orders of magnitude speedup over density functional theory (DFT) calculations without appreciably sacrificing accuracy in the prediction of material properties. However, the generation of large datasets needed for training MLPs is daunting. Herein, we show that MLP-based material property predictions converge faster with respect to precision for Brillouin zone integrations than DFT-based property predictions. We demonstrate that this phenomenon is robust across material properties for different metallic systems. Further, we provide statistical error metrics to accurately determine a priori the precision level required of DFT training datasets for MLPs to ensure accelerated convergence of material property predictions, thus significantly reducing the computational expense of MLP development.
△ Less
Submitted 19 January, 2022;
originally announced January 2022.
-
Optical tweezers in a dusty universe
Authors:
P. Polimeno,
A. Magazzu,
M. A. Iati,
R. Saija,
L. Folco,
D. Bronte Ciriza,
M. G. Donato,
A. Foti,
P. G. Gucciardi,
A. Saidi,
C. Cecchi-Pestellini,
A. Jimenez Escobar,
E. Ammannito,
G. Sindoni,
I. Bertini,
V. Della Corte,
L. Inno,
A. Ciaravella,
A. Rotundi,
O. M. Marago
Abstract:
Optical tweezers are powerful tools based on focused laser beams. They are able to trap, manipulate and investigate a wide range of microscopic and nanoscopic particles in different media, such as liquids, air, and vacuum. Key applications of this contactless technique have been developed in many fields. Despite this progress, optical trapping applications to planetary exploration is still to be d…
▽ More
Optical tweezers are powerful tools based on focused laser beams. They are able to trap, manipulate and investigate a wide range of microscopic and nanoscopic particles in different media, such as liquids, air, and vacuum. Key applications of this contactless technique have been developed in many fields. Despite this progress, optical trapping applications to planetary exploration is still to be developed. Here we describe how optical tweezers can be used to trap and characterize extraterrestrial particulate matter. In particular, we exploit light scattering theory in the T-matrix formalism to calculate radiation pressure and optical trapping properties of a variety of complex particles of astrophysical interest. Our results open perspectives in the investigation of extraterrestrial particles on our planet, in controlled laboratory experiments, aiming for space tweezers applications: optical tweezers used to trap and characterize dust particles in space or on planetary bodies surface.
△ Less
Submitted 10 November, 2021;
originally announced November 2021.
-
Atomistic mechanisms of binary alloy surface segregation from nanoseconds to seconds using accelerated dynamics
Authors:
Richard B. Garza,
Jiyoung Lee,
Mai H. Nguyen,
Andrew Garmon,
Danny Perez,
Meng Li,
Judith C. Yang,
Graeme Henkelman,
Wissam A. Saidi
Abstract:
Although the equilibrium composition of many alloy surfaces is well understood, the rate of transient surface segregation during annealing is not known, despite its crucial effect on alloy corrosion and catalytic reactions occurring on overlapping timescales. In this work, CuNi bimetallic alloys representing (100) surface facets are annealed in vacuum using atomistic simulations to observe the eff…
▽ More
Although the equilibrium composition of many alloy surfaces is well understood, the rate of transient surface segregation during annealing is not known, despite its crucial effect on alloy corrosion and catalytic reactions occurring on overlapping timescales. In this work, CuNi bimetallic alloys representing (100) surface facets are annealed in vacuum using atomistic simulations to observe the effect of vacancy diffusion on surface separation. We employ multi-timescale methods to sample the early transient, intermediate, and equilibrium states of slab surfaces during the separation process, including standard MD as well as three methods to perform atomistic, long-time dynamics: parallel trajectory splicing (ParSplice), adaptive kinetic Monte Carlo (AKMC), and kinetic Monte Carlo (KMC). From nanosecond (ns) to second timescales, our multiscale computational methodology can observe rare stochastic events not typically seen with standard MD, closing the gap between computational and experimental timescales for surface segregation. Rapid diffusion of a vacancy to the slab is resolved by all four methods in tens of ns. Stochastic re-entry of vacancies into the subsurface, however, is only seen on the microsecond timescale in the two KMC methods. Kinetic vacancy trapping on the surface and its effect on the segregation rate are discussed. The equilibrium composition profile of CuNi after segregation during annealing is estimated to occur on a timescale of seconds as determined by KMC, a result directly comparable to nanoscale experiments.
△ Less
Submitted 25 October, 2021;
originally announced October 2021.
-
Revisiting Trends in the Exchange Current for Hydrogen Evolution
Authors:
Timothy T. Yang,
Rituja B. Patil,
James R. McKone,
Wissam A. Saidi
Abstract:
Nørskov and collaborators proposed a simple kinetic model to explain the volcano relation for the hydrogen evolution reaction on transition metal surfaces in such that $ j_0= k_0 f(ΔG_H)$ where j_0 is the exchange current density, $f(ΔG_H)$ is a function of the hydrogen adsorption free energy $ΔG_H$ as computed from density functional theory, and $k_0$ is a universal rate constant. Herein, focusin…
▽ More
Nørskov and collaborators proposed a simple kinetic model to explain the volcano relation for the hydrogen evolution reaction on transition metal surfaces in such that $ j_0= k_0 f(ΔG_H)$ where j_0 is the exchange current density, $f(ΔG_H)$ is a function of the hydrogen adsorption free energy $ΔG_H$ as computed from density functional theory, and $k_0$ is a universal rate constant. Herein, focusing on the hydrogen evolution reaction in acidic medium, we revisit the original experimental data and find that the fidelity of this kinetic model can be significantly improved by invoking metal-dependence on $k_0$ such that the logarithm of $k_0$ linearly depends on the absolute value of $ΔG_H$. We further confirm this relationship using additional experimental data points obtained from a critical review of the available literature. Our analyses show that the new model decreases the discrepancy between calculated and experimental exchange current density values by up to four orders of magnitude. Furthermore, we show the model can be further improved using machine learning and statistical inference methods that integrate additional material properties
△ Less
Submitted 9 September, 2021;
originally announced September 2021.
-
Thermal fluctuations and carrier localization induced by dynamic disorder in MAPbI3 described by a first-principles based tight-binding model
Authors:
David J. Abramovitch,
Wissam A. Saidi,
Liang Z. Tan
Abstract:
Halide perovskites are strongly influenced by large amplitude anharmonic lattice fluctuations at room temperature. We develop a tight binding model for dynamically disordered MAPbI$_3$ based on density functional theory (DFT) calculations to calculate electronic structure for finite temperature crystal structures at the length scale of thermal disorder and carrier localization. The model predicts…
▽ More
Halide perovskites are strongly influenced by large amplitude anharmonic lattice fluctuations at room temperature. We develop a tight binding model for dynamically disordered MAPbI$_3$ based on density functional theory (DFT) calculations to calculate electronic structure for finite temperature crystal structures at the length scale of thermal disorder and carrier localization. The model predicts individual Hamiltonian matrix elements and band structures with high accuracy, owing to the inclusion of additional matrix elements and descriptors for non-Coulombic interactions. We apply this model to electronic structure at length and time scales inaccessible to first principles methods, finding an increase in band gap, carrier mass, and the sub-picosecond fluctuations in these quantities with increasing temperature as well as the onset of carrier localization in large supercells induced by thermal disorder at 300 K. We identify the length scale $L^*= 5$ nm as the onset of localization in the electronic structure, associated with associated with decreasing band edge fluctuations, increasing carrier mass, and Rashba splitting approaching zero.
△ Less
Submitted 19 July, 2021; v1 submitted 13 May, 2021;
originally announced May 2021.
-
Optimization of High Entropy Alloy Catalyst for Ammonia Decomposition and Ammonia Synthesis
Authors:
Wissam A. Saidi,
Waseem Shadid,
Götz Veser
Abstract:
The successful synthesis of high entropy alloy (HEA) nanoparticles, a long-sought goal in materials science, opens a new frontier in materials science with applications across catalysis, electronics, structural alloys, and energetic materials. Recently, a Co25Mo45Fe10Ni10Cu10 HEA made of earth-abundant elements was shown to have a high catalytic activity for ammonia decomposition, which rivals tha…
▽ More
The successful synthesis of high entropy alloy (HEA) nanoparticles, a long-sought goal in materials science, opens a new frontier in materials science with applications across catalysis, electronics, structural alloys, and energetic materials. Recently, a Co25Mo45Fe10Ni10Cu10 HEA made of earth-abundant elements was shown to have a high catalytic activity for ammonia decomposition, which rivals that of state-of-the-art, but prohibitively expensive, ruthenium catalyst. Using a computational approach based on first-principles calculations in conjunction with data analytics and machine learning, we build a model to rapidly compute the adsorption energy of H, N, and NHx (x=1,3) species on CoMoFeNiCu alloy surfaces with varied alloy compositions and atomic arrangement. We show that the 25/45 Co/Mo ratio identified experimentally as the most active composition for ammonia decomposition increases the likelihood that the surface adsorbs nitrogen equivalently to that of ruthenium while at the same time interacting moderately strongly with intermediates. Our study underscores the importance of computational modeling and machine learning to identify and optimize HEA alloys across their near-infinite materials design space.
△ Less
Submitted 26 May, 2021; v1 submitted 15 April, 2021;
originally announced April 2021.
-
First-Principles Phonon Quasiparticle Theory Applied to a Strongly Anharmonic Halide Perovskite
Authors:
Terumasa Tadano,
Wissam A. Saidi
Abstract:
Understanding and predicting lattice dynamics in strongly anharmonic crystals is one of the long-standing challenges in condensed matter physics. Here we propose a first-principles method that gives accurate quasiparticle (QP) peaks of the phonon spectrum with strong anharmonic broadening. On top of the conventional first-order self-consistent phonon (SC1) dynamical matrix, the proposed method inc…
▽ More
Understanding and predicting lattice dynamics in strongly anharmonic crystals is one of the long-standing challenges in condensed matter physics. Here we propose a first-principles method that gives accurate quasiparticle (QP) peaks of the phonon spectrum with strong anharmonic broadening. On top of the conventional first-order self-consistent phonon (SC1) dynamical matrix, the proposed method incorporates frequency renormalization effects by the bubble self-energy within the QP approximation. We apply the developed methodology to the strongly anharmonic $α$-CsPbBr$_3$ that displays phonon instability within the harmonic approximation in the whole Brillouin zone. While the SC1 theory significantly underestimates the cubic-to-tetragonal phase transition temperature (\tc) by more than 50\%, we show that our approach yields \tc = 404--423~K, in excellent agreement with the experimental value of 403~K. We also demonstrate that an accurate determination of QP peaks is paramount for quantitative prediction and elucidation of phonon linewidth..
△ Less
Submitted 19 April, 2022; v1 submitted 28 February, 2021;
originally announced March 2021.
-
Weighted Sum-Rate Maximization for Multi-IRS-assisted Full-Duplex Systems with Hardware Impairments
Authors:
Mohammad Amin Saeidi,
Mohammad Javad Emadi,
Hamed Masoumi,
Mohammad Robat Mili,
Derrick Wing Kwan Ng,
Ioannis Krikidis
Abstract:
Smart and reconfigurable wireless communication environments can be established by exploiting well-designed intelligent reflecting surfaces (IRSs) to shape the communication channels. In this paper, we investigate how multiple IRSs affect the performance of multi-user full-duplex communication systems under hardware impairment at each node, wherein the base station (BS) and the uplink users are su…
▽ More
Smart and reconfigurable wireless communication environments can be established by exploiting well-designed intelligent reflecting surfaces (IRSs) to shape the communication channels. In this paper, we investigate how multiple IRSs affect the performance of multi-user full-duplex communication systems under hardware impairment at each node, wherein the base station (BS) and the uplink users are subject to maximum transmission power constraints. Firstly, the uplink-downlink system weighted sum-rate (SWSR) is derived which serves as a system performance metric. Then, we formulate the resource allocation design for the maximization of SWSR as an optimization problem which jointly optimizes the beamforming and the combining vectors at the BS, the transmit powers of the uplink users, and the phase shifts of multiple IRSs. Since the SWSR optimization problem is non-convex, an efficient iterative alternating approach is proposed to obtain a suboptimal solution for the design problem considered and its complexity is also discussed. In particular, we firstly reformulate the main problem into an equivalent weighted minimum mean-square-error form and then transform it into several convex sub-problems which can be analytically solved for given phase shifts. Then, the IRSs phases are optimized via a gradient ascent-based algorithm. Finally, numerical results are presented to clarify how multiple IRSs enhance the performance metric under hardware impairment.
△ Less
Submitted 3 October, 2020;
originally announced October 2020.
-
Response to Comment on "Low-frequency lattice phonons in halide perovskites explain high defect tolerance toward electron-hole recombination"
Authors:
Weibin Chu,
Qijing Zheng,
Oleg V. Prezhdo,
Jin Zhao,
Wissam A. Saidi
Abstract:
Recently we proposed that defect tolerance in the hybrid perovskites is due to their characteristic low-frequency lattice phonon modes that decrease the non-adiabatic coupling and weaken the overlap between the free carrier and defect states [Sci. Adv. 6 7, eaaw7453 (2020)]. Kim and Walsh disagree with the interpretation and argue that there are flaws in our employed methodology. Herein we address…
▽ More
Recently we proposed that defect tolerance in the hybrid perovskites is due to their characteristic low-frequency lattice phonon modes that decrease the non-adiabatic coupling and weaken the overlap between the free carrier and defect states [Sci. Adv. 6 7, eaaw7453 (2020)]. Kim and Walsh disagree with the interpretation and argue that there are flaws in our employed methodology. Herein we address their concerns and show that their conclusions are not valid due to misunderstandings of nonadiabatic transition.
△ Less
Submitted 26 April, 2020;
originally announced April 2020.
-
Optimization and Validation of a Deep Learning CuZr Atomistic Potential: Robust Applications for Crystalline and Amorphous Phases with near-DFT Accuracy
Authors:
Christopher M. Andolina,
Philip Williamson,
Wissam A. Saidi
Abstract:
We show that a deep-learning neural network potential (DP) based on density functional theory (DFT) calculations can well describe Cu-Zr materials, an example of a binary alloy system that can coexist in several ordered intermetallics and as an amorphous phase. The complex phase diagram for Cu-Zr makes it a challenging system for traditional atomistic force-fields that fail to describe well the di…
▽ More
We show that a deep-learning neural network potential (DP) based on density functional theory (DFT) calculations can well describe Cu-Zr materials, an example of a binary alloy system that can coexist in several ordered intermetallics and as an amorphous phase. The complex phase diagram for Cu-Zr makes it a challenging system for traditional atomistic force-fields that fail to describe well the different properties and phases. Instead, we show that a DP approach using a large database with ~300k configurations can render results generally on par with DFT. The training set includes configurations of pristine and bulk elementary metals and intermetallics in the liquid and solid phases in addition to slab and amorphous configurations. The DP model was validated by comparing bulk properties such as lattice constants, elastic constants, bulk moduli, phonon spectra, surface energies to DFT values for identical structures. Further, we contrast the DP results with values obtained using well-established two embedded atom method potentials. Overall, our DP potential provides near DFT accuracy for the different Cu-Zr phases but with a fraction of its computational cost, thus enabling accurate computations of realistic atomistic models especially for the amorphous phase.
△ Less
Submitted 16 February, 2020;
originally announced February 2020.
-
Negative Capacitance Ion-Sensitive Field-Effect Transistors with improved current sensitivity
Authors:
Francesco Bellando,
Ali Saeidi,
Adrian M. Ionescu
Abstract:
Ion-Sensitive Field-Effect Transistors (ISFETs) form a wide-spread technology for sensing, thanks to their label-free detection and intrinsic CMOS compatibility. Their current sensitivity, ΔID/ID, for a given ΔpH, however, is limited by the thermionic limit for the Subthreshold Slope (SS) of Metal-Oxide-Semiconductor Field-Effect Transistors(MOSFET) and by the Nernst limit. Obtaining ISFETs with a…
▽ More
Ion-Sensitive Field-Effect Transistors (ISFETs) form a wide-spread technology for sensing, thanks to their label-free detection and intrinsic CMOS compatibility. Their current sensitivity, ΔID/ID, for a given ΔpH, however, is limited by the thermionic limit for the Subthreshold Slope (SS) of Metal-Oxide-Semiconductor Field-Effect Transistors(MOSFET) and by the Nernst limit. Obtaining ISFETs with a steep slope transfer characteristics is extremely challenging. In this paper we combine the merits of traditional ISFETs with the performance boosts offered by the insertion of a Negative Capacitor in series with the Gate contact. In the proposed tests with NC PZT capacitors, we demonstrate experimentally a reduction of the SS by 44%, combined with a current efficiency improvement of more than two times. As a consequence of the steeper SS, the current sensitivity to pH is improved by 78%.
△ Less
Submitted 26 June, 2019;
originally announced June 2019.
-
Applications of Multi-view Learning Approaches for Software Comprehension
Authors:
Amir Saeidi,
Jurriaan Hage,
Ravi Khadka,
Slinger Jansen
Abstract:
Program comprehension concerns the ability of an individual to make an understanding of an existing software system to extend or transform it. Software systems comprise of data that are noisy and missing, which makes program understanding even more difficult. A software system consists of various views including the module dependency graph, execution logs, evolutionary information and the vocabula…
▽ More
Program comprehension concerns the ability of an individual to make an understanding of an existing software system to extend or transform it. Software systems comprise of data that are noisy and missing, which makes program understanding even more difficult. A software system consists of various views including the module dependency graph, execution logs, evolutionary information and the vocabulary used in the source code, that collectively defines the software system. Each of these views contain unique and complementary information; together which can more accurately describe the data. In this paper, we investigate various techniques for combining different sources of information to improve the performance of a program comprehension task. We employ state-of-the-art techniques from learning to 1) find a suitable similarity function for each view, and 2) compare different multi-view learning techniques to decompose a software system into high-level units and give component-level recommendations for refactoring of the system, as well as cross-view source code search. The experiments conducted on 10 relatively large Java software systems show that by fusing knowledge from different views, we can guarantee a lower bound on the quality of the modularization and even improve upon it. We proceed by integrating different sources of information to give a set of high-level recommendations as to how to refactor the software system. Furthermore, we demonstrate how learning a joint subspace allows for performing cross-modal retrieval across views, yielding results that are more aligned with what the user intends by the query. The multi-view approaches outlined in this paper can be employed for addressing problems in software engineering that can be encoded in terms of a learning problem, such as software bug prediction and feature location.
△ Less
Submitted 1 February, 2019;
originally announced February 2019.
-
Bargmann transform and generalized heat Cauchy problems
Authors:
Anouar Abdelmajid Saidi,
Ahmed Yahya Mahmoud,
Mohamed Vall Ould Moustapha
Abstract:
In this article we solve explicitly some Cauchy problems of the heat type attached to the generalized real and complex Dirac, Euler and Harmonic oscillator operators. Our principal tool is the Bargmann transform.
In this article we solve explicitly some Cauchy problems of the heat type attached to the generalized real and complex Dirac, Euler and Harmonic oscillator operators. Our principal tool is the Bargmann transform.
△ Less
Submitted 15 July, 2019; v1 submitted 17 December, 2018;
originally announced December 2018.
-
End-to-end Symmetry Preserving Inter-atomic Potential Energy Model for Finite and Extended Systems
Authors:
Linfeng Zhang,
Jiequn Han,
Han Wang,
Wissam A. Saidi,
Roberto Car,
Weinan E
Abstract:
Machine learning models are changing the paradigm of molecular modeling, which is a fundamental tool for material science, chemistry, and computational biology. Of particular interest is the inter-atomic potential energy surface (PES). Here we develop Deep Potential - Smooth Edition (DeepPot-SE), an end-to-end machine learning-based PES model, which is able to efficiently represent the PES for a w…
▽ More
Machine learning models are changing the paradigm of molecular modeling, which is a fundamental tool for material science, chemistry, and computational biology. Of particular interest is the inter-atomic potential energy surface (PES). Here we develop Deep Potential - Smooth Edition (DeepPot-SE), an end-to-end machine learning-based PES model, which is able to efficiently represent the PES for a wide variety of systems with the accuracy of ab initio quantum mechanics models. By construction, DeepPot-SE is extensive and continuously differentiable, scales linearly with system size, and preserves all the natural symmetries of the system. Further, we show that DeepPot-SE describes finite and extended systems including organic molecules, metals, semiconductors, and insulators with high fidelity.
△ Less
Submitted 20 December, 2018; v1 submitted 23 May, 2018;
originally announced May 2018.
-
Negative Capacitance as Digital and Analog Performance Booster for Complementary MOS Transistors
Authors:
Ali Saeidi,
Farzan Jazaeri,
Igor Stolichnov,
Christian C. Enz,
Adrian M. ionescu
Abstract:
Boltzmann tyranny poses a fundamental limit to lowering the energy dissipation of conventional MOS devices, a minimum increase of the gate voltage, i.e. 60 mV, is required for a 10-fold increase in drain-to-source current at 300 K. Negative Capacitance (NC) in ferroelectric materials is proposed in order to address this physical limitation of CMOS technology. A polarization destabilization in ferr…
▽ More
Boltzmann tyranny poses a fundamental limit to lowering the energy dissipation of conventional MOS devices, a minimum increase of the gate voltage, i.e. 60 mV, is required for a 10-fold increase in drain-to-source current at 300 K. Negative Capacitance (NC) in ferroelectric materials is proposed in order to address this physical limitation of CMOS technology. A polarization destabilization in ferroelectrics causes an effective negative permittivity, resulting in a differential voltage amplification and a reduced subthreshold swing when integrated into the gate stack of a transistor. Recent demonstrations of negative capacitance concerned mainly n-type MOSFETs and their subthreshold slope. An effective technology booster should be capable of improving the performance of both n- and p-type transistors. In this work, we report a significant enhancement in both digital (subthreshold swing, on-current over off-current ratio, and overdrive) and analog (transconductance and current efficiency factor) FoM of commercial 28nm CMOS process by exploiting a PZT capacitor as the negative capacitance booster. Accordingly, a sub-thermal swing down to 10 mV/decade together with an enhanced current efficiency factor up to 10$^5$ V$^{-1}$ is obtained in both n- and p-type MOSFETs at room temperature. The overdrive voltage is enhanced up to 0.45 V, leading to a supply voltage reduction of 50\%.
△ Less
Submitted 27 April, 2018; v1 submitted 25 April, 2018;
originally announced April 2018.
-
Brenier approach for optimal transportation between a quasi-discrete measure and a discrete measure
Authors:
Ying Lu,
Liming Chen,
Alexandre Saidi,
Xianfeng Gu
Abstract:
Correctly estimating the discrepancy between two data distributions has always been an important task in Machine Learning. Recently, Cuturi proposed the Sinkhorn distance which makes use of an approximate Optimal Transport cost between two distributions as a distance to describe distribution discrepancy. Although it has been successfully adopted in various machine learning applications (e.g. in Na…
▽ More
Correctly estimating the discrepancy between two data distributions has always been an important task in Machine Learning. Recently, Cuturi proposed the Sinkhorn distance which makes use of an approximate Optimal Transport cost between two distributions as a distance to describe distribution discrepancy. Although it has been successfully adopted in various machine learning applications (e.g. in Natural Language Processing and Computer Vision) since then, the Sinkhorn distance also suffers from two unnegligible limitations. The first one is that the Sinkhorn distance only gives an approximation of the real Wasserstein distance, the second one is the `divide by zero' problem which often occurs during matrix scaling when setting the entropy regularization coefficient to a small value. In this paper, we introduce a new Brenier approach for calculating a more accurate Wasserstein distance between two discrete distributions, this approach successfully avoids the two limitations shown above for Sinkhorn distance and gives an alternative way for estimating distribution discrepancy.
△ Less
Submitted 17 January, 2018;
originally announced January 2018.
-
Optimal Transport for Deep Joint Transfer Learning
Authors:
Ying Lu,
Liming Chen,
Alexandre Saidi
Abstract:
Training a Deep Neural Network (DNN) from scratch requires a large amount of labeled data. For a classification task where only small amount of training data is available, a common solution is to perform fine-tuning on a DNN which is pre-trained with related source data. This consecutive training process is time consuming and does not consider explicitly the relatedness between different source an…
▽ More
Training a Deep Neural Network (DNN) from scratch requires a large amount of labeled data. For a classification task where only small amount of training data is available, a common solution is to perform fine-tuning on a DNN which is pre-trained with related source data. This consecutive training process is time consuming and does not consider explicitly the relatedness between different source and target tasks.
In this paper, we propose a novel method to jointly fine-tune a Deep Neural Network with source data and target data. By adding an Optimal Transport loss (OT loss) between source and target classifier predictions as a constraint on the source classifier, the proposed Joint Transfer Learning Network (JTLN) can effectively learn useful knowledge for target classification from source data. Furthermore, by using different kind of metric as cost matrix for the OT loss, JTLN can incorporate different prior knowledge about the relatedness between target categories and source categories.
We carried out experiments with JTLN based on Alexnet on image classification datasets and the results verify the effectiveness of the proposed JTLN in comparison with standard consecutive fine-tuning. This Joint Transfer Learning with OT loss is general and can also be applied to other kind of Neural Networks.
△ Less
Submitted 9 September, 2017;
originally announced September 2017.
-
On the Effect of Semantically Enriched Context Models on Software Modularization
Authors:
Amir Saeidi,
Jurriaan Hage,
Ravi Khadka,
Slinger Jansen
Abstract:
Many of the existing approaches for program comprehension rely on the linguistic information found in source code, such as identifier names and comments. Semantic clustering is one such technique for modularization of the system that relies on the informal semantics of the program, encoded in the vocabulary used in the source code. Treating the source code as a collection of tokens loses the seman…
▽ More
Many of the existing approaches for program comprehension rely on the linguistic information found in source code, such as identifier names and comments. Semantic clustering is one such technique for modularization of the system that relies on the informal semantics of the program, encoded in the vocabulary used in the source code. Treating the source code as a collection of tokens loses the semantic information embedded within the identifiers. We try to overcome this problem by introducing context models for source code identifiers to obtain a semantic kernel, which can be used for both deriving the topics that run through the system as well as their clustering. In the first model, we abstract an identifier to its type representation and build on this notion of context to construct contextual vector representation of the source code. The second notion of context is defined based on the flow of data between identifiers to represent a module as a dependency graph where the nodes correspond to identifiers and the edges represent the data dependencies between pairs of identifiers. We have applied our approach to 10 medium-sized open source Java projects, and show that by introducing contexts for identifiers, the quality of the modularization of the software systems is improved. Both of the context models give results that are superior to the plain vector representation of documents. In some cases, the authoritativeness of decompositions is improved by 67%. Furthermore, a more detailed evaluation of our approach on JEdit, an open source editor, demonstrates that inferred topics through performing topic analysis on the contextual representations are more meaningful compared to the plain representation of the documents. The proposed approach in introducing a context model for source code identifiers paves the way for building tools that support developers in program comprehension tasks such as application and domain concept location, software modularization and topic analysis.
△ Less
Submitted 4 August, 2017;
originally announced August 2017.
-
Temperature Dependence of the Energy Levels of Methylammonium Lead Iodide Perovskite from First Principles
Authors:
Wissam A. Saidi,
Samuel Poncé,
Bartomeu Monserrat
Abstract:
Environmental effects and intrinsic energy-loss processes lead to fluctuations in the operational temperature of solar cells, which can profoundly influence their power conversion efficiency. Here we determine from first principles the effects of temperature on the band gap and band edges of the hybrid pervoskite CH$_3$NH$_3$PbI$_3$ by accounting for electron-phonon coupling and thermal expansion.…
▽ More
Environmental effects and intrinsic energy-loss processes lead to fluctuations in the operational temperature of solar cells, which can profoundly influence their power conversion efficiency. Here we determine from first principles the effects of temperature on the band gap and band edges of the hybrid pervoskite CH$_3$NH$_3$PbI$_3$ by accounting for electron-phonon coupling and thermal expansion. From $290$ to $380$ K, the computed band gap change of $40$ meV coincides with the experimental change of $30$-$40$ meV. The calculation of electron-phonon coupling in CH$_3$NH$_3$PbI$_3$ is particularly intricate, as the commonly used Allen-Heine-Cardona theory overestimates the band gap change with temperature, and excellent agreement with experiment is only obtained when including high-order terms in the electron-phonon interaction. We also find that spin-orbit coupling enhances the electron-phonon coupling strength, but that the inclusion of nonlocal correlations using hybrid functionals has little effect. We reach similar conclusions in the metal-halide perovskite CsPbI$_3$. Our results unambiguously confirm for the first time the importance of high-order terms in the electron-phonon coupling by direct comparison with experiment.
△ Less
Submitted 5 December, 2016;
originally announced December 2016.
-
Modiffied Schottky emission to explain thickness dependence and slow depolarization in BaTiO$_3$ nanowires
Authors:
Y. Qi,
J. M. P. Martirez,
Wissam A. Saidi,
J. J. Urban,
W. S. Yun,
J. E. Spanier,
A. M. Rappe
Abstract:
We investigate the origin of the depolarization rates in ultrathin adsorbate-stabilized ferroelectric wires. By applying density functional theory calculations and analytic modeling, we demonstrate that the depolarization results from the leakage of charges stored at the surface adsorbates, which play an important role in the polarization stabilization. The depolarization speed varies with thickne…
▽ More
We investigate the origin of the depolarization rates in ultrathin adsorbate-stabilized ferroelectric wires. By applying density functional theory calculations and analytic modeling, we demonstrate that the depolarization results from the leakage of charges stored at the surface adsorbates, which play an important role in the polarization stabilization. The depolarization speed varies with thickness and temperature, following several complex trends. A comprehensive physical model is presented, in which quantum tunneling, Schottky emission and temperature dependent electron mobility are taken into consideration. This model simulates experimental results, validating the physical mechanism. We also expect that this improved tunneling-Schottky emission model could be applied to predict the retention time of polarization and the leakage current for various ferroelectric materials with different thicknesses and temperatures.
△ Less
Submitted 13 February, 2015;
originally announced February 2015.
-
Weighted rooted trees and deformations of operads
Authors:
Abdellatif Saïdi
Abstract:
We will define an operad $\mathcal{B}^0$ on planar rooted trees. $\mathcal{B}^0$ is analgous to the $NAP$-operad in the non-planar tree setting. We will define a family of "current-preserving" operads $\mathcal{B}^λ$ depending on a scalar parameter $λ$, which can be seen as a deformation of the operad $\mathcal{B}^0$. Forgetting the extra "current preserving" notion above give back the Brace opera…
▽ More
We will define an operad $\mathcal{B}^0$ on planar rooted trees. $\mathcal{B}^0$ is analgous to the $NAP$-operad in the non-planar tree setting. We will define a family of "current-preserving" operads $\mathcal{B}^λ$ depending on a scalar parameter $λ$, which can be seen as a deformation of the operad $\mathcal{B}^0$. Forgetting the extra "current preserving" notion above give back the Brace operad for $λ=1$ and the $\mathcal{B}^0$ operad for $λ=0$. A natural map from non-planar rooted trees to plane ones gives back the current-preserving interpolation between $NAP$ and pre-Lie investigated in a previous article.
△ Less
Submitted 27 May, 2014;
originally announced May 2014.
-
The best decay rate of the damped plate equation in a square
Authors:
Kaïs Ammari,
Abdelkader Saïdi
Abstract:
In this paper we study the best decay rate of the solutions of a damped plate equation in a square and with a homogeneous Dirichlet boundary conditions. We show that the fastest decay rate is given by the supremum of the real part of the spectrum of the infinitesimal generator of the underlying semigroup, if the damping coefficient is in $L^\infty(Ω).$ Moreover, we give some numerical illustration…
▽ More
In this paper we study the best decay rate of the solutions of a damped plate equation in a square and with a homogeneous Dirichlet boundary conditions. We show that the fastest decay rate is given by the supremum of the real part of the spectrum of the infinitesimal generator of the underlying semigroup, if the damping coefficient is in $L^\infty(Ω).$ Moreover, we give some numerical illustrations by spectral computation of the spectrum associated to the damped plate equation. The numerical results obtained for various cases of damping are in a good agreement with theoretical ones. Computation of the spectrum and energy of discrete solution of damped plate show that the best decay rate is given by spectral abscissa of numerical solution.
△ Less
Submitted 13 March, 2014;
originally announced March 2014.
-
Theorical and Numerical Analysis of the Rapid Pointwise Stabilization of Coupled String-Beam Systems
Authors:
Alia Barhoumi,
Abdelkader Saïdi
Abstract:
We consider a pointwise stabilization problem for a coupled wave and plate equations. We prove under rather general assumptions, that such systems can stabilized so as to have arbitrarily high decay rates and are exactly controllable. We propose a numerical approximation of the model and we study numerically the construction of the feedbak law leading to exponential decay with arbtrarily large rat…
▽ More
We consider a pointwise stabilization problem for a coupled wave and plate equations. We prove under rather general assumptions, that such systems can stabilized so as to have arbitrarily high decay rates and are exactly controllable. We propose a numerical approximation of the model and we study numerically the construction of the feedbak law leading to exponential decay with arbtrarily large rate.
△ Less
Submitted 24 May, 2011;
originally announced May 2011.
-
Unconditionnally stable scheme for Riccati equation
Authors:
François Dubois,
Abdelkader Saïdi
Abstract:
We present a numerical scheme for the resolution of matrix Riccati equation used in control problems. The scheme is unconditionnally stable and the solution is definite positive at each time step of the resolution. We prove the convergence in the scalar case and present several numerical experiments for classical test cases.
We present a numerical scheme for the resolution of matrix Riccati equation used in control problems. The scheme is unconditionnally stable and the solution is definite positive at each time step of the resolution. We prove the convergence in the scalar case and present several numerical experiments for classical test cases.
△ Less
Submitted 21 January, 2011;
originally announced January 2011.
-
Homographic scheme for Riccati equation
Authors:
François Dubois,
Abdelkader Saïdi
Abstract:
In this paper we present a numerical scheme for the resolution of matrix Riccati equation, usualy used in control problems. The scheme is unconditionnaly stable and the solution is definite positive at each time step of the resolution. We prove the convergence in the scalar case and present several numerical experiments for classical test cases.
In this paper we present a numerical scheme for the resolution of matrix Riccati equation, usualy used in control problems. The scheme is unconditionnaly stable and the solution is definite positive at each time step of the resolution. We prove the convergence in the scalar case and present several numerical experiments for classical test cases.
△ Less
Submitted 8 May, 2011; v1 submitted 11 January, 2011;
originally announced January 2011.
-
The pre-Lie operad as a deformation of NAP
Authors:
Abdellatif Saidi
Abstract:
We define a family of multigraded operads $O_λ$ depending on a scalar parameter, such that forgetting the multigraduation gives back the pre-Lie operad when the parameter $λ$ is equal to one, and the NAP operad governing Non-Associative Permutative algebras when $λ$ is equal to zero.
We define a family of multigraded operads $O_λ$ depending on a scalar parameter, such that forgetting the multigraduation gives back the pre-Lie operad when the parameter $λ$ is equal to one, and the NAP operad governing Non-Associative Permutative algebras when $λ$ is equal to zero.
△ Less
Submitted 19 November, 2010;
originally announced November 2010.
-
On covariant functions and distributions under the action of a compact group
Authors:
Anouar Saidi
Abstract:
Let $G$ be a compact subgroup of $GL_n(\R)$ acting linearly on a finite dimensional vector space $E$. B. Malgrange has shown that the space $\mathcal{C}^\infty(\R^n,E)^G$ of $\mathcal{C}^\infty$ and $G$-covariant functions is a finite module over the ring $\mathcal{C}^\infty(\R^n)^G$ of $\mathcal{C}^\infty$ and $G$-invariant functions. First, we generalize this result for the Schwartz space…
▽ More
Let $G$ be a compact subgroup of $GL_n(\R)$ acting linearly on a finite dimensional vector space $E$. B. Malgrange has shown that the space $\mathcal{C}^\infty(\R^n,E)^G$ of $\mathcal{C}^\infty$ and $G$-covariant functions is a finite module over the ring $\mathcal{C}^\infty(\R^n)^G$ of $\mathcal{C}^\infty$ and $G$-invariant functions. First, we generalize this result for the Schwartz space $\mathscr{S}(\R^n,E)^G$ of $G$-covariant functions. Secondly, we prove that any $G$-covariant distribution can be decomposed into a sum of $G$-invariant distributions multiplied with a fixed family of $G$-covariant polynomials. This gives a generalization of an Oksak result proved in ([O]).
△ Less
Submitted 9 February, 2009;
originally announced February 2009.
-
Lois pré-Lie en interaction
Authors:
Dominique Manchon,
Abdellatif Saidi
Abstract:
D. Calaque, K. Ebrahimi-Fard and D. Manchon have recently defined a Hopf algebra by introducing a new coproduct on a commutative algebra of rooted forests. The space of primitive elements of the graded dual is endowed with a left pre-Lie product defined in terms of insertion of a tree inside another. In this work we prove a ``derivation'' relation between this pre-Lie structure and the left pre-…
▽ More
D. Calaque, K. Ebrahimi-Fard and D. Manchon have recently defined a Hopf algebra by introducing a new coproduct on a commutative algebra of rooted forests. The space of primitive elements of the graded dual is endowed with a left pre-Lie product defined in terms of insertion of a tree inside another. In this work we prove a ``derivation'' relation between this pre-Lie structure and the left pre-Lie product defined by grafting.
△ Less
Submitted 7 July, 2009; v1 submitted 13 November, 2008;
originally announced November 2008.