Search | arXiv e-print repository

Super-ballistic transport in an open quantum ring

Authors: Moumita Patra, Bijay Kumar Agarwalla, Santanu K. Maiti

Abstract: When the degeneracies of the ring-Hamiltonian are removed by the asymmetric ring-to-electrodes configuration for an open quantum ring (OQR), the overall junction transmission function exhibits fano-type antiresonance, resulting a net circular current appears within the channel, that is the ring around the degenerate energy levels of the ring-Hamiltonian. We investigate the system size scaling prop… ▽ More When the degeneracies of the ring-Hamiltonian are removed by the asymmetric ring-to-electrodes configuration for an open quantum ring (OQR), the overall junction transmission function exhibits fano-type antiresonance, resulting a net circular current appears within the channel, that is the ring around the degenerate energy levels of the ring-Hamiltonian. We investigate the system size scaling properties of the channel conductance and the overall junction conductance of an OQR. Ballistic transport is the unhindered flow of a charge carrier within a conductor. Here we find beyond-ballistic transport near both the degenerate and non-degenerate eigenenergies of the ring-Hamiltonian, depending on the ring-to-lead configuration. This is a purely OQR phenomenon associated with the quantum interference effect between two counter-propagating electronic waves having nearly equal and opposite momenta. Thus there is no equivalent phenomenon in open quantum junctions with linear channel. △ Less

Submitted 8 July, 2024; originally announced July 2024.

Comments: 9 pages, 10 figures

arXiv:2407.00837 [pdf, other]

Towards Robust Speech Representation Learning for Thousands of Languages

Authors: William Chen, Wangyou Zhang, Yifan Peng, Xinjian Li, Jinchuan Tian, Jiatong Shi, Xuankai Chang, Soumi Maiti, Karen Livescu, Shinji Watanabe

Abstract: Self-supervised learning (SSL) has helped extend speech technologies to more languages by reducing the need for labeled data. However, models are still far from supporting the world's 7000+ languages. We propose XEUS, a Cross-lingual Encoder for Universal Speech, trained on over 1 million hours of data across 4057 languages, extending the language coverage of SSL models 4-fold. We combine 1 millio… ▽ More Self-supervised learning (SSL) has helped extend speech technologies to more languages by reducing the need for labeled data. However, models are still far from supporting the world's 7000+ languages. We propose XEUS, a Cross-lingual Encoder for Universal Speech, trained on over 1 million hours of data across 4057 languages, extending the language coverage of SSL models 4-fold. We combine 1 million hours of speech from existing publicly accessible corpora with a newly created corpus of 7400+ hours from 4057 languages, which will be publicly released. To handle the diverse conditions of multilingual speech data, we augment the typical SSL masked prediction approach with a novel dereverberation objective, increasing robustness. We evaluate XEUS on several benchmarks, and show that it consistently outperforms or achieves comparable results to state-of-the-art (SOTA) SSL models across a variety of tasks. XEUS sets a new SOTA on the ML-SUPERB benchmark: it outperforms MMS 1B and w2v-BERT 2.0 v2 by 0.8% and 4.4% respectively, despite having less parameters or pre-training data. Checkpoints, code, and data are found in https://www.wavlab.org/activities/2024/xeus/. △ Less

Submitted 2 July, 2024; v1 submitted 30 June, 2024; originally announced July 2024.

Comments: Updated affiliations; 20 pages

arXiv:2406.08219 [pdf, ps, other]

Impact of environmental interaction on bias induced circular current in a ring nanojunction

Authors: Moumita Mondal, Santanu K. Maiti

Abstract: The specific role of environmental interaction on bias driven circular current in a ring nanojunction is explored, for the first time to the best of our concern, within a tight-binding framework based on wave-guide theory. The environmental interaction is implemented through disorder in backbone sites where these sites are directly coupled to parent lattice sites of the ring via single bonds. In a… ▽ More The specific role of environmental interaction on bias driven circular current in a ring nanojunction is explored, for the first time to the best of our concern, within a tight-binding framework based on wave-guide theory. The environmental interaction is implemented through disorder in backbone sites where these sites are directly coupled to parent lattice sites of the ring via single bonds. In absence of backbone disorder circular current becomes zero for a lengthwise symmetric nanojunction, while it increases with disorder which is quite unusual, and after reaching a maximum it eventually drops to zero in the limit of high disorder. The effects of ring-electrode interface configuration, ring-backbone coupling, different types of backbone disorder and system temperature are critically investigated to make the present analysis comprehensive. All the studied results are valid for a broad range of physical parameters, giving us confidence that the outcomes of this theoretical work can be verified in a laboratory. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: 7 pages, 9 figures (comments are welcome)

arXiv:2405.13619 [pdf]

Drastic modification in thermal conductivity of TiCoSb Half-Heusler alloy: Phonon engineering by lattice softening and ionic polarization

Authors: S. Mahakal, Avijit Jana, Diptasikha Das, Nabakumar Rana, Pallabi Sardar, Aritra Banerjee, Shamima Hussain, Santanu K. Maiti, K. Malik

Abstract: A drastic variation in thermal conductivity (\k{appa}) for synthesized samples (TiCoSb1+x, x=0.0, 0.01, 0.02, 0.03, 0.04, and 0.06) is observed and ~47% reduction in \k{appa} is reported for TiCoSb1.02 sample. In depth structural analysis is performed, employing mixed-phase Rietveld refinement technique. Embedded phases and vacancy are analyzed from X-ray diffraction (XRD) and Scanning electron mi… ▽ More A drastic variation in thermal conductivity (\k{appa}) for synthesized samples (TiCoSb1+x, x=0.0, 0.01, 0.02, 0.03, 0.04, and 0.06) is observed and ~47% reduction in \k{appa} is reported for TiCoSb1.02 sample. In depth structural analysis is performed, employing mixed-phase Rietveld refinement technique. Embedded phases and vacancy are analyzed from X-ray diffraction (XRD) and Scanning electron microscopy data. Local structures of the synthesized samples are explored for the first time by X-ray absorption spectroscopy measurements for TiCoSb system and corroborated with Rietveld refinement data. Lattice dynamics are revealed using Raman Spectroscopy (RS) measurements in unprecedented attempts for TiCoSb system. XRD and RS data accomplishes that variation in \k{appa} as a function of Sb concentration is observed owing to an alteration in phonon group velocity related to lattice softening. Polar nature of TiCoSb HH sample is revealed. LO-TO splitting (related to polar optical phonon scattering) in phonon vibration is observed due to polar nature of TiCoSb synthesized samples. Tailoring in LO-TO splitting due to screening effect, correlated with Co vacancies is reported for TiCoSb1+x synthesized samples. Lattice softening and LO-TO splitting lead to decreases in \k{appa}~47% for TiCoSb1.02 synthesized sample. △ Less

Submitted 22 May, 2024; originally announced May 2024.

Comments: Main article (17 pages, 10 figures), Supplemental article (5 pages, 7 figures), Comments are welcome

arXiv:2403.01916 [pdf]

Enhancing thermoelectric performance of 2D Janus ISbTe by strain engineering: A first principle study

Authors: Anuja Kumari, Abhinav Nag, Santanu K. Maiti, Jagdish Kumar

Abstract: Recent developments in the 2D materials laid emphasis on finding the materials with robust properties for variety of applications including the energy harvesting. The recent discovery of Janus monolayers with broken symmetry has opened up new options for engineering the properties of 2D layered materials. Present study focuses on enhancing thermoelectric properties of 2H-ISbTe 2D Janus monolayer.… ▽ More Recent developments in the 2D materials laid emphasis on finding the materials with robust properties for variety of applications including the energy harvesting. The recent discovery of Janus monolayers with broken symmetry has opened up new options for engineering the properties of 2D layered materials. Present study focuses on enhancing thermoelectric properties of 2H-ISbTe 2D Janus monolayer. All the calculations have been performed using fully relaxed unit cell and employing the pseudo potential based quantum espresso code. Calculated structural parameters are in good agreement with previous literature reports. The lattice dynamics calculations predicts this monolayer can withstand a strain of up to 4% beyond which imaginary frequencies appear in the phonon dispersion curves. Computed electronic structure reveals that the monolayer is an indirect wide bandgap material and the bandgap decreases with tensile strain. Furthermore, the computed thermoelectric properties show that the studied monolayer has high Seebeck coefficient of ~ 300 μみゅーV/K and low thermal conductivity which yields reasonably high ZT of ~ 1.31 for a strain of 2% at 300 K with p-type doping. Therefore, our study signifies the fact that tensile strain and p-type doping of 2D Janus monolayer ISbTe can enhance ZT from 0.87 to 1.31 at room temperature which makes it a promising candidate for thermoelectric applications. △ Less

Submitted 4 March, 2024; originally announced March 2024.

Comments: 18 pages, 8 figures, Comments are Welcome

arXiv:2402.16021 [pdf, other]

TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages

Authors: Minsu Kim, Jee-weon Jung, Hyeongseop Rha, Soumi Maiti, Siddhant Arora, Xuankai Chang, Shinji Watanabe, Yong Man Ro

Abstract: The capability to jointly process multi-modal information is becoming an essential task. However, the limited number of paired multi-modal data and the large computational requirements in multi-modal learning hinder the development. We propose a novel Tri-Modal Translation (TMT) model that translates between arbitrary modalities spanning speech, image, and text. We introduce a novel viewpoint, whe… ▽ More The capability to jointly process multi-modal information is becoming an essential task. However, the limited number of paired multi-modal data and the large computational requirements in multi-modal learning hinder the development. We propose a novel Tri-Modal Translation (TMT) model that translates between arbitrary modalities spanning speech, image, and text. We introduce a novel viewpoint, where we interpret different modalities as different languages, and treat multi-modal translation as a well-established machine translation problem. To this end, we tokenize speech and image data into discrete tokens, which provide a unified interface across modalities and significantly decrease the computational cost. In the proposed TMT, a multi-modal encoder-decoder conducts the core translation, whereas modality-specific processing is conducted only within the tokenization and detokenization stages. We evaluate the proposed TMT on all six modality translation tasks. TMT outperforms single model counterparts consistently, demonstrating that unifying tasks is beneficial not only for practicality but also for performance. △ Less

Submitted 25 February, 2024; originally announced February 2024.

arXiv:2402.05654 [pdf, ps, other]

Bias induced circular current in a loop nanojunction with AAH modulation: Role of hopping dimerization

Authors: Moumita Mondal, Santanu K. Maiti

Abstract: In this work, we put forward, for the first time, the interplay between correlated disorder and hopping dimerization on bias driven circular current in a loop conductor that is clamped between two electrodes. The correlated disorder is introduced in site energies of the ring in the form of Aubry-André-Harper (AAH) model. Simulating the quantum system within a tight-binding framework all the result… ▽ More In this work, we put forward, for the first time, the interplay between correlated disorder and hopping dimerization on bias driven circular current in a loop conductor that is clamped between two electrodes. The correlated disorder is introduced in site energies of the ring in the form of Aubry-André-Harper (AAH) model. Simulating the quantum system within a tight-binding framework all the results are worked out based on the standard wave-guide theory. Unlike transport current, commonly referred to drain current, circular current in the loop conductor can get enhanced with increasing disorder strength. This enhancement becomes much effective when hopping dimerization is included which is taken following the Su-Schrieffer-Heeger (SSH) model. The characteristic features of bias driven circular current are studied under different input conditions and we find the results are robust for wide range of physical parameters. Our analysis may provide a new insight in analyzing transport behavior in different disordered lattices in presence of additional restrictions in hopping integrals. △ Less

Submitted 8 February, 2024; originally announced February 2024.

Comments: 6 pages, 8 figures (comments are welcome)

arXiv:2401.18045 [pdf, other]

SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition

Authors: Yihan Wu, Soumi Maiti, Yifan Peng, Wangyou Zhang, Chenda Li, Yuyue Wang, Xihua Wang, Shinji Watanabe, Ruihua Song

Abstract: Recent advancements in language models have significantly enhanced performance in multiple speech-related tasks. Existing speech language models typically utilize task-dependent prompt tokens to unify various speech tasks in a single model. However, this design omits the intrinsic connections between different speech tasks, which can potentially boost the performance of each task. In this work, we… ▽ More Recent advancements in language models have significantly enhanced performance in multiple speech-related tasks. Existing speech language models typically utilize task-dependent prompt tokens to unify various speech tasks in a single model. However, this design omits the intrinsic connections between different speech tasks, which can potentially boost the performance of each task. In this work, we propose a novel decoder-only speech language model, SpeechComposer, that can unify common speech tasks by composing a fixed set of prompt tokens. Built upon four primary tasks -- speech synthesis, speech recognition, speech language modeling, and text language modeling -- SpeechComposer can easily extend to more speech tasks via compositions of well-designed prompt tokens, like voice conversion and speech enhancement. The unification of prompt tokens also makes it possible for knowledge sharing among different speech tasks in a more structured manner. Experimental results demonstrate that our proposed SpeechComposer can improve the performance of both primary tasks and composite tasks, showing the effectiveness of the shared prompt tokens. Remarkably, the unified decoder-only model achieves a comparable and even better performance than the baselines which are expert models designed for single tasks. △ Less

Submitted 31 January, 2024; originally announced January 2024.

Comments: 11 pages, 2 figures

arXiv:2401.16812 [pdf, other]

SpeechBERTScore: Reference-Aware Automatic Evaluation of Speech Generation Leveraging NLP Evaluation Metrics

Authors: Takaaki Saeki, Soumi Maiti, Shinnosuke Takamichi, Shinji Watanabe, Hiroshi Saruwatari

Abstract: While subjective assessments have been the gold standard for evaluating speech generation, there is a growing need for objective metrics that are highly correlated with human subjective judgments due to their cost efficiency. This paper proposes reference-aware automatic evaluation methods for speech generation inspired by evaluation metrics in natural language processing. The proposed SpeechBERTS… ▽ More While subjective assessments have been the gold standard for evaluating speech generation, there is a growing need for objective metrics that are highly correlated with human subjective judgments due to their cost efficiency. This paper proposes reference-aware automatic evaluation methods for speech generation inspired by evaluation metrics in natural language processing. The proposed SpeechBERTScore computes the BERTScore for self-supervised dense speech features of the generated and reference speech, which can have different sequential lengths. We also propose SpeechBLEU and SpeechTokenDistance, which are computed on speech discrete tokens. The evaluations on synthesized speech show that our method correlates better with human subjective ratings than mel cepstral distortion and a recent mean opinion score prediction model. Also, they are effective in noisy speech evaluation and have cross-lingual applicability. △ Less

Submitted 12 June, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

Comments: Accepted by Interspeech 2024. An extended version with Appendix. Code: https://github.com/Takaaki-Saeki/DiscreteSpeechMetrics

arXiv:2401.10157 [pdf, other]

A qubit regularization of asymptotic freedom without fine-tuning

Authors: Sandip Maiti, Debasish Banerjee, Shailesh Chandrasekharan, Marina Krstic Marinkovic

Abstract: Other than the commonly used Wilson's regularization of quantum field theories (QFTs), there is a growing interest in regularizations that explore lattice models with a strictly finite local Hilbert space, in anticipation of the upcoming era of quantum simulations of QFTs. A notable example is Euclidean qubit regularization, which provides a natural way to recover continuum QFTs that emerge via in… ▽ More Other than the commonly used Wilson's regularization of quantum field theories (QFTs), there is a growing interest in regularizations that explore lattice models with a strictly finite local Hilbert space, in anticipation of the upcoming era of quantum simulations of QFTs. A notable example is Euclidean qubit regularization, which provides a natural way to recover continuum QFTs that emerge via infrared fixed points of lattice theories. Can such regularizations also capture the physics of ultraviolet fixed points? We present a novel regularization of the asymptotically free massive continuum QFT that emerges at the Berezenski-Kosterlitz-Thouless (BKT) transition through a hard core loop-gas model, discussing the advantages this model provides compared to traditional regularizations. In particular, we demonstrate that without the need for fine-tuning, it can reproduce the universal step-scaling function of the classical lattice XY model in the massive phase as we approach the phase transition. △ Less

Submitted 15 January, 2024; originally announced January 2024.

Comments: 8 pages, 8 figures, contribution to the 40th International Symposium on Lattice Field Theory (Lattice 2023), July 31st - August 4th, 2023, Fermi National Accelerator Laboratory, Batavia, Illinois, USA

arXiv:2401.01864 [pdf, other]

Constraining inflationary magnetogenesis and reheating via GWs in light of PTA data

Authors: Subhasis Maiti, Debaprasad Maity, L. Sriramkumar

Abstract: Utilizing the bounds on primordial magnetic fields (PMFs), their contributions to secondary gravitational waves (GWs) and the results from the pulsar timing arrays (PTAs), we arrive at constraints on the epoch of reheating. We find that the combined spectral density of primary and secondary GWs (generated by the PMFs) can, in general, be described as a broken power law with five different indices.… ▽ More Utilizing the bounds on primordial magnetic fields (PMFs), their contributions to secondary gravitational waves (GWs) and the results from the pulsar timing arrays (PTAs), we arrive at constraints on the epoch of reheating. We find that the combined spectral density of primary and secondary GWs (generated by the PMFs) can, in general, be described as a broken power law with five different indices. We show that the PMFs that have a blue tilt and satisfy the other observational constraints can generate secondary GWs of strengths suggested by the PTA data. △ Less

Submitted 3 January, 2024; originally announced January 2024.

Comments: 14 pages(8+6); 4 figures

arXiv:2401.01683 [pdf]

Enhanced Thermoelectric Properties of 2D Janus Ferromagnetic LaBrI with Strain-induced Valley Degeneracy

Authors: Anuja Kumari, Abhinav Nag, Santanu K. Maiti, Jagdish Kumar

Abstract: Since the successful synthesis of the MoSSe monolayer, which violated the out-of-plane mirror symmetry of TMDs monolayers, considerable and systematic research has been conducted on Janus monolayer materials. By systematically analyzing the LaBrI monolayer, we are able to learn more about the novel Janus material by focusing on the halogen family next to group VIA (S, Se, Te). The structural optim… ▽ More Since the successful synthesis of the MoSSe monolayer, which violated the out-of-plane mirror symmetry of TMDs monolayers, considerable and systematic research has been conducted on Janus monolayer materials. By systematically analyzing the LaBrI monolayer, we are able to learn more about the novel Janus material by focusing on the halogen family next to group VIA (S, Se, Te). The structural optimizations have been carried out using the FP-LAPW (Full Potential Linear Augmented Plane Wave) basis, as implemented in the ELK using tb-mBJ exchange correlation potential. Computed structural parameters are in good comparison with literature reports. Further, optimized crystal structures were used for computing effect of strain on electronic and thermoelectric properties using pseudo potential based Quantum espresso code. Dynamical stability predicts material can withstand strain upto 10% strain. Computed electronic structure reveals material to be indirect wide bandgap ferromagnetic material with magnetic moment 1μみゅーB. With increase in the biaxial tensile strain the band gap increases. Furthermore, the computed magneto-thermoelectric properties predicts high Seebeck coefficient of ~ 400 μみゅーV/K and low thermal conductivity of ~ 1.13 X 1014 W/msK in LaBrI which results high ZT of ~ 1.92 with 8% strain at 800 K with p-type doping. Thus, present study supports the fact that tensile strain on ferromagnetic LaBrI material can further enhance TE properties and making it to be a promising material for TE applications at higher temperature. △ Less

Submitted 3 January, 2024; originally announced January 2024.

Comments: 20 pages, 7 figures, Comments are Welcome

arXiv:2312.09091 [pdf, ps, other]

Multiple New Important Conjectures on Equivalence to Perfect Cuboid and Euler Brick

Authors: S Maiti

Abstract: Nobody has discovered any perfect cuboid and there is no formula to deliver all possible Euler bricks. During investigations of famous open problems related to the perfect cuboid and Euler brick; I have found some new important conjectures on Pythagorean triples and biquadratic Diophantine equation [4] which are reduced & quivalence form for perfect cuboid and Euler brick problems. The details of… ▽ More Nobody has discovered any perfect cuboid and there is no formula to deliver all possible Euler bricks. During investigations of famous open problems related to the perfect cuboid and Euler brick; I have found some new important conjectures on Pythagorean triples and biquadratic Diophantine equation [4] which are reduced & quivalence form for perfect cuboid and Euler brick problems. The details of the conjectures have been provided in Sections 2-3. If any perfect cuboid exists it will be only among the solutions of six conjectures and all the Euler bricks are only among the solutions of next three conjectures [4]. For example, if any odd $n\in \mathbb{N}$ satisfy $n=e^2-f^2=g^2-h^2=k^2-l^2$ and $e^2f^2=g^2h^2+k^2l^2$; then we can discover a perfect cuboid of type 1 as $\{e^2-f^2,2gh,2kl,g^2+h^2,k^2+l^2,2ef,e^2+f^2\}$ having $(e^2-f^2,2gh,2kl)$ having $(e^2-f^2,2gh,2kl)$ as its edges; $(g^2+h^2,k^2+l^2,2ef)$ as its face diagonals and $e^2+f^2$ as its body diagonal where $e,f,g,h,k,l~(>1)\in \mathbb{N}$. △ Less

Submitted 1 February, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

Comments: 16 pages, 12 conjectures

MSC Class: 11B13 ACM Class: G.2.0

arXiv:2312.02778 [pdf, ps, other]

Spin-dependent multiple reentrant localization in an antiferromagnetic helix with transverse electric field: Hopping dimerization-free scenario

Authors: Sudin Ganguly, Kallol Mondal, Santanu K. Maiti

Abstract: Reentrant localization (RL), a recently prominent phenomenon, traditionally links to the interplay of staggered correlated disorder and hopping dimerization, as indicated by prior research. Contrary to this paradigm, our present study demonstrates that hopping dimerization is not a pivotal factor in realizing RL. Considering a helical magnetic system with antiferromagnetic ordering, we uncover spi… ▽ More Reentrant localization (RL), a recently prominent phenomenon, traditionally links to the interplay of staggered correlated disorder and hopping dimerization, as indicated by prior research. Contrary to this paradigm, our present study demonstrates that hopping dimerization is not a pivotal factor in realizing RL. Considering a helical magnetic system with antiferromagnetic ordering, we uncover spin-dependent RL at multiple energy regions, in the {\em absence} of hopping dimerization. This phenomenon persists even in the thermodynamic limit. The correlated disorder in the form of Aubry-André-Harper model is introduced by applying a transverse electric field to the helical system, circumventing the use of traditional substitutional disorder. Described within a tight-binding framework, present work provides a novel outlook on RL, highlighting the crucial role of electric field, antiferromagnetic ordering, and the helicity of the geometry. △ Less

Submitted 6 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

Comments: 6 pages, 4 figures, comments are Welcome

arXiv:2311.13345 [pdf, other]

doi 10.1103/PhysRevB.109.104505

Many-body physics-induced selection rules: application to Raman spectroscopy

Authors: Igor Benek-Lins, Saurabh Maiti

Abstract: Spectroscopic measurements in quantum systems are subject to selection rules, usually based on space-time symmetries, that allow or disallow transitions between states. In many-body systems, in addition to the single-particle states, there emerge new ones due to collective excitations of the system. Here we demonstrate the existence of a "fragile" selection rule that emerges as a manifestation of… ▽ More Spectroscopic measurements in quantum systems are subject to selection rules, usually based on space-time symmetries, that allow or disallow transitions between states. In many-body systems, in addition to the single-particle states, there emerge new ones due to collective excitations of the system. Here we demonstrate the existence of a "fragile" selection rule that emerges as a manifestation of many-body effects and outlines the conditions for collective excitations to couple to a given spectroscopic probe beyond the usual symmetry considerations. As an example, we apply the rule to Raman spectroscopy of multiband superconductors and settle some unresolved features in experiments. △ Less

Submitted 22 November, 2023; originally announced November 2023.

Comments: 14 pages, 4 figures; includes supplementary material

arXiv:2310.20640 [pdf, other]

doi 10.1103/PhysRevB.109.094515

Electronic Raman response of a superconductor across a time reversal symmetry breaking phase transition

Authors: Surajit Sarkar, Saurabh Maiti

Abstract: Polarization-resolved electronic Raman spectroscopy is an important experimental tool to investigate collective excitations in superconductors. In this work, we present a general theory that allows us to study the evolution of all Raman active collective modes in multiple symmetry channels across a time-reversal symmetry (TRS) breaking superconducting transition. This comprehensive approach reveal… ▽ More Polarization-resolved electronic Raman spectroscopy is an important experimental tool to investigate collective excitations in superconductors. In this work, we present a general theory that allows us to study the evolution of all Raman active collective modes in multiple symmetry channels across a time-reversal symmetry (TRS) breaking superconducting transition. This comprehensive approach reveals that multiple modes belonging to different symmetry channels show a tendency to soften, even when the interactions in the subleading channel are held constant. This indicates an increased competition induced by the proximity to the TRS breaking transition. The entry into the TRS broken phase is marked by the introduction of an additional mode into the gap in multiple symmetry channels. These new modes have a phase character complementary to the ones that are already present. Even though all the modes in the TRS broken phase acquire an amplitude character, we explicitly demonstrate that the coupling to the Raman probe is exclusively through the phase sector. We demonstrate that the Raman spectrum collected in lower symmetry channels shows a selective sensitivity to the sign of the ground state order parameters and the sign of the interband interactions. Finally, we demonstrate the applicability of an interaction induced selection rule that clearly explains the spectral weights of various modes in various irreps, including the possible of $``$dark$"$ Leggett and Bardasis-Schrieffer modes. △ Less

Submitted 31 October, 2023; originally announced October 2023.

arXiv:2310.08322 [pdf, other]

A 3D Kinetic Distribution that Yields Observed Plasma Density in the Inner Van Allen Belt

Authors: Snehanshu Maiti, Harishankar Ramachandran

Abstract: A steady-state distribution is obtained that approximately yields the observed plasma density profile of the inner Van Allen radiation belt. The model assumes a collisionless, magnetized plasma with zero electric field present. The inner Van Allen belt consists of a plasma comprising high-energy protons and relativistic electrons. The particle trajectories are obtained from the collisionless Loren… ▽ More A steady-state distribution is obtained that approximately yields the observed plasma density profile of the inner Van Allen radiation belt. The model assumes a collisionless, magnetized plasma with zero electric field present. The inner Van Allen belt consists of a plasma comprising high-energy protons and relativistic electrons. The particle trajectories are obtained from the collisionless Lorentz Force equation for different initial distributions. The resulting steady-state distributions obtained after particles lost to the loss cone are eliminated and are used to generate the density profile. The distribution's dependence on energy and magnetic moment is adjusted to make the density profile agree with observations. For a distribution that is a function of energy times a function of magnetic moment, the calculation leads to the desired type of density profile. The kinetic distribution and the type of density profile obtained are presented. △ Less

Submitted 12 October, 2023; originally announced October 2023.

arXiv:2310.04394 [pdf, other]

doi 10.1103/PhysRevB.109.L041111

Spin-Mediated Direct Photon Scattering by Plasmons in BiTeI

Authors: A. C. Lee, S. Sarkar, K. Du, H. -H. Kung, C. J. Won, K. Wang, S. -W. Cheong, S. Maiti, G. Blumberg

Abstract: We use polarization resolved Raman spectroscopy to demonstrate that for a 3D giant Rashba system the bulk plasmon collective mode can directly couple to the Raman response even in the long wavelength $\mathbf q \rightarrow 0$ limit. Although conventional theory predicts the plasmon spectral weight to be suppressed as the square of its quasi-momentum and thus negligibly weak in the Raman spectra, w… ▽ More We use polarization resolved Raman spectroscopy to demonstrate that for a 3D giant Rashba system the bulk plasmon collective mode can directly couple to the Raman response even in the long wavelength $\mathbf q \rightarrow 0$ limit. Although conventional theory predicts the plasmon spectral weight to be suppressed as the square of its quasi-momentum and thus negligibly weak in the Raman spectra, we observe a sharp in-gap plasmon mode in the Raman spectrum of BiTeI below the Rashba continuum. This coupling, in a polar system with spin-orbit coupling, occurs without assistance from phonons when the incoming photon excitation is resonant with Rashba-split intermediate states. We discuss the distinctive features of BiTeI's giant Rashba system band structure that enable the direct observation of plasmon in Raman scattering. △ Less

Submitted 18 February, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

Comments: Editors' Suggestion

Journal ref: Phys. Rev. B 109, L041111 (2024)

arXiv:2310.03757 [pdf, other]

Enhancing Healthcare with EOG: A Novel Approach to Sleep Stage Classification

Authors: Suvadeep Maiti, Shivam Kumar Sharma, Raju S. Bapi

Abstract: We introduce an innovative approach to automated sleep stage classification using EOG signals, addressing the discomfort and impracticality associated with EEG data acquisition. In addition, it is important to note that this approach is untapped in the field, highlighting its potential for novel insights and contributions. Our proposed SE-Resnet-Transformer model provides an accurate classificatio… ▽ More We introduce an innovative approach to automated sleep stage classification using EOG signals, addressing the discomfort and impracticality associated with EEG data acquisition. In addition, it is important to note that this approach is untapped in the field, highlighting its potential for novel insights and contributions. Our proposed SE-Resnet-Transformer model provides an accurate classification of five distinct sleep stages from raw EOG signal. Extensive validation on publically available databases (SleepEDF-20, SleepEDF-78, and SHHS) reveals noteworthy performance, with macro-F1 scores of 74.72, 70.63, and 69.26, respectively. Our model excels in identifying REM sleep, a crucial aspect of sleep disorder investigations. We also provide insight into the internal mechanisms of our model using techniques such as 1D-GradCAM and t-SNE plots. Our method improves the accessibility of sleep stage classification while decreasing the need for EEG modalities. This development will have promising implications for healthcare and the incorporation of wearable technology into sleep studies, thereby advancing the field's potential for enhanced diagnostics and patient comfort. △ Less

Submitted 25 September, 2023; originally announced October 2023.

arXiv:2310.00706 [pdf, other]

Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech

Authors: Dareen Alharthi, Roshan Sharma, Hira Dhamyal, Soumi Maiti, Bhiksha Raj, Rita Singh

Abstract: Modern speech synthesis systems have improved significantly, with synthetic speech being indistinguishable from real speech. However, efficient and holistic evaluation of synthetic speech still remains a significant challenge. Human evaluation using Mean Opinion Score (MOS) is ideal, but inefficient due to high costs. Therefore, researchers have developed auxiliary automatic metrics like Word Erro… ▽ More Modern speech synthesis systems have improved significantly, with synthetic speech being indistinguishable from real speech. However, efficient and holistic evaluation of synthetic speech still remains a significant challenge. Human evaluation using Mean Opinion Score (MOS) is ideal, but inefficient due to high costs. Therefore, researchers have developed auxiliary automatic metrics like Word Error Rate (WER) to measure intelligibility. Prior works focus on evaluating synthetic speech based on pre-trained speech recognition models, however, this can be limiting since this approach primarily measures speech intelligibility. In this paper, we propose an evaluation technique involving the training of an ASR model on synthetic speech and assessing its performance on real speech. Our main assumption is that by training the ASR model on the synthetic speech, the WER on real speech reflects the similarity between distributions, a broader assessment of synthetic speech quality beyond intelligibility. Our proposed metric demonstrates a strong correlation with both MOS naturalness and MOS intelligibility when compared to SpeechLMScore and MOSNet on three recent Text-to-Speech (TTS) systems: MQTTS, StyleTTS, and YourTTS. △ Less

Submitted 1 October, 2023; originally announced October 2023.

arXiv:2309.15800 [pdf, other]

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study

Authors: Xuankai Chang, Brian Yan, Kwanghee Choi, Jeeweon Jung, Yichen Lu, Soumi Maiti, Roshan Sharma, Jiatong Shi, Jinchuan Tian, Shinji Watanabe, Yuya Fujita, Takashi Maekaku, Pengcheng Guo, Yao-Fei Cheng, Pavel Denisov, Kohei Saijo, Hsiu-Hsuan Wang

Abstract: Speech signals, typically sampled at rates in the tens of thousands per second, contain redundancies, evoking inefficiencies in sequence modeling. High-dimensional speech features such as spectrograms are often used as the input for the subsequent model. However, they can still be redundant. Recent investigations proposed the use of discrete speech units derived from self-supervised learning repre… ▽ More Speech signals, typically sampled at rates in the tens of thousands per second, contain redundancies, evoking inefficiencies in sequence modeling. High-dimensional speech features such as spectrograms are often used as the input for the subsequent model. However, they can still be redundant. Recent investigations proposed the use of discrete speech units derived from self-supervised learning representations, which significantly compresses the size of speech data. Applying various methods, such as de-duplication and subword modeling, can further compress the speech sequence length. Hence, training time is significantly reduced while retaining notable performance. In this study, we undertake a comprehensive and systematic exploration into the application of discrete units within end-to-end speech processing models. Experiments on 12 automatic speech recognition, 3 speech translation, and 1 spoken language understanding corpora demonstrate that discrete units achieve reasonably good results in almost all the settings. We intend to release our configurations and trained models to foster future research efforts. △ Less

Submitted 27 September, 2023; originally announced September 2023.

Comments: Submitted to IEEE ICASSP 2024

arXiv:2309.15317 [pdf, other]

Joint Prediction and Denoising for Large-scale Multilingual Self-supervised Learning

Authors: William Chen, Jiatong Shi, Brian Yan, Dan Berrebbi, Wangyou Zhang, Yifan Peng, Xuankai Chang, Soumi Maiti, Shinji Watanabe

Abstract: Multilingual self-supervised learning (SSL) has often lagged behind state-of-the-art (SOTA) methods due to the expenses and complexity required to handle many languages. This further harms the reproducibility of SSL, which is already limited to few research groups due to its resource usage. We show that more powerful techniques can actually lead to more efficient pre-training, opening SSL to more… ▽ More Multilingual self-supervised learning (SSL) has often lagged behind state-of-the-art (SOTA) methods due to the expenses and complexity required to handle many languages. This further harms the reproducibility of SSL, which is already limited to few research groups due to its resource usage. We show that more powerful techniques can actually lead to more efficient pre-training, opening SSL to more research groups. We propose WavLabLM, which extends WavLM's joint prediction and denoising to 40k hours of data across 136 languages. To build WavLabLM, we devise a novel multi-stage pre-training method, designed to address the language imbalance of multilingual data. WavLabLM achieves comparable performance to XLS-R on ML-SUPERB with less than 10% of the training data, making SSL realizable with academic compute. We show that further efficiency can be achieved with a vanilla HuBERT Base model, which can maintain 94% of XLS-R's performance with only 3% of the data, 4 GPUs, and limited trials. We open-source all code and models in ESPnet. △ Less

Submitted 27 September, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

Comments: Accepted to ASRU 2023

arXiv:2309.13876 [pdf, other]

Reproducing Whisper-Style Training Using an Open-Source Toolkit and Publicly Available Data

Authors: Yifan Peng, Jinchuan Tian, Brian Yan, Dan Berrebbi, Xuankai Chang, Xinjian Li, Jiatong Shi, Siddhant Arora, William Chen, Roshan Sharma, Wangyou Zhang, Yui Sudo, Muhammad Shakeel, Jee-weon Jung, Soumi Maiti, Shinji Watanabe

Abstract: Pre-training speech models on large volumes of data has achieved remarkable success. OpenAI Whisper is a multilingual multitask model trained on 680k hours of supervised speech data. It generalizes well to various speech recognition and translation benchmarks even in a zero-shot setup. However, the full pipeline for developing such models (from data collection to training) is not publicly accessib… ▽ More Pre-training speech models on large volumes of data has achieved remarkable success. OpenAI Whisper is a multilingual multitask model trained on 680k hours of supervised speech data. It generalizes well to various speech recognition and translation benchmarks even in a zero-shot setup. However, the full pipeline for developing such models (from data collection to training) is not publicly accessible, which makes it difficult for researchers to further improve its performance and address training-related issues such as efficiency, robustness, fairness, and bias. This work presents an Open Whisper-style Speech Model (OWSM), which reproduces Whisper-style training using an open-source toolkit and publicly available data. OWSM even supports more translation directions and can be more efficient to train. We will publicly release all scripts used for data preparation, training, inference, and scoring as well as pre-trained models and training logs to promote open science. △ Less

Submitted 24 October, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

Comments: Accepted at ASRU 2023

arXiv:2309.08531 [pdf, other]

Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal Tokens

Authors: Minsu Kim, Jeongsoo Choi, Soumi Maiti, Jeong Hun Yeo, Shinji Watanabe, Yong Man Ro

Abstract: In this paper, we propose methods to build a powerful and efficient Image-to-Speech captioning (Im2Sp) model. To this end, we start with importing the rich knowledge related to image comprehension and language modeling from a large-scale pre-trained vision-language model into Im2Sp. We set the output of the proposed Im2Sp as discretized speech units, i.e., the quantized speech features of a self-s… ▽ More In this paper, we propose methods to build a powerful and efficient Image-to-Speech captioning (Im2Sp) model. To this end, we start with importing the rich knowledge related to image comprehension and language modeling from a large-scale pre-trained vision-language model into Im2Sp. We set the output of the proposed Im2Sp as discretized speech units, i.e., the quantized speech features of a self-supervised speech model. The speech units mainly contain linguistic information while suppressing other characteristics of speech. This allows us to incorporate the language modeling capability of the pre-trained vision-language model into the spoken language modeling of Im2Sp. With the vision-language pre-training strategy, we set new state-of-the-art Im2Sp performances on two widely used benchmark databases, COCO and Flickr8k. Then, we further improve the efficiency of the Im2Sp model. Similar to the speech unit case, we convert the original image into image units, which are derived through vector quantization of the raw image. With these image units, we can drastically reduce the required data storage for saving image data to just 0.8% when compared to the original image data in terms of bits. Demo page: https://ms-dot-k.github.io/Image-to-Speech-Captioning. △ Less

Submitted 15 September, 2023; originally announced September 2023.

arXiv:2309.07937 [pdf, other]

Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks

Authors: Soumi Maiti, Yifan Peng, Shukjae Choi, Jee-weon Jung, Xuankai Chang, Shinji Watanabe

Abstract: We propose a decoder-only language model, VoxtLM, that can perform four tasks: speech recognition, speech synthesis, text generation, and speech continuation. VoxtLM integrates text vocabulary with discrete speech tokens from self-supervised speech features and uses special tokens to enable multitask learning. Compared to a single-task model, VoxtLM exhibits a significant improvement in speech syn… ▽ More We propose a decoder-only language model, VoxtLM, that can perform four tasks: speech recognition, speech synthesis, text generation, and speech continuation. VoxtLM integrates text vocabulary with discrete speech tokens from self-supervised speech features and uses special tokens to enable multitask learning. Compared to a single-task model, VoxtLM exhibits a significant improvement in speech synthesis, with improvements in both speech intelligibility from 28.9 to 5.6 and objective quality from 2.68 to 3.90. VoxtLM also improves speech generation and speech recognition performance over the single-task counterpart. Further, VoxtLM is trained with publicly available data and training recipes and model checkpoints are open-sourced to make fully reproducible work. △ Less

Submitted 24 January, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

arXiv:2309.07156 [pdf, other]

Transparency in Sleep Staging: Deep Learning Method for EEG Sleep Stage Classification with Model Interpretability

Authors: Shivam Sharma, Suvadeep Maiti, S. Mythirayee, Srijithesh Rajendran, Raju Surampudi Bapi

Abstract: Automated Sleep stage classification using raw single channel EEG is a critical tool for sleep quality assessment and disorder diagnosis. However, modelling the complexity and variability inherent in this signal is a challenging task, limiting their practicality and effectiveness in clinical settings. To mitigate these challenges, this study presents an end-to-end deep learning (DL) model which in… ▽ More Automated Sleep stage classification using raw single channel EEG is a critical tool for sleep quality assessment and disorder diagnosis. However, modelling the complexity and variability inherent in this signal is a challenging task, limiting their practicality and effectiveness in clinical settings. To mitigate these challenges, this study presents an end-to-end deep learning (DL) model which integrates squeeze and excitation blocks within the residual network to extract features and stacked Bi-LSTM to understand complex temporal dependencies. A distinctive aspect of this study is the adaptation of GradCam for sleep staging, marking the first instance of an explainable DL model in this domain with alignment of its decision-making with sleep expert's insights. We evaluated our model on the publically available datasets (SleepEDF-20, SleepEDF-78, and SHHS), achieving Macro-F1 scores of 82.5, 78.9, and 81.9, respectively. Additionally, a novel training efficiency enhancement strategy was implemented by increasing stride size, leading to 8x faster training times with minimal impact on performance. Comparative analyses underscore our model outperforms all existing baselines, indicating its potential for clinical usage. △ Less

Submitted 14 January, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

Comments: 12 pages, 9 figures, Under review at IEEE Journal of Biomedical and Health Informatics

arXiv:2308.02784 [pdf, other]

Semi-supervised Contrastive Regression for Estimation of Eye Gaze

Authors: Somsukla Maiti, Akshansh Gupta

Abstract: With the escalated demand of human-machine interfaces for intelligent systems, development of gaze controlled system have become a necessity. Gaze, being the non-intrusive form of human interaction, is one of the best suited approach. Appearance based deep learning models are the most widely used for gaze estimation. But the performance of these models is entirely influenced by the size of labeled… ▽ More With the escalated demand of human-machine interfaces for intelligent systems, development of gaze controlled system have become a necessity. Gaze, being the non-intrusive form of human interaction, is one of the best suited approach. Appearance based deep learning models are the most widely used for gaze estimation. But the performance of these models is entirely influenced by the size of labeled gaze dataset and in effect affects generalization in performance. This paper aims to develop a semi-supervised contrastive learning framework for estimation of gaze direction. With a small labeled gaze dataset, the framework is able to find a generalized solution even for unseen face images. In this paper, we have proposed a new contrastive loss paradigm that maximizes the similarity agreement between similar images and at the same time reduces the redundancy in embedding representations. Our contrastive regression framework shows good performance in comparison to several state of the art contrastive learning techniques used for gaze estimation. △ Less

Submitted 5 August, 2023; originally announced August 2023.

Comments: Accepted for International Conference on Pattern Recognition and Machine Intelligence 2023 (PReMI 2023)

Report number: Paper 057, https://www.isical.ac.in/~premi23/List_of_Accepted_Papers.pdf

arXiv:2307.06117 [pdf, other]

A qubit regularization of asymptotic freedom at the BKT transition without fine-tuning

Authors: Sandip Maiti, Debasish Banerjee, Shailesh Chandrasekharan, Marina K. Marinkovic

Abstract: We propose a two-dimensional hard core loop-gas model as a way to regularize the asymptotically free massive continuum quantum field theory that emerges at the BKT transition. Without fine-tuning, our model can reproduce the universal step-scaling function of the classical lattice XY model in the massive phase as we approach the phase transition. This is achieved by lowering the fugacity of Fock-v… ▽ More We propose a two-dimensional hard core loop-gas model as a way to regularize the asymptotically free massive continuum quantum field theory that emerges at the BKT transition. Without fine-tuning, our model can reproduce the universal step-scaling function of the classical lattice XY model in the massive phase as we approach the phase transition. This is achieved by lowering the fugacity of Fock-vacuum sites in the loop-gas configuration space to zero in the thermodynamic limit. Some of the universal quantities at the BKT transition show smaller finite size effects in our model as compared to the traditional XY model. Our model is a prime example of qubit regularization of an asymptotically free massive quantum field theory in Euclidean space-time and helps understand how asymptotic freedom can arise as a relevant perturbation at a decoupled fixed point without fine-tuning. △ Less

Submitted 8 July, 2023; originally announced July 2023.

Comments: 21 pages(6+15), 12 figures

arXiv:2307.03148 [pdf, other]

On the Computation of Accessibility Provided by Shared Mobility

Authors: Severin Diepolder, Andrea Araldo, Tarek Chouaki, Santa Maiti, Sebastian Hörl, Constantinos Antoniou

Abstract: Shared Mobility Services (SMS), e.g., Demand-Responsive Transit (DRT) or ride-sharing, can improve mobility in low-density areas, often poorly served by conventional Public Transport (PT). Such improvement is mostly quantified via basic performance indicators, like wait or travel time. However, accessibility indicators, measuring the ease of reaching surrounding opportunities (e.g., jobs, schools,… ▽ More Shared Mobility Services (SMS), e.g., Demand-Responsive Transit (DRT) or ride-sharing, can improve mobility in low-density areas, often poorly served by conventional Public Transport (PT). Such improvement is mostly quantified via basic performance indicators, like wait or travel time. However, accessibility indicators, measuring the ease of reaching surrounding opportunities (e.g., jobs, schools, shops, ...), would be a more comprehensive indicator. To date, no method exists to quantify the accessibility of SMS based on empirical measurements. Indeed, accessibility is generally computed on graph representations of PT networks, but SMS are dynamic and do not follow a predefined network. We propose a spatial-temporal statistical method that takes as input observed trips of a SMS acting as a feeder for PT and summarized such trips in a graph. On such a graph, we compute classic accessibility indicators. We apply our method to a MATSim simulation study concerning DRT in Paris-Saclay. △ Less

Submitted 12 July, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

ACM Class: J.2

Journal ref: hEART 2023: 11th Symposium of the European Association for Research in Transportation

arXiv:2306.14452 [pdf, ps, other]

Phenomenon of multiple reentrant localization in a double-stranded helix with transverse electric field

Authors: Sudin Ganguly, Suparna Sarkar, Kallol Mondal, Santanu K. Maiti

Abstract: The present work explores the potential for observing multiple reentrant localization behavior in a double-stranded helical (DSH) system, extending beyond the conventional nearest-neighbor hopping interaction. The DSH system is considered to have hopping dimerization in each strand, while also being subjected to a transverse electric field. The inclusion of an electric field serves the dual purpos… ▽ More The present work explores the potential for observing multiple reentrant localization behavior in a double-stranded helical (DSH) system, extending beyond the conventional nearest-neighbor hopping interaction. The DSH system is considered to have hopping dimerization in each strand, while also being subjected to a transverse electric field. The inclusion of an electric field serves the dual purpose of inducing quasiperiodic disorder and strand-wise staggered site energies. Two reentrant localization regions are identified: one exhibiting true extended behavior in the thermodynamic limit, while the second region shows quasi-extended characteristics with partial spreading within the helix. The DSH system exhibits three distinct single-particle mobility edges linked to localization transitions present in the system. The analysis in this study involves examining various parameters such as the single-particle energy spectrum, inverse participation ratio, local probability amplitude, and more. Our proposal, combining achievable hopping dimerization and induced correlated disorder, presents a unique opportunity to study phenomenon of reentrant localization, generating significant research interest. △ Less

Submitted 10 July, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

Comments: 7 pages, 6 figures, comments are Welcome

arXiv:2306.11240 [pdf, other]

doi 10.1103/PhysRevB.109.035160

Spin-orbit interaction enabled electronic Raman scattering from charge collective modes

Authors: Surajit Sarkar, Alexander Lee, Girsh Blumberg, Saurabh Maiti

Abstract: Electronic Raman scattering in the fully symmetric channel couples to the charge excitations in the system, including the plasmons. However, the plasmon response has a spectral weight of $\sim q^2$, where $q$, the momentum transferred by light, is small. In this work, we show that in inversion symmetry broken systems where Rashba type spin-orbit coupling affects the states at the Fermi energy (whi… ▽ More Electronic Raman scattering in the fully symmetric channel couples to the charge excitations in the system, including the plasmons. However, the plasmon response has a spectral weight of $\sim q^2$, where $q$, the momentum transferred by light, is small. In this work, we show that in inversion symmetry broken systems where Rashba type spin-orbit coupling affects the states at the Fermi energy (which is a known low energy effect) as well as the transition elements to other states (a high energy effect), there is an additional coupling of the plasmons to the Raman vertex, even at zero momentum transfer, that results in a spectral weight that is proportional to the spin-orbit coupling. The high energy effect is due to the breaking of SU(2) spin invariance in the spin-flip transitions to the intermediate state. We present a theory for this coupling near the resonant regime of Raman scattering and show that in giant Rashba systems it can dominate over the conventional $q^2$ weighted coupling. We also provide experimental support along with a symmetry based justification for this spin-mediated coupling by identifying a prominent c-axis plasmon peak in the fully symmetric channel of the resonant Raman spectrum of the giant Rashba material BiTeI. This new coupling could lead to novel ways of manipulating coherent charge excitations in inversion-broken systems. This process is also relevant for spectroscopic studies in ultrafast spectroscopies, certain driven Floquet systems and topologically non-trivial phases of matter where strong inversion-breaking spin-orbit coupling plays a role. △ Less

Submitted 13 February, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

Comments: 22p with 9 figures, replaced with journal version

Journal ref: Phys Rev B, 109, 035160 (2024)

arXiv:2306.09057 [pdf, other]

A Learning Assisted Method for Uncovering Power Grid Generation and Distribution System Vulnerabilities

Authors: Suman Maiti, Anjana B, Sunandan Adhikary, Ipsita Koley, Soumyajit Dey

Abstract: Intelligent attackers can suitably tamper sensor/actuator data at various Smart grid surfaces causing intentional power oscillations, which if left undetected, can lead to voltage disruptions. We develop a novel combination of formal methods and machine learning tools that learns power system dynamics with the objective of generating unsafe yet stealthy false data based attack sequences. We enable… ▽ More Intelligent attackers can suitably tamper sensor/actuator data at various Smart grid surfaces causing intentional power oscillations, which if left undetected, can lead to voltage disruptions. We develop a novel combination of formal methods and machine learning tools that learns power system dynamics with the objective of generating unsafe yet stealthy false data based attack sequences. We enable the grid with anomaly detectors in a generalized manner so that it is difficult for an attacker to remain undetected. Our methodology, when applied on an IEEE 14 bus power grid model, uncovers stealthy attack vectors even in presence of such detectors. △ Less

Submitted 15 June, 2023; originally announced June 2023.

arXiv:2306.06672 [pdf, other]

Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute

Authors: William Chen, Xuankai Chang, Yifan Peng, Zhaoheng Ni, Soumi Maiti, Shinji Watanabe

Abstract: Self-supervised learning (SSL) has led to great strides in speech processing. However, the resources needed to train these models has become prohibitively large as they continue to scale. Currently, only a few groups with substantial resources are capable of creating SSL models, which harms reproducibility. In this work, we optimize HuBERT SSL to fit in academic constraints. We reproduce HuBERT in… ▽ More Self-supervised learning (SSL) has led to great strides in speech processing. However, the resources needed to train these models has become prohibitively large as they continue to scale. Currently, only a few groups with substantial resources are capable of creating SSL models, which harms reproducibility. In this work, we optimize HuBERT SSL to fit in academic constraints. We reproduce HuBERT independently from the original implementation, with no performance loss. Our code and training optimizations make SSL feasible with only 8 GPUs, instead of the 32 used in the original work. We also explore a semi-supervised route, using an ASR model to skip the first pre-training iteration. Within one iteration of pre-training, our models improve over HuBERT on several tasks. Furthermore, our HuBERT Large variant requires only 8 GPUs, achieving similar performance to the original trained on 128. As our contribution to the community, all models, configurations, and code are made open-source in ESPnet. △ Less

Submitted 11 June, 2023; originally announced June 2023.

Comments: Accepted at INTERSPEECH 2023

arXiv:2305.15303 [pdf]

Transport phenomena of TiCoSb: Defects induced modification in structure and density of states

Authors: S. Mahakal, Diptasikha Das, Pintu Singha, Aritra Banerjee, S. Chatterjee, Santanu K. Maiti, S. Assa Aravindh, K. Malik

Abstract: TiCoSb1+x (x=0.0, 0.01, 0.02, 0.03, 0.04, 0.06) samples have been synthesized, employing solid state reaction method followed by arc menting. Theoretical calculations, using Density Functional Theory (DFT) have been performed to estimate band structure and density of states (DOS). Further, energitic calculations, using first principle have been carried out to reveal the formation energy for vacanc… ▽ More TiCoSb1+x (x=0.0, 0.01, 0.02, 0.03, 0.04, 0.06) samples have been synthesized, employing solid state reaction method followed by arc menting. Theoretical calculations, using Density Functional Theory (DFT) have been performed to estimate band structure and density of states (DOS). Further, energitic calculations, using first principle have been carried out to reveal the formation energy for vacancy, interstitial, anti-site defects. Detail structural calculation, employing Rietveld refinement reveals the presence of embedded phases, vacancy and interstitial atom, which is also supported by the theoretical calculations. Lattice strain, crystalline size and dislocation density have been estimated by Williamson-Hall and modified Williamson-Hall methods. Thermal variation of resistivity [\r{ho}(T)] and thermopower [S(T)] have been explained using Mott equation and density of states (DOS) modification near the Fermi surface due to Co vancancy and embedded phases. Figure of merit (ZT) has been calculated and 4 to 5 times higher ZT for TiCoSb than earlier reported value is obtained at room temperature. △ Less

Submitted 24 May, 2023; originally announced May 2023.

Comments: 12 pages, 12 figures (comments are welcome)

arXiv:2305.05582 [pdf, ps, other]

doi 10.1103/PhysRevB.108.195401

Thermoelectric phenomena in an antiferromagnetic helix: Role of electric field

Authors: Kallol Mondal, Sudin Ganguly, Santanu K. Maiti

Abstract: The charge and spin-dependent thermoelectric responses are investigated on a single-helical molecule possessing a collinear antiferromagnetic spin arrangement with zero net magnetization in the presence of a transverse electric field. Both the short and long-range hopping scenarios are considered, which mimic biological systems like single-stranded DNA and $αあるふぁ$-protein molecules. A non-equilibrium… ▽ More The charge and spin-dependent thermoelectric responses are investigated on a single-helical molecule possessing a collinear antiferromagnetic spin arrangement with zero net magnetization in the presence of a transverse electric field. Both the short and long-range hopping scenarios are considered, which mimic biological systems like single-stranded DNA and $αあるふぁ$-protein molecules. A non-equilibrium Green's function formalism is employed following the Landauer-Buttiker prescription to study the thermoelectric phenomena. The detailed dependence of the basic thermoelectric quantities on helicity, electric field, temperature etc., are elaborated on, and the underlying physics is explained accordingly. The charge and spin \textit{figure of merits} are computed and compared critically. For a more accurate estimation, the phononic contribution towards thermal conductance is also included. The present proposition shows a favorable spin-dependent thermoelectric response compared to the charge counterpart. △ Less

Submitted 9 May, 2023; originally announced May 2023.

Comments: 11 pages, 8 figures, comments are Welcome

Journal ref: Phys. Rev. B 108, 195401 (2023)

arXiv:2304.12826 [pdf, ps, other]

doi 10.1007/s10665-022-10234-7

Electroosmotic flow of a rheological fluid in non-uniform micro-vessels

Authors: S. Maiti, S. K. Pandey, J. C. Misra

Abstract: The paper deals with a theoretical study of electrokinetic flow of a rheological Herschel-Bulkley fluid through a cylindrical tube of variable cross-section. The concern of this study is to analyze combined pressure-driven and electroosmotic flow of Herschel-Bulkley fluid. The wall potential is considered to vary slowly and periodically along the axis of the tube. With reference to flow in the mic… ▽ More The paper deals with a theoretical study of electrokinetic flow of a rheological Herschel-Bulkley fluid through a cylindrical tube of variable cross-section. The concern of this study is to analyze combined pressure-driven and electroosmotic flow of Herschel-Bulkley fluid. The wall potential is considered to vary slowly and periodically along the axis of the tube. With reference to flow in the micro-vessels, the problem has been solved using the lubrication theory. The Helmholtz-Smoluchowski (HS) slip boundary condition has been employed in this study. Volumetric flow rate $Q$ is found to be significantly affected by the yield stress parameter $νにゅー$ only if an applied pressure force is active. The linear superposition of flow components separately due to the hydrodynamic and electric force occurs only for a strictly uniform tube. This linear relationship fails if non-uniformity appears in either tube radius or in distribution of the electrokinetic slip boundary condition. Moreover, converging/diverging nature of the mean tube radius plays a crucial role on the fluid transport. For the benefit of readers, along with the original contribution, some applications of external electrical stimulation (ES) in the human body and HS slip velocity, studied in the past by previous researchers have been discussed in the paper. △ Less

Submitted 15 February, 2023; originally announced April 2023.

Comments: 44 pages and 28 figures

Journal ref: Journal of Engineering Mathematics 135, 8 (2022)

arXiv:2304.08081 [pdf, ps, other]

Thermal signature of helical molecule: Beyond nearest-neighbor electron hopping

Authors: Suparna Sarkar, Santanu K. Maiti, David Laroze

Abstract: We investigate, for the first time, the thermal signature of a single-stranded helical molecule, subjected to a transverse electric field, by analyzing electronic specific heat (ESH). Depending on the hopping of electrons, two different kinds of helical systems are considered. In one case the hopping is confined within a few neighboring lattice sites which is referred to as short-range hopping (SR… ▽ More We investigate, for the first time, the thermal signature of a single-stranded helical molecule, subjected to a transverse electric field, by analyzing electronic specific heat (ESH). Depending on the hopping of electrons, two different kinds of helical systems are considered. In one case the hopping is confined within a few neighboring lattice sites which is referred to as short-range hopping (SRH) helix, while in the other case, electrons can hop in all possible sites making the system a long-range hopping (LRH) one. The interplay between helicity and the electric field is quite significant. Our detailed study shows that, in the low-temperature limit, the SRH helix is more sensitive to temperature than its counterpart. Whereas, the situation gets reversed in the limit of high temperatures. The thermal response of the helix can be modified selectively by means of the electric field, and the difference between specific heats of the two helices gradually decreases with increasing the field strength. The molecular handedness (viz, left-handed or right-handed) rather has no appreciable effect on the thermal signature. Finally, one important usefulness of ESH is discussed. If the helix contains a point defect, then by comparing the results of perfect and defective helices, one can estimate the location of the defect, which might be useful in diagnosing bad cells and different diseases. △ Less

Submitted 17 April, 2023; originally announced April 2023.

Comments: 8 pages, 10 figures

arXiv:2304.04596 [pdf, other]

ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit

Authors: Brian Yan, Jiatong Shi, Yun Tang, Hirofumi Inaguma, Yifan Peng, Siddharth Dalmia, Peter Polák, Patrick Fernandes, Dan Berrebbi, Tomoki Hayashi, Xiaohui Zhang, Zhaoheng Ni, Moto Hira, Soumi Maiti, Juan Pino, Shinji Watanabe

Abstract: ESPnet-ST-v2 is a revamp of the open-source ESPnet-ST toolkit necessitated by the broadening interests of the spoken language translation community. ESPnet-ST-v2 supports 1) offline speech-to-text translation (ST), 2) simultaneous speech-to-text translation (SST), and 3) offline speech-to-speech translation (S2ST) -- each task is supported with a wide variety of approaches, differentiating ESPnet-… ▽ More ESPnet-ST-v2 is a revamp of the open-source ESPnet-ST toolkit necessitated by the broadening interests of the spoken language translation community. ESPnet-ST-v2 supports 1) offline speech-to-text translation (ST), 2) simultaneous speech-to-text translation (SST), and 3) offline speech-to-speech translation (S2ST) -- each task is supported with a wide variety of approaches, differentiating ESPnet-ST-v2 from other open source spoken language translation toolkits. This toolkit offers state-of-the-art architectures such as transducers, hybrid CTC/attention, multi-decoders with searchable intermediates, time-synchronous blockwise CTC/attention, Translatotron models, and direct discrete unit models. In this paper, we describe the overall design, example models for each task, and performance benchmarking behind ESPnet-ST-v2, which is publicly available at https://github.com/espnet/espnet. △ Less

Submitted 6 July, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

Comments: ACL 2023; System Demonstration

arXiv:2303.15983 [pdf, ps, other]

doi 10.1038/s41598-023-40690-9

Electrical analogue of one-dimensional and quasi-one-dimensional Aubry-André-Harper lattices

Authors: Sudin Ganguly, Santanu K. Maiti

Abstract: The present work discusses the possibility to realize correlated disorder in electrical circuits and studies the localization phenomena in terms of two-port impedance. The correlated disorder is incorporated using the Aubry-André-Harper (AAH) model. One-dimensional and quasi-one-dimensional AAH structures are explored and directly mapped with their tight-binding analogues. Transitions from the hig… ▽ More The present work discusses the possibility to realize correlated disorder in electrical circuits and studies the localization phenomena in terms of two-port impedance. The correlated disorder is incorporated using the Aubry-André-Harper (AAH) model. One-dimensional and quasi-one-dimensional AAH structures are explored and directly mapped with their tight-binding analogues. Transitions from the high-conducting phase to the low-conducting one are observed for the circuits. △ Less

Submitted 28 March, 2023; originally announced March 2023.

Comments: 6 pages, 7 figures, Comments are Welcome

Journal ref: Scientific Reports 2023

arXiv:2303.12728 [pdf, other]

LocalEyenet: Deep Attention framework for Localization of Eyes

Authors: Somsukla Maiti, Akshansh Gupta

Abstract: Development of human machine interface has become a necessity for modern day machines to catalyze more autonomy and more efficiency. Gaze driven human intervention is an effective and convenient option for creating an interface to alleviate human errors. Facial landmark detection is very crucial for designing a robust gaze detection system. Regression based methods capacitate good spatial localiza… ▽ More Development of human machine interface has become a necessity for modern day machines to catalyze more autonomy and more efficiency. Gaze driven human intervention is an effective and convenient option for creating an interface to alleviate human errors. Facial landmark detection is very crucial for designing a robust gaze detection system. Regression based methods capacitate good spatial localization of the landmarks corresponding to different parts of the faces. But there are still scope of improvements which have been addressed by incorporating attention. In this paper, we have proposed a deep coarse-to-fine architecture called LocalEyenet for localization of only the eye regions that can be trained end-to-end. The model architecture, build on stacked hourglass backbone, learns the self-attention in feature maps which aids in preserving global as well as local spatial dependencies in face image. We have incorporated deep layer aggregation in each hourglass to minimize the loss of attention over the depth of architecture. Our model shows good generalization ability in cross-dataset evaluation and in real-time localization of eyes. △ Less

Submitted 13 March, 2023; originally announced March 2023.

arXiv:2302.12829 [pdf, other]

Improving Massively Multilingual ASR With Auxiliary CTC Objectives

Authors: William Chen, Brian Yan, Jiatong Shi, Yifan Peng, Soumi Maiti, Shinji Watanabe

Abstract: Multilingual Automatic Speech Recognition (ASR) models have extended the usability of speech technologies to a wide variety of languages. With how many languages these models have to handle, however, a key to understanding their imbalanced performance across different languages is to examine if the model actually knows which language it should transcribe. In this paper, we introduce our work on im… ▽ More Multilingual Automatic Speech Recognition (ASR) models have extended the usability of speech technologies to a wide variety of languages. With how many languages these models have to handle, however, a key to understanding their imbalanced performance across different languages is to examine if the model actually knows which language it should transcribe. In this paper, we introduce our work on improving performance on FLEURS, a 102-language open ASR benchmark, by conditioning the entire model on language identity (LID). We investigate techniques inspired from recent Connectionist Temporal Classification (CTC) studies to help the model handle the large number of languages, conditioning on the LID predictions of auxiliary tasks. Our experimental results demonstrate the effectiveness of our technique over standard CTC/Attention-based hybrid models. Furthermore, our state-of-the-art systems using self-supervised models with the Conformer architecture improve over the results of prior work on FLEURS by a relative 28.4% CER. Trained models and reproducible recipes are available at https://github.com/espnet/espnet/tree/master/egs2/fleurs/asr1 . △ Less

Submitted 27 February, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

Comments: 5 pages, 1 figure, accepted at ICASSP 2023; fixed typo and URL in abstract

arXiv:2302.10187 [pdf]

Effect of pH on structure and surface charge of Fe$_2$O$_3$ nanoparticles synthesized at different pH conditions and correlation to antibacterial properties

Authors: Farzana Naushin, Srishti Sen, Mukul Kumar, Hemang Bairagi, Siddhartha Maiti, Jaydeep Bhattacharya, Somaditya Sen

Abstract: pH of a solution is the ratio of H+/OH- ions. The relative ratio of these charges may affect forming bonds during a hydrothermal synthesis by influencing electronic clouds of participant ions, which can modify the structure and hence crystallinity, strain, disorder, surface termination etc. These factors may modify physical properties including the surface charge. This work uses hematite nanoparti… ▽ More pH of a solution is the ratio of H+/OH- ions. The relative ratio of these charges may affect forming bonds during a hydrothermal synthesis by influencing electronic clouds of participant ions, which can modify the structure and hence crystallinity, strain, disorder, surface termination etc. These factors may modify physical properties including the surface charge. This work uses hematite nanoparticles to correlate the structural modifications to all these properties and finally to the antibacterial properties due to the surface charge interaction of the nanoparticles and the bacterial cell walls. △ Less

Submitted 19 February, 2023; originally announced February 2023.

Comments: 35 pages, 11 figures, to be submitted to PRL

arXiv:2301.12596 [pdf, other]

Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining

Authors: Takaaki Saeki, Soumi Maiti, Xinjian Li, Shinji Watanabe, Shinnosuke Takamichi, Hiroshi Saruwatari

Abstract: While neural text-to-speech (TTS) has achieved human-like natural synthetic speech, multilingual TTS systems are limited to resource-rich languages due to the need for paired text and studio-quality audio data. This paper proposes a method for zero-shot multilingual TTS using text-only data for the target language. The use of text-only data allows the development of TTS systems for low-resource la… ▽ More While neural text-to-speech (TTS) has achieved human-like natural synthetic speech, multilingual TTS systems are limited to resource-rich languages due to the need for paired text and studio-quality audio data. This paper proposes a method for zero-shot multilingual TTS using text-only data for the target language. The use of text-only data allows the development of TTS systems for low-resource languages for which only textual resources are available, making TTS accessible to thousands of languages. Inspired by the strong cross-lingual transferability of multilingual language models, our framework first performs masked language model pretraining with multilingual text-only data. Then we train this model with a paired data in a supervised manner, while freezing a language-aware embedding layer. This allows inference even for languages not included in the paired data but present in the text-only data. Evaluation results demonstrate highly intelligible zero-shot TTS with a character error rate of less than 12% for an unseen language. △ Less

Submitted 27 May, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

Comments: To appear in IJCAI 2023

arXiv:2301.09099 [pdf, ps, other]

Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case Study

Authors: Massa Baali, Tomoki Hayashi, Hamdy Mubarak, Soumi Maiti, Shinji Watanabe, Wassim El-Hajj, Ahmed Ali

Abstract: Several high-resource Text to Speech (TTS) systems currently produce natural, well-established human-like speech. In contrast, low-resource languages, including Arabic, have very limited TTS systems due to the lack of resources. We propose a fully unsupervised method for building TTS, including automatic data selection and pre-training/fine-tuning strategies for TTS training, using broadcast news… ▽ More Several high-resource Text to Speech (TTS) systems currently produce natural, well-established human-like speech. In contrast, low-resource languages, including Arabic, have very limited TTS systems due to the lack of resources. We propose a fully unsupervised method for building TTS, including automatic data selection and pre-training/fine-tuning strategies for TTS training, using broadcast news as a case study. We show how careful selection of data, yet smaller amounts, can improve the efficiency of TTS system in generating more natural speech than a system trained on a bigger dataset. We adopt to propose different approaches for the: 1) data: we applied automatic annotations using DNSMOS, automatic vowelization, and automatic speech recognition (ASR) for fixing transcriptions' errors; 2) model: we used transfer learning from high-resource language in TTS model and fine-tuned it with one hour broadcast recording then we used this model to guide a FastSpeech2-based Conformer model for duration. Our objective evaluation shows 3.9% character error rate (CER), while the groundtruth has 1.3% CER. As for the subjective evaluation, where 1 is bad and 5 is excellent, our FastSpeech2-based Conformer model achieved a mean opinion score (MOS) of 4.4 for intelligibility and 4.2 for naturalness, where many annotators recognized the voice of the broadcaster, which proves the effectiveness of our proposed unsupervised method. △ Less

Submitted 26 January, 2023; v1 submitted 22 January, 2023; originally announced January 2023.

arXiv:2212.04559 [pdf, other]

SpeechLMScore: Evaluating speech generation using speech language model

Authors: Soumi Maiti, Yifan Peng, Takaaki Saeki, Shinji Watanabe

Abstract: While human evaluation is the most reliable metric for evaluating speech generation systems, it is generally costly and time-consuming. Previous studies on automatic speech quality assessment address the problem by predicting human evaluation scores with machine learning models. However, they rely on supervised learning and thus suffer from high annotation costs and domain-shift problems. We propo… ▽ More While human evaluation is the most reliable metric for evaluating speech generation systems, it is generally costly and time-consuming. Previous studies on automatic speech quality assessment address the problem by predicting human evaluation scores with machine learning models. However, they rely on supervised learning and thus suffer from high annotation costs and domain-shift problems. We propose SpeechLMScore, an unsupervised metric to evaluate generated speech using a speech-language model. SpeechLMScore computes the average log-probability of a speech signal by mapping it into discrete tokens and measures the average probability of generating the sequence of tokens. Therefore, it does not require human annotation and is a highly scalable framework. Evaluation results demonstrate that the proposed metric shows a promising correlation with human evaluation scores on different speech generation tasks including voice conversion, text-to-speech, and speech enhancement. △ Less

Submitted 8 December, 2022; originally announced December 2022.

arXiv:2212.03210 [pdf, other]

doi 10.21468/SciPostPhys.15.4.139

Isolated flat bands in 2D lattices based on a novel path-exchange symmetry

Authors: Jun-Hyung Bae, Tigran Sedrakyan, Saurabh Maiti

Abstract: The increased ability to engineer two-dimensional (2D) systems, either using materials, photonic lattices, or cold atoms, has led to the search for 2D structures with interesting properties. One such property is the presence of flat bands. Typically, the presence of these requires long-ranged hoppings, fine-tuning of nearest neighbor hoppings, or breaking time-reversal symmetry by using a staggere… ▽ More The increased ability to engineer two-dimensional (2D) systems, either using materials, photonic lattices, or cold atoms, has led to the search for 2D structures with interesting properties. One such property is the presence of flat bands. Typically, the presence of these requires long-ranged hoppings, fine-tuning of nearest neighbor hoppings, or breaking time-reversal symmetry by using a staggered flux distribution in the unit cell. We provide a prescription based on carrying out projections from a parent system to generate different flat band systems. We identify the conditions for maintaining the flatness and identify a path-exchange symmetry in such systems that cause the flat band to be degenerate with the other dispersive ones. Breaking this symmetry leads to lifting the degeneracy while still preserving the flatness of the band. This technique does not require changing the topology nor breaking time-reversal symmetry as was suggested earlier in the literature. The prescription also eliminates the need for any fine-tuning. Moreover, it is shown that the subsequent projected systems inherit the precise fine-tuning conditions that were discussed in the literature for similar systems, in order to have and isolate a flat band. As examples, we demonstrate the use of our prescription to arrive at the flat band conditions for popular systems like the Kagome, the Lieb, and the Dice lattices. Finally, we are also able to show that a flat band exists in a recently proposed chiral spin-liquid state of the Kagome lattice only if it is associated with a gauge field that produces a flux modulation of the Chern-Simons type. △ Less

Submitted 27 January, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

Comments: 37 pages, 20 figures, References are updated and an additional section on flux-attachment to lattice is also included

Journal ref: SciPost Phys. 15, 139 (2023)

arXiv:2209.03957 [pdf, other]

An Unified Statistical Procedure to Analyse Irreversible Thermal Curves

Authors: Jhimli Bhattacharyya, Gopinatha Suresh Kumar, Souvik Maiti, Daisuke Miyoshi, Sanjay Chaudhuri

Abstract: The phenomenon of hysteresis is commonly observed in many UV thermal experiments involving unmodified or modified nucleic acids. In presence of hysteresis, the thermal curves are irreversible and demand a significant effort to produce the reaction-specific kinetic and thermodynamic parameters. In this article, we describe a unified statistical procedure to analyze such thermal curves. Our method a… ▽ More The phenomenon of hysteresis is commonly observed in many UV thermal experiments involving unmodified or modified nucleic acids. In presence of hysteresis, the thermal curves are irreversible and demand a significant effort to produce the reaction-specific kinetic and thermodynamic parameters. In this article, we describe a unified statistical procedure to analyze such thermal curves. Our method applies to experiments with intramolecular as well as intermolecular reactions. More specifically, the proposed method allows one to handle the thermal curves for the formation of duplexes, triplexes, and various quadruplexes in exactly the same way. The proposed method uses a local polynomial regression for finding the smoothed thermal curves and calculating their slopes. This method is more flexible and easy to implement than the least squares polynomial smoothing which is currently almost universally used for such purposes. Full analyses of the curves including computation of kinetic and thermodynamic parameters can be done using freely available statistical software. In the end, we illustrate our method by analyzing irreversible curves encountered in the formations of a G-quadruplex and an LNA-modified parallel duplex. △ Less

Submitted 2 September, 2022; originally announced September 2022.

arXiv:2208.05123 [pdf, other]

doi 10.1134/S1063776122100077

Collective spin modes in Fermi liquids with spin-orbit coupling

Authors: Dmitrii L. Maslov, Abhishek Kumar, Saurabh Maiti

Abstract: A combination of spin-orbit coupling and electron-electron interaction gives rise to a new type of collective spin modes, which correspond to oscillations of magnetization even in the absence of the external magnetic field. We review recent progress in theoretical understanding and experimental observation of such modes, focusing on three examples of real-life systems: a two-dimensional electron g… ▽ More A combination of spin-orbit coupling and electron-electron interaction gives rise to a new type of collective spin modes, which correspond to oscillations of magnetization even in the absence of the external magnetic field. We review recent progress in theoretical understanding and experimental observation of such modes, focusing on three examples of real-life systems: a two-dimensional electron gas with Rashba and/or Dresselhaus spin-orbit coupling, graphene with proximity-induced spin-orbit coupling, and the Dirac state on the surface of a three-dimensional topological insulator. This paper is dedicated to the 95th birthday of Professor Emmanuel I. Rashba. △ Less

Submitted 31 October, 2022; v1 submitted 9 August, 2022; originally announced August 2022.

Comments: This short review is dedicated to the 95th birthday of Prof. Emmanuel I. Rashba. 26 pages and 17 figures

Journal ref: JETP 135, 549-574 (2022)

arXiv:2207.03308 [pdf, ps, other]

doi 10.1140/epjp/s13360-022-03016-8

Transport characteristics of a $\mathcal{PT}$-symmetric non-Hermitian system: Effect of environmental interaction

Authors: Sudin Ganguly, Souvik Roy, Santanu K. Maiti

Abstract: The environmental influence is inevitable but often ignored in the study of electronic transport properties of small-scale systems. Such an environment-mediated interaction can generally be described by a parity-time symmetric non-Hermitian system with a balanced distribution of physical gain and loss. It is quite known in the literature that along with the conventional junction current, another c… ▽ More The environmental influence is inevitable but often ignored in the study of electronic transport properties of small-scale systems. Such an environment-mediated interaction can generally be described by a parity-time symmetric non-Hermitian system with a balanced distribution of physical gain and loss. It is quite known in the literature that along with the conventional junction current, another current called bias-driven circular current can be established in a loop geometry depending upon the junction configuration. This current, further, induces a strong magnetic field that can even reach to few Tesla. What will happen to these quantities when the system interacts with its surrounding environment? Would it exhibit a detrimental response? We address such issues considering a two-terminal ring geometry where the junction setup is described within a tight-binding framework. All the transport quantities are evaluated using the standard Green's function formalism based on the Landauer-Büttiker approach. △ Less

Submitted 7 July, 2022; originally announced July 2022.

Comments: 12 pages, 9 figures

Journal ref: The European Physical Journal Plus 137, 780 (2022)

arXiv:2206.06156 [pdf]

Advanced Quantitative Techniques to Solve Center of Gravity Problem in Supply Chain

Authors: Brian Houck, Chetan Sampat, Srijit Maiti, Shivam S, Anurag Vaishistha, Sumit Banerjee

Abstract: Activities involving transformation of raw materials, various resources and components into final products and also delivering it to the end customer incur a significant cost during the selection of location of a warehouse that can be easily accessed by various actors of the supply chain. To minimize upstream and downstream transportation costs, the center of gravity (CoG) analysis method is used… ▽ More Activities involving transformation of raw materials, various resources and components into final products and also delivering it to the end customer incur a significant cost during the selection of location of a warehouse that can be easily accessed by various actors of the supply chain. To minimize upstream and downstream transportation costs, the center of gravity (CoG) analysis method is used to find the potential warehouse locations for a given demand network which have an impact on the entire supply chain network. Mixed Integer Linear Programming (MILP), an open source tool is developed for implementing CoG method along with certain service level constraints to find optimal potential locations with the least cost. In this paper, an optimization tool has been designed for a forward logistics network with several novel methods like Customer Location Selection (CLS), Customer Packets along with other business heuristics that optimize and enhance the existing MILP to get the optimal solutions with low computational cost and runtime. Finally, recommending an alternative network of facilities which reduces overall costs compared to the existing network. An user interface has also been developed to make a user friendly interaction with the model. We can conclude that this model can significantly help companies reduce costs during the logistics network design. △ Less

Submitted 9 June, 2022; originally announced June 2022.

Comments: 7 pages, 3 figures, 2 tables

Showing 1–50 of 277 results for author: Maiti, S