(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 277 results for author: Maiti, S

.
  1. arXiv:2407.05830  [pdf, ps, other

    cond-mat.mes-hall

    Super-ballistic transport in an open quantum ring

    Authors: Moumita Patra, Bijay Kumar Agarwalla, Santanu K. Maiti

    Abstract: When the degeneracies of the ring-Hamiltonian are removed by the asymmetric ring-to-electrodes configuration for an open quantum ring (OQR), the overall junction transmission function exhibits fano-type antiresonance, resulting a net circular current appears within the channel, that is the ring around the degenerate energy levels of the ring-Hamiltonian. We investigate the system size scaling prop… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 9 pages, 10 figures

  2. arXiv:2407.00837  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    Towards Robust Speech Representation Learning for Thousands of Languages

    Authors: William Chen, Wangyou Zhang, Yifan Peng, Xinjian Li, Jinchuan Tian, Jiatong Shi, Xuankai Chang, Soumi Maiti, Karen Livescu, Shinji Watanabe

    Abstract: Self-supervised learning (SSL) has helped extend speech technologies to more languages by reducing the need for labeled data. However, models are still far from supporting the world's 7000+ languages. We propose XEUS, a Cross-lingual Encoder for Universal Speech, trained on over 1 million hours of data across 4057 languages, extending the language coverage of SSL models 4-fold. We combine 1 millio… ▽ More

    Submitted 2 July, 2024; v1 submitted 30 June, 2024; originally announced July 2024.

    Comments: Updated affiliations; 20 pages

  3. arXiv:2406.08219  [pdf, ps, other

    cond-mat.mes-hall cond-mat.dis-nn

    Impact of environmental interaction on bias induced circular current in a ring nanojunction

    Authors: Moumita Mondal, Santanu K. Maiti

    Abstract: The specific role of environmental interaction on bias driven circular current in a ring nanojunction is explored, for the first time to the best of our concern, within a tight-binding framework based on wave-guide theory. The environmental interaction is implemented through disorder in backbone sites where these sites are directly coupled to parent lattice sites of the ring via single bonds. In a… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 7 pages, 9 figures (comments are welcome)

  4. arXiv:2405.13619  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Drastic modification in thermal conductivity of TiCoSb Half-Heusler alloy: Phonon engineering by lattice softening and ionic polarization

    Authors: S. Mahakal, Avijit Jana, Diptasikha Das, Nabakumar Rana, Pallabi Sardar, Aritra Banerjee, Shamima Hussain, Santanu K. Maiti, K. Malik

    Abstract: A drastic variation in thermal conductivity (\k{appa}) for synthesized samples (TiCoSb1+x, x=0.0, 0.01, 0.02, 0.03, 0.04, and 0.06) is observed and ~47% reduction in \k{appa} is reported for TiCoSb1.02 sample. In depth structural analysis is performed, employing mixed-phase Rietveld refinement technique. Embedded phases and vacancy are analyzed from X-ray diffraction (XRD) and Scanning electron mi… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Main article (17 pages, 10 figures), Supplemental article (5 pages, 7 figures), Comments are welcome

  5. arXiv:2403.01916  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Enhancing thermoelectric performance of 2D Janus ISbTe by strain engineering: A first principle study

    Authors: Anuja Kumari, Abhinav Nag, Santanu K. Maiti, Jagdish Kumar

    Abstract: Recent developments in the 2D materials laid emphasis on finding the materials with robust properties for variety of applications including the energy harvesting. The recent discovery of Janus monolayers with broken symmetry has opened up new options for engineering the properties of 2D layered materials. Present study focuses on enhancing thermoelectric properties of 2H-ISbTe 2D Janus monolayer.… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 18 pages, 8 figures, Comments are Welcome

  6. arXiv:2402.16021  [pdf, other

    cs.CL cs.AI cs.CV eess.AS

    TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages

    Authors: Minsu Kim, Jee-weon Jung, Hyeongseop Rha, Soumi Maiti, Siddhant Arora, Xuankai Chang, Shinji Watanabe, Yong Man Ro

    Abstract: The capability to jointly process multi-modal information is becoming an essential task. However, the limited number of paired multi-modal data and the large computational requirements in multi-modal learning hinder the development. We propose a novel Tri-Modal Translation (TMT) model that translates between arbitrary modalities spanning speech, image, and text. We introduce a novel viewpoint, whe… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  7. arXiv:2402.05654  [pdf, ps, other

    cond-mat.mes-hall cond-mat.dis-nn

    Bias induced circular current in a loop nanojunction with AAH modulation: Role of hopping dimerization

    Authors: Moumita Mondal, Santanu K. Maiti

    Abstract: In this work, we put forward, for the first time, the interplay between correlated disorder and hopping dimerization on bias driven circular current in a loop conductor that is clamped between two electrodes. The correlated disorder is introduced in site energies of the ring in the form of Aubry-André-Harper (AAH) model. Simulating the quantum system within a tight-binding framework all the result… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 6 pages, 8 figures (comments are welcome)

  8. arXiv:2401.18045  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition

    Authors: Yihan Wu, Soumi Maiti, Yifan Peng, Wangyou Zhang, Chenda Li, Yuyue Wang, Xihua Wang, Shinji Watanabe, Ruihua Song

    Abstract: Recent advancements in language models have significantly enhanced performance in multiple speech-related tasks. Existing speech language models typically utilize task-dependent prompt tokens to unify various speech tasks in a single model. However, this design omits the intrinsic connections between different speech tasks, which can potentially boost the performance of each task. In this work, we… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 11 pages, 2 figures

  9. arXiv:2401.16812  [pdf, other

    cs.SD eess.AS

    SpeechBERTScore: Reference-Aware Automatic Evaluation of Speech Generation Leveraging NLP Evaluation Metrics

    Authors: Takaaki Saeki, Soumi Maiti, Shinnosuke Takamichi, Shinji Watanabe, Hiroshi Saruwatari

    Abstract: While subjective assessments have been the gold standard for evaluating speech generation, there is a growing need for objective metrics that are highly correlated with human subjective judgments due to their cost efficiency. This paper proposes reference-aware automatic evaluation methods for speech generation inspired by evaluation metrics in natural language processing. The proposed SpeechBERTS… ▽ More

    Submitted 12 June, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted by Interspeech 2024. An extended version with Appendix. Code: https://github.com/Takaaki-Saeki/DiscreteSpeechMetrics

  10. arXiv:2401.10157  [pdf, other

    hep-lat cond-mat.str-el hep-th quant-ph

    A qubit regularization of asymptotic freedom without fine-tuning

    Authors: Sandip Maiti, Debasish Banerjee, Shailesh Chandrasekharan, Marina Krstic Marinkovic

    Abstract: Other than the commonly used Wilson's regularization of quantum field theories (QFTs), there is a growing interest in regularizations that explore lattice models with a strictly finite local Hilbert space, in anticipation of the upcoming era of quantum simulations of QFTs. A notable example is Euclidean qubit regularization, which provides a natural way to recover continuum QFTs that emerge via in… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 8 pages, 8 figures, contribution to the 40th International Symposium on Lattice Field Theory (Lattice 2023), July 31st - August 4th, 2023, Fermi National Accelerator Laboratory, Batavia, Illinois, USA

  11. arXiv:2401.01864  [pdf, other

    astro-ph.CO hep-ph hep-th

    Constraining inflationary magnetogenesis and reheating via GWs in light of PTA data

    Authors: Subhasis Maiti, Debaprasad Maity, L. Sriramkumar

    Abstract: Utilizing the bounds on primordial magnetic fields (PMFs), their contributions to secondary gravitational waves (GWs) and the results from the pulsar timing arrays (PTAs), we arrive at constraints on the epoch of reheating. We find that the combined spectral density of primary and secondary GWs (generated by the PMFs) can, in general, be described as a broken power law with five different indices.… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: 14 pages(8+6); 4 figures

  12. arXiv:2401.01683  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall physics.app-ph physics.comp-ph

    Enhanced Thermoelectric Properties of 2D Janus Ferromagnetic LaBrI with Strain-induced Valley Degeneracy

    Authors: Anuja Kumari, Abhinav Nag, Santanu K. Maiti, Jagdish Kumar

    Abstract: Since the successful synthesis of the MoSSe monolayer, which violated the out-of-plane mirror symmetry of TMDs monolayers, considerable and systematic research has been conducted on Janus monolayer materials. By systematically analyzing the LaBrI monolayer, we are able to learn more about the novel Janus material by focusing on the halogen family next to group VIA (S, Se, Te). The structural optim… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: 20 pages, 7 figures, Comments are Welcome

  13. arXiv:2312.09091  [pdf, ps, other

    math.GM

    Multiple New Important Conjectures on Equivalence to Perfect Cuboid and Euler Brick

    Authors: S Maiti

    Abstract: Nobody has discovered any perfect cuboid and there is no formula to deliver all possible Euler bricks. During investigations of famous open problems related to the perfect cuboid and Euler brick; I have found some new important conjectures on Pythagorean triples and biquadratic Diophantine equation [4] which are reduced & quivalence form for perfect cuboid and Euler brick problems. The details of… ▽ More

    Submitted 1 February, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 16 pages, 12 conjectures

    MSC Class: 11B13 ACM Class: G.2.0

  14. arXiv:2312.02778  [pdf, ps, other

    cond-mat.dis-nn cond-mat.str-el physics.comp-ph quant-ph

    Spin-dependent multiple reentrant localization in an antiferromagnetic helix with transverse electric field: Hopping dimerization-free scenario

    Authors: Sudin Ganguly, Kallol Mondal, Santanu K. Maiti

    Abstract: Reentrant localization (RL), a recently prominent phenomenon, traditionally links to the interplay of staggered correlated disorder and hopping dimerization, as indicated by prior research. Contrary to this paradigm, our present study demonstrates that hopping dimerization is not a pivotal factor in realizing RL. Considering a helical magnetic system with antiferromagnetic ordering, we uncover spi… ▽ More

    Submitted 6 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: 6 pages, 4 figures, comments are Welcome

  15. arXiv:2311.13345  [pdf, other

    cond-mat.str-el cond-mat.supr-con

    Many-body physics-induced selection rules: application to Raman spectroscopy

    Authors: Igor Benek-Lins, Saurabh Maiti

    Abstract: Spectroscopic measurements in quantum systems are subject to selection rules, usually based on space-time symmetries, that allow or disallow transitions between states. In many-body systems, in addition to the single-particle states, there emerge new ones due to collective excitations of the system. Here we demonstrate the existence of a "fragile" selection rule that emerges as a manifestation of… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: 14 pages, 4 figures; includes supplementary material

  16. arXiv:2310.20640  [pdf, other

    cond-mat.mes-hall cond-mat.supr-con

    Electronic Raman response of a superconductor across a time reversal symmetry breaking phase transition

    Authors: Surajit Sarkar, Saurabh Maiti

    Abstract: Polarization-resolved electronic Raman spectroscopy is an important experimental tool to investigate collective excitations in superconductors. In this work, we present a general theory that allows us to study the evolution of all Raman active collective modes in multiple symmetry channels across a time-reversal symmetry (TRS) breaking superconducting transition. This comprehensive approach reveal… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  17. arXiv:2310.08322  [pdf, other

    physics.plasm-ph astro-ph.EP astro-ph.HE physics.comp-ph physics.geo-ph physics.space-ph

    A 3D Kinetic Distribution that Yields Observed Plasma Density in the Inner Van Allen Belt

    Authors: Snehanshu Maiti, Harishankar Ramachandran

    Abstract: A steady-state distribution is obtained that approximately yields the observed plasma density profile of the inner Van Allen radiation belt. The model assumes a collisionless, magnetized plasma with zero electric field present. The inner Van Allen belt consists of a plasma comprising high-energy protons and relativistic electrons. The particle trajectories are obtained from the collisionless Loren… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  18. Spin-Mediated Direct Photon Scattering by Plasmons in BiTeI

    Authors: A. C. Lee, S. Sarkar, K. Du, H. -H. Kung, C. J. Won, K. Wang, S. -W. Cheong, S. Maiti, G. Blumberg

    Abstract: We use polarization resolved Raman spectroscopy to demonstrate that for a 3D giant Rashba system the bulk plasmon collective mode can directly couple to the Raman response even in the long wavelength $\mathbf q \rightarrow 0$ limit. Although conventional theory predicts the plasmon spectral weight to be suppressed as the square of its quasi-momentum and thus negligibly weak in the Raman spectra, w… ▽ More

    Submitted 18 February, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: Editors' Suggestion

    Journal ref: Phys. Rev. B 109, L041111 (2024)

  19. arXiv:2310.03757  [pdf, other

    eess.SP cs.CV cs.LG

    Enhancing Healthcare with EOG: A Novel Approach to Sleep Stage Classification

    Authors: Suvadeep Maiti, Shivam Kumar Sharma, Raju S. Bapi

    Abstract: We introduce an innovative approach to automated sleep stage classification using EOG signals, addressing the discomfort and impracticality associated with EEG data acquisition. In addition, it is important to note that this approach is untapped in the field, highlighting its potential for novel insights and contributions. Our proposed SE-Resnet-Transformer model provides an accurate classificatio… ▽ More

    Submitted 25 September, 2023; originally announced October 2023.

  20. arXiv:2310.00706  [pdf, other

    cs.CL cs.SD eess.AS

    Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech

    Authors: Dareen Alharthi, Roshan Sharma, Hira Dhamyal, Soumi Maiti, Bhiksha Raj, Rita Singh

    Abstract: Modern speech synthesis systems have improved significantly, with synthetic speech being indistinguishable from real speech. However, efficient and holistic evaluation of synthetic speech still remains a significant challenge. Human evaluation using Mean Opinion Score (MOS) is ideal, but inefficient due to high costs. Therefore, researchers have developed auxiliary automatic metrics like Word Erro… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

  21. arXiv:2309.15800  [pdf, other

    cs.CL cs.SD eess.AS

    Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study

    Authors: Xuankai Chang, Brian Yan, Kwanghee Choi, Jeeweon Jung, Yichen Lu, Soumi Maiti, Roshan Sharma, Jiatong Shi, Jinchuan Tian, Shinji Watanabe, Yuya Fujita, Takashi Maekaku, Pengcheng Guo, Yao-Fei Cheng, Pavel Denisov, Kohei Saijo, Hsiu-Hsuan Wang

    Abstract: Speech signals, typically sampled at rates in the tens of thousands per second, contain redundancies, evoking inefficiencies in sequence modeling. High-dimensional speech features such as spectrograms are often used as the input for the subsequent model. However, they can still be redundant. Recent investigations proposed the use of discrete speech units derived from self-supervised learning repre… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Submitted to IEEE ICASSP 2024

  22. arXiv:2309.15317  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    Joint Prediction and Denoising for Large-scale Multilingual Self-supervised Learning

    Authors: William Chen, Jiatong Shi, Brian Yan, Dan Berrebbi, Wangyou Zhang, Yifan Peng, Xuankai Chang, Soumi Maiti, Shinji Watanabe

    Abstract: Multilingual self-supervised learning (SSL) has often lagged behind state-of-the-art (SOTA) methods due to the expenses and complexity required to handle many languages. This further harms the reproducibility of SSL, which is already limited to few research groups due to its resource usage. We show that more powerful techniques can actually lead to more efficient pre-training, opening SSL to more… ▽ More

    Submitted 27 September, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: Accepted to ASRU 2023

  23. arXiv:2309.13876  [pdf, other

    cs.CL cs.SD eess.AS

    Reproducing Whisper-Style Training Using an Open-Source Toolkit and Publicly Available Data

    Authors: Yifan Peng, Jinchuan Tian, Brian Yan, Dan Berrebbi, Xuankai Chang, Xinjian Li, Jiatong Shi, Siddhant Arora, William Chen, Roshan Sharma, Wangyou Zhang, Yui Sudo, Muhammad Shakeel, Jee-weon Jung, Soumi Maiti, Shinji Watanabe

    Abstract: Pre-training speech models on large volumes of data has achieved remarkable success. OpenAI Whisper is a multilingual multitask model trained on 680k hours of supervised speech data. It generalizes well to various speech recognition and translation benchmarks even in a zero-shot setup. However, the full pipeline for developing such models (from data collection to training) is not publicly accessib… ▽ More

    Submitted 24 October, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted at ASRU 2023

  24. arXiv:2309.08531  [pdf, other

    cs.CV cs.CL eess.AS eess.IV

    Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal Tokens

    Authors: Minsu Kim, Jeongsoo Choi, Soumi Maiti, Jeong Hun Yeo, Shinji Watanabe, Yong Man Ro

    Abstract: In this paper, we propose methods to build a powerful and efficient Image-to-Speech captioning (Im2Sp) model. To this end, we start with importing the rich knowledge related to image comprehension and language modeling from a large-scale pre-trained vision-language model into Im2Sp. We set the output of the proposed Im2Sp as discretized speech units, i.e., the quantized speech features of a self-s… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  25. arXiv:2309.07937  [pdf, other

    eess.AS cs.LG cs.SD

    Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks

    Authors: Soumi Maiti, Yifan Peng, Shukjae Choi, Jee-weon Jung, Xuankai Chang, Shinji Watanabe

    Abstract: We propose a decoder-only language model, VoxtLM, that can perform four tasks: speech recognition, speech synthesis, text generation, and speech continuation. VoxtLM integrates text vocabulary with discrete speech tokens from self-supervised speech features and uses special tokens to enable multitask learning. Compared to a single-task model, VoxtLM exhibits a significant improvement in speech syn… ▽ More

    Submitted 24 January, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

  26. arXiv:2309.07156  [pdf, other

    eess.SP cs.LG

    Transparency in Sleep Staging: Deep Learning Method for EEG Sleep Stage Classification with Model Interpretability

    Authors: Shivam Sharma, Suvadeep Maiti, S. Mythirayee, Srijithesh Rajendran, Raju Surampudi Bapi

    Abstract: Automated Sleep stage classification using raw single channel EEG is a critical tool for sleep quality assessment and disorder diagnosis. However, modelling the complexity and variability inherent in this signal is a challenging task, limiting their practicality and effectiveness in clinical settings. To mitigate these challenges, this study presents an end-to-end deep learning (DL) model which in… ▽ More

    Submitted 14 January, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

    Comments: 12 pages, 9 figures, Under review at IEEE Journal of Biomedical and Health Informatics

  27. arXiv:2308.02784  [pdf, other

    cs.CV cs.AI cs.HC cs.LG

    Semi-supervised Contrastive Regression for Estimation of Eye Gaze

    Authors: Somsukla Maiti, Akshansh Gupta

    Abstract: With the escalated demand of human-machine interfaces for intelligent systems, development of gaze controlled system have become a necessity. Gaze, being the non-intrusive form of human interaction, is one of the best suited approach. Appearance based deep learning models are the most widely used for gaze estimation. But the performance of these models is entirely influenced by the size of labeled… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: Accepted for International Conference on Pattern Recognition and Machine Intelligence 2023 (PReMI 2023)

    Report number: Paper 057, https://www.isical.ac.in/~premi23/List_of_Accepted_Papers.pdf

  28. arXiv:2307.06117  [pdf, other

    hep-lat cond-mat.str-el hep-th quant-ph

    A qubit regularization of asymptotic freedom at the BKT transition without fine-tuning

    Authors: Sandip Maiti, Debasish Banerjee, Shailesh Chandrasekharan, Marina K. Marinkovic

    Abstract: We propose a two-dimensional hard core loop-gas model as a way to regularize the asymptotically free massive continuum quantum field theory that emerges at the BKT transition. Without fine-tuning, our model can reproduce the universal step-scaling function of the classical lattice XY model in the massive phase as we approach the phase transition. This is achieved by lowering the fugacity of Fock-v… ▽ More

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: 21 pages(6+15), 12 figures

  29. arXiv:2307.03148  [pdf, other

    cs.CY math.NA

    On the Computation of Accessibility Provided by Shared Mobility

    Authors: Severin Diepolder, Andrea Araldo, Tarek Chouaki, Santa Maiti, Sebastian Hörl, Constantinos Antoniou

    Abstract: Shared Mobility Services (SMS), e.g., Demand-Responsive Transit (DRT) or ride-sharing, can improve mobility in low-density areas, often poorly served by conventional Public Transport (PT). Such improvement is mostly quantified via basic performance indicators, like wait or travel time. However, accessibility indicators, measuring the ease of reaching surrounding opportunities (e.g., jobs, schools,… ▽ More

    Submitted 12 July, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    ACM Class: J.2

    Journal ref: hEART 2023: 11th Symposium of the European Association for Research in Transportation

  30. arXiv:2306.14452  [pdf, ps, other

    cond-mat.mes-hall cond-mat.dis-nn cond-mat.str-el physics.comp-ph quant-ph

    Phenomenon of multiple reentrant localization in a double-stranded helix with transverse electric field

    Authors: Sudin Ganguly, Suparna Sarkar, Kallol Mondal, Santanu K. Maiti

    Abstract: The present work explores the potential for observing multiple reentrant localization behavior in a double-stranded helical (DSH) system, extending beyond the conventional nearest-neighbor hopping interaction. The DSH system is considered to have hopping dimerization in each strand, while also being subjected to a transverse electric field. The inclusion of an electric field serves the dual purpos… ▽ More

    Submitted 10 July, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: 7 pages, 6 figures, comments are Welcome

  31. arXiv:2306.11240  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Spin-orbit interaction enabled electronic Raman scattering from charge collective modes

    Authors: Surajit Sarkar, Alexander Lee, Girsh Blumberg, Saurabh Maiti

    Abstract: Electronic Raman scattering in the fully symmetric channel couples to the charge excitations in the system, including the plasmons. However, the plasmon response has a spectral weight of $\sim q^2$, where $q$, the momentum transferred by light, is small. In this work, we show that in inversion symmetry broken systems where Rashba type spin-orbit coupling affects the states at the Fermi energy (whi… ▽ More

    Submitted 13 February, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 22p with 9 figures, replaced with journal version

    Journal ref: Phys Rev B, 109, 035160 (2024)

  32. arXiv:2306.09057  [pdf, other

    cs.CR

    A Learning Assisted Method for Uncovering Power Grid Generation and Distribution System Vulnerabilities

    Authors: Suman Maiti, Anjana B, Sunandan Adhikary, Ipsita Koley, Soumyajit Dey

    Abstract: Intelligent attackers can suitably tamper sensor/actuator data at various Smart grid surfaces causing intentional power oscillations, which if left undetected, can lead to voltage disruptions. We develop a novel combination of formal methods and machine learning tools that learns power system dynamics with the objective of generating unsafe yet stealthy false data based attack sequences. We enable… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

  33. arXiv:2306.06672  [pdf, other

    cs.CL cs.AI eess.AS

    Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute

    Authors: William Chen, Xuankai Chang, Yifan Peng, Zhaoheng Ni, Soumi Maiti, Shinji Watanabe

    Abstract: Self-supervised learning (SSL) has led to great strides in speech processing. However, the resources needed to train these models has become prohibitively large as they continue to scale. Currently, only a few groups with substantial resources are capable of creating SSL models, which harms reproducibility. In this work, we optimize HuBERT SSL to fit in academic constraints. We reproduce HuBERT in… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: Accepted at INTERSPEECH 2023

  34. arXiv:2305.15303  [pdf

    cond-mat.mtrl-sci cond-mat.dis-nn cond-mat.mes-hall

    Transport phenomena of TiCoSb: Defects induced modification in structure and density of states

    Authors: S. Mahakal, Diptasikha Das, Pintu Singha, Aritra Banerjee, S. Chatterjee, Santanu K. Maiti, S. Assa Aravindh, K. Malik

    Abstract: TiCoSb1+x (x=0.0, 0.01, 0.02, 0.03, 0.04, 0.06) samples have been synthesized, employing solid state reaction method followed by arc menting. Theoretical calculations, using Density Functional Theory (DFT) have been performed to estimate band structure and density of states (DOS). Further, energitic calculations, using first principle have been carried out to reveal the formation energy for vacanc… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 12 pages, 12 figures (comments are welcome)

  35. arXiv:2305.05582  [pdf, ps, other

    cond-mat.mes-hall cond-mat.dis-nn physics.comp-ph quant-ph

    Thermoelectric phenomena in an antiferromagnetic helix: Role of electric field

    Authors: Kallol Mondal, Sudin Ganguly, Santanu K. Maiti

    Abstract: The charge and spin-dependent thermoelectric responses are investigated on a single-helical molecule possessing a collinear antiferromagnetic spin arrangement with zero net magnetization in the presence of a transverse electric field. Both the short and long-range hopping scenarios are considered, which mimic biological systems like single-stranded DNA and $αあるふぁ$-protein molecules. A non-equilibrium… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 11 pages, 8 figures, comments are Welcome

    Journal ref: Phys. Rev. B 108, 195401 (2023)

  36. arXiv:2304.12826  [pdf, ps, other

    physics.flu-dyn math-ph math.AP physics.med-ph

    Electroosmotic flow of a rheological fluid in non-uniform micro-vessels

    Authors: S. Maiti, S. K. Pandey, J. C. Misra

    Abstract: The paper deals with a theoretical study of electrokinetic flow of a rheological Herschel-Bulkley fluid through a cylindrical tube of variable cross-section. The concern of this study is to analyze combined pressure-driven and electroosmotic flow of Herschel-Bulkley fluid. The wall potential is considered to vary slowly and periodically along the axis of the tube. With reference to flow in the mic… ▽ More

    Submitted 15 February, 2023; originally announced April 2023.

    Comments: 44 pages and 28 figures

    Journal ref: Journal of Engineering Mathematics 135, 8 (2022)

  37. arXiv:2304.08081  [pdf, ps, other

    cond-mat.mes-hall cond-mat.dis-nn cond-mat.mtrl-sci cond-mat.soft

    Thermal signature of helical molecule: Beyond nearest-neighbor electron hopping

    Authors: Suparna Sarkar, Santanu K. Maiti, David Laroze

    Abstract: We investigate, for the first time, the thermal signature of a single-stranded helical molecule, subjected to a transverse electric field, by analyzing electronic specific heat (ESH). Depending on the hopping of electrons, two different kinds of helical systems are considered. In one case the hopping is confined within a few neighboring lattice sites which is referred to as short-range hopping (SR… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 8 pages, 10 figures

  38. arXiv:2304.04596  [pdf, other

    cs.SD cs.CL eess.AS

    ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit

    Authors: Brian Yan, Jiatong Shi, Yun Tang, Hirofumi Inaguma, Yifan Peng, Siddharth Dalmia, Peter Polák, Patrick Fernandes, Dan Berrebbi, Tomoki Hayashi, Xiaohui Zhang, Zhaoheng Ni, Moto Hira, Soumi Maiti, Juan Pino, Shinji Watanabe

    Abstract: ESPnet-ST-v2 is a revamp of the open-source ESPnet-ST toolkit necessitated by the broadening interests of the spoken language translation community. ESPnet-ST-v2 supports 1) offline speech-to-text translation (ST), 2) simultaneous speech-to-text translation (SST), and 3) offline speech-to-speech translation (S2ST) -- each task is supported with a wide variety of approaches, differentiating ESPnet-… ▽ More

    Submitted 6 July, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: ACL 2023; System Demonstration

  39. arXiv:2303.15983  [pdf, ps, other

    cond-mat.dis-nn cond-mat.mes-hall physics.comp-ph quant-ph

    Electrical analogue of one-dimensional and quasi-one-dimensional Aubry-André-Harper lattices

    Authors: Sudin Ganguly, Santanu K. Maiti

    Abstract: The present work discusses the possibility to realize correlated disorder in electrical circuits and studies the localization phenomena in terms of two-port impedance. The correlated disorder is incorporated using the Aubry-André-Harper (AAH) model. One-dimensional and quasi-one-dimensional AAH structures are explored and directly mapped with their tight-binding analogues. Transitions from the hig… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: 6 pages, 7 figures, Comments are Welcome

    Journal ref: Scientific Reports 2023

  40. arXiv:2303.12728  [pdf, other

    cs.CV cs.LG

    LocalEyenet: Deep Attention framework for Localization of Eyes

    Authors: Somsukla Maiti, Akshansh Gupta

    Abstract: Development of human machine interface has become a necessity for modern day machines to catalyze more autonomy and more efficiency. Gaze driven human intervention is an effective and convenient option for creating an interface to alleviate human errors. Facial landmark detection is very crucial for designing a robust gaze detection system. Regression based methods capacitate good spatial localiza… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

  41. arXiv:2302.12829  [pdf, other

    cs.CL cs.SD eess.AS

    Improving Massively Multilingual ASR With Auxiliary CTC Objectives

    Authors: William Chen, Brian Yan, Jiatong Shi, Yifan Peng, Soumi Maiti, Shinji Watanabe

    Abstract: Multilingual Automatic Speech Recognition (ASR) models have extended the usability of speech technologies to a wide variety of languages. With how many languages these models have to handle, however, a key to understanding their imbalanced performance across different languages is to examine if the model actually knows which language it should transcribe. In this paper, we introduce our work on im… ▽ More

    Submitted 27 February, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: 5 pages, 1 figure, accepted at ICASSP 2023; fixed typo and URL in abstract

  42. arXiv:2302.10187  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Effect of pH on structure and surface charge of Fe$_2$O$_3$ nanoparticles synthesized at different pH conditions and correlation to antibacterial properties

    Authors: Farzana Naushin, Srishti Sen, Mukul Kumar, Hemang Bairagi, Siddhartha Maiti, Jaydeep Bhattacharya, Somaditya Sen

    Abstract: pH of a solution is the ratio of H+/OH- ions. The relative ratio of these charges may affect forming bonds during a hydrothermal synthesis by influencing electronic clouds of participant ions, which can modify the structure and hence crystallinity, strain, disorder, surface termination etc. These factors may modify physical properties including the surface charge. This work uses hematite nanoparti… ▽ More

    Submitted 19 February, 2023; originally announced February 2023.

    Comments: 35 pages, 11 figures, to be submitted to PRL

  43. arXiv:2301.12596  [pdf, other

    eess.AS cs.CL

    Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining

    Authors: Takaaki Saeki, Soumi Maiti, Xinjian Li, Shinji Watanabe, Shinnosuke Takamichi, Hiroshi Saruwatari

    Abstract: While neural text-to-speech (TTS) has achieved human-like natural synthetic speech, multilingual TTS systems are limited to resource-rich languages due to the need for paired text and studio-quality audio data. This paper proposes a method for zero-shot multilingual TTS using text-only data for the target language. The use of text-only data allows the development of TTS systems for low-resource la… ▽ More

    Submitted 27 May, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

    Comments: To appear in IJCAI 2023

  44. arXiv:2301.09099  [pdf, ps, other

    cs.CL cs.SD eess.AS

    Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case Study

    Authors: Massa Baali, Tomoki Hayashi, Hamdy Mubarak, Soumi Maiti, Shinji Watanabe, Wassim El-Hajj, Ahmed Ali

    Abstract: Several high-resource Text to Speech (TTS) systems currently produce natural, well-established human-like speech. In contrast, low-resource languages, including Arabic, have very limited TTS systems due to the lack of resources. We propose a fully unsupervised method for building TTS, including automatic data selection and pre-training/fine-tuning strategies for TTS training, using broadcast news… ▽ More

    Submitted 26 January, 2023; v1 submitted 22 January, 2023; originally announced January 2023.

  45. arXiv:2212.04559  [pdf, other

    eess.AS cs.LG cs.SD

    SpeechLMScore: Evaluating speech generation using speech language model

    Authors: Soumi Maiti, Yifan Peng, Takaaki Saeki, Shinji Watanabe

    Abstract: While human evaluation is the most reliable metric for evaluating speech generation systems, it is generally costly and time-consuming. Previous studies on automatic speech quality assessment address the problem by predicting human evaluation scores with machine learning models. However, they rely on supervised learning and thus suffer from high annotation costs and domain-shift problems. We propo… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

  46. arXiv:2212.03210  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci physics.optics

    Isolated flat bands in 2D lattices based on a novel path-exchange symmetry

    Authors: Jun-Hyung Bae, Tigran Sedrakyan, Saurabh Maiti

    Abstract: The increased ability to engineer two-dimensional (2D) systems, either using materials, photonic lattices, or cold atoms, has led to the search for 2D structures with interesting properties. One such property is the presence of flat bands. Typically, the presence of these requires long-ranged hoppings, fine-tuning of nearest neighbor hoppings, or breaking time-reversal symmetry by using a staggere… ▽ More

    Submitted 27 January, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: 37 pages, 20 figures, References are updated and an additional section on flux-attachment to lattice is also included

    Journal ref: SciPost Phys. 15, 139 (2023)

  47. arXiv:2209.03957  [pdf, other

    physics.chem-ph stat.AP stat.CO stat.ME

    An Unified Statistical Procedure to Analyse Irreversible Thermal Curves

    Authors: Jhimli Bhattacharyya, Gopinatha Suresh Kumar, Souvik Maiti, Daisuke Miyoshi, Sanjay Chaudhuri

    Abstract: The phenomenon of hysteresis is commonly observed in many UV thermal experiments involving unmodified or modified nucleic acids. In presence of hysteresis, the thermal curves are irreversible and demand a significant effort to produce the reaction-specific kinetic and thermodynamic parameters. In this article, we describe a unified statistical procedure to analyze such thermal curves. Our method a… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

  48. arXiv:2208.05123  [pdf, other

    cond-mat.str-el cond-mat.mes-hall

    Collective spin modes in Fermi liquids with spin-orbit coupling

    Authors: Dmitrii L. Maslov, Abhishek Kumar, Saurabh Maiti

    Abstract: A combination of spin-orbit coupling and electron-electron interaction gives rise to a new type of collective spin modes, which correspond to oscillations of magnetization even in the absence of the external magnetic field. We review recent progress in theoretical understanding and experimental observation of such modes, focusing on three examples of real-life systems: a two-dimensional electron g… ▽ More

    Submitted 31 October, 2022; v1 submitted 9 August, 2022; originally announced August 2022.

    Comments: This short review is dedicated to the 95th birthday of Prof. Emmanuel I. Rashba. 26 pages and 17 figures

    Journal ref: JETP 135, 549-574 (2022)

  49. arXiv:2207.03308  [pdf, ps, other

    cond-mat.mes-hall physics.comp-ph quant-ph

    Transport characteristics of a $\mathcal{PT}$-symmetric non-Hermitian system: Effect of environmental interaction

    Authors: Sudin Ganguly, Souvik Roy, Santanu K. Maiti

    Abstract: The environmental influence is inevitable but often ignored in the study of electronic transport properties of small-scale systems. Such an environment-mediated interaction can generally be described by a parity-time symmetric non-Hermitian system with a balanced distribution of physical gain and loss. It is quite known in the literature that along with the conventional junction current, another c… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

    Comments: 12 pages, 9 figures

    Journal ref: The European Physical Journal Plus 137, 780 (2022)

  50. arXiv:2206.06156  [pdf

    math.OC cs.CE

    Advanced Quantitative Techniques to Solve Center of Gravity Problem in Supply Chain

    Authors: Brian Houck, Chetan Sampat, Srijit Maiti, Shivam S, Anurag Vaishistha, Sumit Banerjee

    Abstract: Activities involving transformation of raw materials, various resources and components into final products and also delivering it to the end customer incur a significant cost during the selection of location of a warehouse that can be easily accessed by various actors of the supply chain. To minimize upstream and downstream transportation costs, the center of gravity (CoG) analysis method is used… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: 7 pages, 3 figures, 2 tables