(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–26 of 26 results for author: Choubey, P

.
  1. arXiv:2406.09878  [pdf, other

    cond-mat.supr-con

    Theory of Josephson scanning microscopy with $s$-wave tip on unconventional superconducting surface: application to Bi$_2$Sr$_2$CaCu$_2$O$_{8+δでるた}$

    Authors: Peayush Choubey, P. J. Hirschfeld

    Abstract: Josephson scanning tunneling microscopy (JSTM) is a powerful probe of the local superconducting order parameter, but studies have been largely limited to cases where superconducting sample and superconducting tip both have the same gap symmetry -- either s-wave or d-wave. It has been generally assumed that in an ideal $s$-to-$d$ JSTM experiment the critical current would vanish everywhere, as expe… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 6 pages, 4 figures

  2. arXiv:2402.15538  [pdf, other

    cs.MA cs.AI

    AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System

    Authors: Zhiwei Liu, Weiran Yao, Jianguo Zhang, Liangwei Yang, Zuxin Liu, Juntao Tan, Prafulla K. Choubey, Tian Lan, Jason Wu, Huan Wang, Shelby Heinecke, Caiming Xiong, Silvio Savarese

    Abstract: The booming success of LLMs initiates rapid development in LLM agents. Though the foundation of an LLM agent is the generative model, it is critical to devise the optimal reasoning strategies and agent architectures. Accordingly, LLM agent research advances from the simple chain-of-thought prompting to more complex ReAct and Reflection reasoning strategy; agent architecture also evolves from singl… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: preprint. Library is available at https://github.com/SalesforceAIResearch/AgentLite

  3. arXiv:2311.09458  [pdf, other

    cs.CL

    Lexical Repetitions Lead to Rote Learning: Unveiling the Impact of Lexical Overlap in Train and Test Reference Summaries

    Authors: Prafulla Kumar Choubey, Alexander R. Fabbri, Caiming Xiong, Chien-Sheng Wu

    Abstract: Ideal summarization models should generalize to novel summary-worthy content without remembering reference training summaries by rote. However, a single average performance score on the entire test set is inadequate in determining such model competencies. We propose a fine-grained evaluation protocol by partitioning a test set based on the lexical similarity of reference test summaries with traini… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023-Findings

  4. arXiv:2309.09369  [pdf, other

    cs.CL

    Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles

    Authors: Kung-Hsiang Huang, Philippe Laban, Alexander R. Fabbri, Prafulla Kumar Choubey, Shafiq Joty, Caiming Xiong, Chien-Sheng Wu

    Abstract: Previous research in multi-document news summarization has typically concentrated on collating information that all sources agree upon. However, the summarization of diverse information dispersed across multiple articles about an event remains underexplored. In this paper, we propose a new task of summarizing diverse information encountered in multiple news articles encompassing the same event. To… ▽ More

    Submitted 22 March, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

    Comments: NAACL 2024

  5. arXiv:2309.03450  [pdf, other

    cs.CL cs.AI cs.LG

    XGen-7B Technical Report

    Authors: Erik Nijkamp, Tian Xie, Hiroaki Hayashi, Bo Pang, Congying Xia, Chen Xing, Jesse Vig, Semih Yavuz, Philippe Laban, Ben Krause, Senthil Purushwalkam, Tong Niu, Wojciech Kryściński, Lidiya Murakhovs'ka, Prafulla Kumar Choubey, Alex Fabbri, Ye Liu, Rui Meng, Lifu Tu, Meghana Bhat, Chien-Sheng Wu, Silvio Savarese, Yingbo Zhou, Shafiq Joty, Caiming Xiong

    Abstract: Large Language Models (LLMs) have become ubiquitous across various domains, transforming the way we interact with information and conduct research. However, most high-performing LLMs remain confined behind proprietary walls, hindering scientific progress. Most open-source LLMs, on the other hand, are limited in their ability to support longer sequence lengths, which is a key requirement for many t… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  6. arXiv:2211.06196  [pdf, other

    cs.CL

    Improving Factual Consistency in Summarization with Compression-Based Post-Editing

    Authors: Alexander R. Fabbri, Prafulla Kumar Choubey, Jesse Vig, Chien-Sheng Wu, Caiming Xiong

    Abstract: State-of-the-art summarization models still struggle to be factually consistent with the input text. A model-agnostic way to address this problem is post-editing the generated summaries. However, existing approaches typically fail to remove entity errors if a suitable input entity replacement is not available or may insert erroneous content. In our work, we focus on removing extrinsic entity error… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: EMNLP 2022

  7. arXiv:2210.12619  [pdf, other

    cs.CL

    Conformal Predictor for Improving Zero-shot Text Classification Efficiency

    Authors: Prafulla Kumar Choubey, Yu Bai, Chien-Sheng Wu, Wenhao Liu, Nazneen Rajani

    Abstract: Pre-trained language models (PLMs) have been shown effective for zero-shot (0shot) text classification. 0shot models based on natural language inference (NLI) and next sentence prediction (NSP) employ cross-encoder architecture and infer by making a forward pass through the model for each label-text pair separately. This increases the computational cost to make inferences linearly in the number of… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

  8. arXiv:2210.12587  [pdf, other

    cs.CL

    Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuning

    Authors: Xiangyu Peng, Chen Xing, Prafulla Kumar Choubey, Chien-Sheng Wu, Caiming Xiong

    Abstract: Prompt tuning approaches, which learn task-specific soft prompts for a downstream task conditioning on frozen pre-trained models, have attracted growing interest due to its parameter efficiency. With large language models and sufficient training data, prompt tuning performs comparably to full-model tuning. However, with limited training samples in few-shot settings, prompt tuning fails to match th… ▽ More

    Submitted 1 March, 2023; v1 submitted 22 October, 2022; originally announced October 2022.

  9. arXiv:2210.11787  [pdf, other

    cs.CL

    Modeling Document-level Temporal Structures for Building Temporal Dependency Graphs

    Authors: Prafulla Kumar Choubey, Ruihong Huang

    Abstract: We propose to leverage news discourse profiling to model document-level temporal structures for building temporal dependency graphs. Our key observation is that the functional roles of sentences used for profiling news discourse signify different time frames relevant to a news story and can, therefore, help to recover the global temporal structure of a document. Our analyses and experiments with t… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: AACL 2022

  10. arXiv:2110.07280  [pdf, other

    cs.CL

    P-Adapters: Robustly Extracting Factual Information from Language Models with Diverse Prompts

    Authors: Benjamin Newman, Prafulla Kumar Choubey, Nazneen Rajani

    Abstract: Recent work (e.g. LAMA (Petroni et al., 2019)) has found that the quality of the factual information extracted from Large Language Models (LLMs) depends on the prompts used to query them. This inconsistency is problematic because different users will query LLMs for the same information using different wording, but should receive the same, accurate responses regardless. In this work we aim to addre… ▽ More

    Submitted 19 April, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: 15 pages, 6 figures, 4 tables

  11. arXiv:2110.07166  [pdf, other

    cs.CL

    CaPE: Contrastive Parameter Ensembling for Reducing Hallucination in Abstractive Summarization

    Authors: Prafulla Kumar Choubey, Alexander R. Fabbri, Jesse Vig, Chien-Sheng Wu, Wenhao Liu, Nazneen Fatema Rajani

    Abstract: Hallucination is a known issue for neural abstractive summarization models. Recent work suggests that the degree of hallucination may depend on errors in the training data. In this work, we propose a new method called Contrastive Parameter Ensembling (CaPE) to use training data more effectively, utilizing variations in noise in training samples to reduce hallucination. We first select clean and no… ▽ More

    Submitted 20 May, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

  12. Electronic Theory for Scanning Tunneling Microscopy Spectra in Infinite-Layer Nickelate Superconductors

    Authors: Peayush Choubey, Ilya M. Eremin

    Abstract: Recent scanning tunneling microscopy (STM) observation of U-shaped and V-shaped spectra (and their mixture) in superconducting Nd$_{1-x}$Sr$_x$NiO$_2$ thin films has been interpreted as presence of two distinct gap symmetries in this nickelate superconductor [Gu et al., Nat. Comm. 11, 6027 (2020)]. Here, using a two-band model of nickelates capturing dominant contributions from Ni-$3d_{x^2-y^2}$ a… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

    Comments: 10 pages, 8 Figures

    Journal ref: Phys. Rev. B 104, 144504 (2021)

  13. arXiv:2107.09967  [pdf

    cond-mat.supr-con

    Direct Visualization of a Static Incommensurate Antiferromagnetic Order by Suppressing the Superconducting Phase Coherence in Fe-doped Bi2Sr2CaCu2O8+delta

    Authors: Siyuan Wan, Huazhou Li, Peayush Choubey, Qiangqiang Gu, Han Li, Huan Yang, Ilya M. Eremin, G. D. Gu, Hai-Hu Wen

    Abstract: In cuprate superconductors, due to strong electronic correlations, there are multiple intertwined orders which either coexist or compete with superconductivity. Among them the antiferromagnetic (AF) order is the most prominent one. In the region where superconductivity sets in, the long-range AF order is destroyed. Yet the residual short-range AF fluctuations are present up to a much higher doping… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

    Comments: Main text with 4 figures and 5 extended figures. Supplementary with 6 figures. Total 34 pages

    Journal ref: PNAS 118, 51 e2115317118 (2021)

  14. arXiv:2105.06518  [pdf

    cond-mat.supr-con

    Scattering Interference Signature of a Pair Density Wave State in the Cuprate Pseudogap Phase

    Authors: Shuqiu Wang, Peayush Choubey, Yi Xue Chong, Weijiong Chen, Wangping Ren, H. Eisaki, Shin-ichi Uchida, Peter J. Hirschfeld, J. C. Séamus Davis

    Abstract: An unidentified quantum fluid designated as the pseudogap (PG) phase is produced by electron-density depletion in the CuO$_2$ antiferromagnetic insulator. Current theories suggest that the PG phase may be a pair density wave (PDW) state characterized by a spatially modulating density of electron pairs. Such a state should exhibit a periodically modulating energy gap $Δでるた_P(\pmb r)$ in real-space, an… ▽ More

    Submitted 26 October, 2021; v1 submitted 13 May, 2021; originally announced May 2021.

    Comments: 30 pages, 5 figures and Supplementary Information

    Journal ref: Nature Communications, 12, 6087 (2021)

  15. arXiv:2104.07695  [pdf, other

    cs.CL

    Improving Gender Translation Accuracy with Filtered Self-Training

    Authors: Prafulla Kumar Choubey, Anna Currey, Prashant Mathur, Georgiana Dinu

    Abstract: Targeted evaluations have found that machine translation systems often output incorrect gender, even when the gender is clear from context. Furthermore, these incorrectly gendered translations have the potential to reflect or amplify social biases. We propose a gender-filtered self-training technique to improve gender translation accuracy on unambiguously gendered inputs. This approach uses a sour… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

  16. arXiv:2104.07367  [pdf, other

    cs.CL cs.SI

    BERT based Transformers lead the way in Extraction of Health Information from Social Media

    Authors: Sidharth R, Abhiraj Tiwari, Parthivi Choubey, Saisha Kashyap, Sahil Khose, Kumud Lakara, Nishesh Singh, Ujjwal Verma

    Abstract: This paper describes our submissions for the Social Media Mining for Health (SMM4H)2021 shared tasks. We participated in 2 tasks:(1) Classification, extraction and normalization of adverse drug effect (ADE) mentions in English tweets (Task-1) and (2) Classification of COVID-19 tweets containing symptoms(Task-6). Our approach for the first task uses the language representation model RoBERTa with a… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

    Comments: 6 pages, 1 figure

  17. arXiv:2002.11654  [pdf

    cond-mat.supr-con

    Atomic-scale Electronic Structure of the Cuprate Pair Density Wave State Coexisting with Superconductivity

    Authors: Peayush Choubey, Sang Hyun Joo, K. Fujita, Zengyi Du, S. D. Edkins, M. H. Hamidian, H. Eisaki, S. Uchida, A. P. Mackenzie, Jinho Lee, J. C. Séamus Davis, P. J. Hirschfeld

    Abstract: The defining characteristic of hole-doped cuprates is $d$-wave high temperature superconductivity. However, intense theoretical interest is now focused on whether a pair density wave state (PDW) could coexist with cuprate superconductivity (D. F. Agterberg et al., Annual Review of Condensed Matter Physics 11, 231 (2020)). Here, we use a strong-coupling mean-field theory of cuprates, to model the a… ▽ More

    Submitted 27 April, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

  18. arXiv:1909.02670  [pdf, other

    cs.CL

    In Plain Sight: Media Bias Through the Lens of Factual Reporting

    Authors: Lisa Fan, Marshall White, Eva Sharma, Ruisi Su, Prafulla Kumar Choubey, Ruihong Huang, Lu Wang

    Abstract: The increasing prevalence of political bias in news media calls for greater public awareness of it, as well as robust methods for its detection. While prior work in NLP has primarily focused on the lexical bias captured by linguistic attributes such as word choice and syntax, other types of bias stem from the actual content selected for inclusion in the text. In this work, we investigate the effec… ▽ More

    Submitted 5 September, 2019; originally announced September 2019.

    Comments: To appear as a short paper in EMNLP 2019

  19. arXiv:1904.02800  [pdf, other

    cs.CL

    Improving Dialogue State Tracking by Discerning the Relevant Context

    Authors: Sanuj Sharma, Prafulla Kumar Choubey, Ruihong Huang

    Abstract: A typical conversation comprises of multiple turns between participants where they go back-and-forth between different topics. At each user turn, dialogue state tracking (DST) aims to estimate user's goal by processing the current utterance. However, in many turns, users implicitly refer to the previous goal, necessitating the use of relevant dialogue history. Nonetheless, distinguishing relevant… ▽ More

    Submitted 4 April, 2019; originally announced April 2019.

    Comments: NAACL 2019

  20. arXiv:1711.02162  [pdf, other

    cs.CL

    TAMU at KBP 2017: Event Nugget Detection and Coreference Resolution

    Authors: Prafulla Kumar Choubey, Ruihong Huang

    Abstract: In this paper, we describe TAMU's system submitted to the TAC KBP 2017 event nugget detection and coreference resolution task. Our system builds on the statistical and empirical observations made on training and development data. We found that modifiers of event nuggets tend to have unique syntactic distribution. Their parts-of-speech tags and dependency relations provides them essential character… ▽ More

    Submitted 25 February, 2018; v1 submitted 6 November, 2017; originally announced November 2017.

    Comments: TAC KBP 2017

  21. arXiv:1707.07344  [pdf, other

    cs.CL

    Event Coreference Resolution by Iteratively Unfolding Inter-dependencies among Events

    Authors: Prafulla Kumar Choubey, Ruihong Huang

    Abstract: We introduce a novel iterative approach for event coreference resolution that gradually builds event clusters by exploiting inter-dependencies among event mentions within the same chain as well as across event chains. Among event mentions in the same chain, we distinguish within- and cross-document event coreference links by using two distinct pairwise classifiers, trained separately to capture di… ▽ More

    Submitted 23 July, 2017; originally announced July 2017.

    Comments: EMNLP 2017

  22. arXiv:1707.07343  [pdf, other

    cs.CL

    A Sequential Model for Classifying Temporal Relations between Intra-Sentence Events

    Authors: Prafulla Kumar Choubey, Ruihong Huang

    Abstract: We present a sequential model for temporal relation classification between intra-sentence events. The key observation is that the overall syntactic structure and compositional meanings of the multi-word context between events are important for distinguishing among fine-grained temporal relations. Specifically, our approach first extracts a sequence of context words that indicates the temporal rela… ▽ More

    Submitted 23 July, 2017; originally announced July 2017.

    Comments: EMNLP 2017

  23. Universality of scanning tunneling microscopy in cuprate superconductors

    Authors: Peayush Choubey, Andreas Kreisel, T. Berlijn, Brian M. Andersen, P. J. Hirschfeld

    Abstract: We consider the problem of local tunneling into cuprate superconductors, combining model based calculations for the superconducting order parameter with wavefunction information obtained from first principles electronic structure. For some time it has been proposed that scanning tunneling microscopy (STM) spectra do not reflect the properties of the superconducting layer in the CuO$_2$ plane direc… ▽ More

    Submitted 29 June, 2017; originally announced June 2017.

    Report number: CMT NBI 2017

    Journal ref: Phys. Rev. B 96, 174523 (2017)

  24. Incommensurate charge ordered states in the $\mathit{t-t^{\prime}-J}$ model

    Authors: Peayush Choubey, Wei-Lin Tu, Ting-Kuo Lee, P. J. Hirschfeld

    Abstract: We study the incommensurate charge ordered states in the $\mathit{t-t^{\prime}-J}$ model using the Gutzwiller mean field theory on large systems. In particular, we explore the properties of incommensurate charge modulated states referred to as nodal pair density waves (nPDW) in the literature. nPDW states intertwine site and bond charge order with modulated $d$-wave pair order, and are characteriz… ▽ More

    Submitted 22 September, 2016; originally announced September 2016.

    Journal ref: New J. Phys. 19, 013028 (2017)

  25. arXiv:1407.1846  [pdf, other

    cond-mat.supr-con cond-mat.str-el

    Interpretation of scanning tunneling quasiparticle interference and impurity states in cuprates

    Authors: A. Kreisel, Peayush Choubey, T. Berlijn, B. M. Andersen, P. J. Hirschfeld

    Abstract: We apply a recently developed method combining first principles based Wannier functions with solutions to the Bogoliubov-de Gennes equations to the problem of interpreting STM data in cuprate superconductors. We show that the observed images of Zn on the surface of Bi$_2$Sr$_2$CaCu$_2$O$_8$ can only be understood by accounting for the tails of the Cu Wannier functions, which include significant we… ▽ More

    Submitted 28 May, 2015; v1 submitted 7 July, 2014; originally announced July 2014.

    Comments: 5 pages, 5 figures, published version (Supplemental Material: 5 pages, 11 figures) for associated video file, see http://itp.uni-frankfurt.de/~kreisel/QPI_BSCCO_BdG_p_W.mp4

    Report number: NBI CMT 2014

    Journal ref: Phys. Rev. Lett. 114, 217002 (2015)

  26. Visualization of atomic-scale phenomena in superconductors: application to FeSe

    Authors: Peayush Choubey, T. Berlijn, A. Kreisel, C. Cao, P. J. Hirschfeld

    Abstract: We propose a simple method of calculating inhomogeneous, atomic-scale phenomena in superconductors which makes use of the wave function information traditionally discarded in the construction of tight-binding models used in the Bogoliubov-de Gennes equations. The method uses symmetry based first principles Wannier functions to visualize the effects of superconducting pairing on the distribution of… ▽ More

    Submitted 19 November, 2014; v1 submitted 29 January, 2014; originally announced January 2014.

    Journal ref: Phys. Rev. B 90, 134520 (2014)