Search | arXiv e-print repository

Showing 1–23 of 23 results for author: Nangia, N

  1. arXiv:2301.07473  [pdf, other]

    cs.LG stat.ML

    Discrete Latent Structure in Neural Networks

    Authors: Vlad Niculae, Caio F. Corro, Nikita Nangia, Tsvetomila Mihaylova, André F. T. Martins

    Abstract: Many types of data from fields including natural language processing, computer vision, and bioinformatics, are well represented by discrete, compositional structures such as trees, sequences, or matchings. Latent structure models are a powerful tool for learning to extract such representations, offering a way to incorporate structural bias, discover insight about the data, and interpret decisions.…

    Submitted 18 January, 2023; originally announced January 2023.

    ACM Class: I.2.6

  2. arXiv:2210.10860  [pdf, other]

    cs.CL

    Two-Turn Debate Doesn't Help Humans Answer Hard Reading Comprehension Questions

    Authors: Alicia Parrish, Harsh Trivedi, Nikita Nangia, Vishakh Padmakumar, Jason Phang, Amanpreet Singh Saimbhi, Samuel R. Bowman

    Abstract: The use of language-model-based question-answering systems to aid humans in completing difficult tasks is limited, in part, by the unreliability of the text these systems generate. Using hard multiple-choice reading comprehension questions as a testbed, we assess whether presenting humans with arguments for two competing answer options, where one is correct and the other is incorrect, allows human…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 12 pages, 6 figures, 7 tables

  3. arXiv:2208.12852  [pdf, other]

    cs.CL cs.AI

    What Do NLP Researchers Believe? Results of the NLP Community Metasurvey

    Authors: Julian Michael, Ari Holtzman, Alicia Parrish, Aaron Mueller, Alex Wang, Angelica Chen, Divyam Madaan, Nikita Nangia, Richard Yuanzhe Pang, Jason Phang, Samuel R. Bowman

    Abstract: We present the results of the NLP Community Metasurvey. Run from May to June 2022, the survey elicited opinions on controversial issues, including industry influence in the field, concerns about AGI, and ethics. Our results put concrete numbers to several controversies: For example, respondents are split almost exactly in half on questions about the importance of artificial general intelligence, w…

    Submitted 26 August, 2022; originally announced August 2022.

    Comments: 31 pages, 19 figures, 3 tables; more information at https://nlpsurvey.net

    ACM Class: I.2.7

  4. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  5. arXiv:2204.05212  [pdf, other]

    cs.CL

    Single-Turn Debate Does Not Help Humans Answer Hard Reading-Comprehension Questions

    Authors: Alicia Parrish, Harsh Trivedi, Ethan Perez, Angelica Chen, Nikita Nangia, Jason Phang, Samuel R. Bowman

    Abstract: Current QA systems can generate reasonable-sounding yet false answers without explanation or evidence for the generated answer, which is especially problematic when humans cannot readily check the model's answers. This presents a challenge for building trust in machine learning systems. We take inspiration from real-world situations where difficult questions are answered by considering opposing si…

    Submitted 13 April, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted to the 2022 ACL Workshop on Learning with Natural Language Supervision. 12 pages total, 9 figures, 2 tables

  6. arXiv:2203.06342  [pdf, other]

    cs.CL cs.AI

    What Makes Reading Comprehension Questions Difficult?

    Authors: Saku Sugawara, Nikita Nangia, Alex Warstadt, Samuel R. Bowman

    Abstract: For a natural language understanding benchmark to be useful in research, it has to consist of examples that are diverse and difficult enough to discriminate among current and near-future state-of-the-art systems. However, we do not yet know how best to select text sources to collect a variety of challenging examples. In this study, we crowdsource multiple-choice reading comprehension questions for…

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: ACL 2022

  7. arXiv:2112.08608  [pdf, other]

    cs.CL

    QuALITY: Question Answering with Long Input Texts, Yes!

    Authors: Richard Yuanzhe Pang, Alicia Parrish, Nitish Joshi, Nikita Nangia, Jason Phang, Angelica Chen, Vishakh Padmakumar, Johnny Ma, Jana Thompson, He He, Samuel R. Bowman

    Abstract: To enable building and testing models on long-document comprehension, we introduce QuALITY, a multiple-choice QA dataset with context passages in English that have an average length of about 5,000 tokens, much longer than typical current models can process. Unlike in prior work with passages, our questions are written and validated by contributors who have read the entire passage, rather than rely…

    Submitted 11 May, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: NAACL 2022

  8. arXiv:2110.08193  [pdf, other]

    cs.CL

    BBQ: A Hand-Built Bias Benchmark for Question Answering

    Authors: Alicia Parrish, Angelica Chen, Nikita Nangia, Vishakh Padmakumar, Jason Phang, Jana Thompson, Phu Mon Htut, Samuel R. Bowman

    Abstract: It is well documented that NLP models learn social biases, but little work has been done on how these biases manifest in model outputs for applied tasks like question answering (QA). We introduce the Bias Benchmark for QA (BBQ), a dataset of question sets constructed by the authors that highlight attested social biases against people belonging to protected classes along nine social dimensions rele…

    Submitted 15 March, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: Accepted to ACL 2022 Findings. 20 pages, 10 figures

  9. arXiv:2106.00794  [pdf, other]

    cs.CL cs.AI cs.HC

    What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks?

    Authors: Nikita Nangia, Saku Sugawara, Harsh Trivedi, Alex Warstadt, Clara Vania, Samuel R. Bowman

    Abstract: Crowdsourcing is widely used to create data for common natural language understanding tasks. Despite the importance of these datasets for measuring and refining model understanding of language, there has been little focus on the crowdsourcing methods used for collecting the datasets. In this paper, we compare the efficacy of interventions that have been proposed in prior work as ways of improving…

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: ACL 2021

  10. arXiv:2104.07179  [pdf, other]

    cs.CL

    Does Putting a Linguist in the Loop Improve NLU Data Collection?

    Authors: Alicia Parrish, William Huang, Omar Agha, Soo-Hwan Lee, Nikita Nangia, Alex Warstadt, Karmanya Aggarwal, Emily Allaway, Tal Linzen, Samuel R. Bowman

    Abstract: Many crowdsourced NLP datasets contain systematic gaps and biases that are identified only after data collection is complete. Identifying these issues from early data samples during crowdsourcing should make mitigation more efficient, especially when done iteratively. We take natural language inference as a test case and ask whether it is beneficial to put a linguist 'in the loop' during data coll…

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: 14 pages, 10 figures

  11. Critique on "Volume penalization for inhomogeneous Neumann boundary conditions modeling scalar flux in complicated geometry"

    Authors: Ramakrishnan Thirumalaisamy, Nishant Nangia, Amneet Pal Singh Bhalla

    Abstract: In this letter, we provide counter-examples to demonstrate that it is possible to retain second-order accuracy using Sakurai et al.'s method, even when different flux boundary conditions are imposed on multiple interfaces that do not conform to the Cartesian grid. We consider both continuous and discontinuous indicator functions in our test problems. Both indicator functions yield a similar conver…

    Submitted 29 January, 2021; originally announced January 2021.

  12. arXiv:2010.00133  [pdf, other]

    cs.CL cs.AI

    CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models

    Authors: Nikita Nangia, Clara Vania, Rasika Bhalerao, Samuel R. Bowman

    Abstract: Pretrained language models, especially masked language models (MLMs) have seen success across many NLP tasks. However, there is ample evidence that they use the cultural biases that are undoubtedly present in the corpora they are trained on, implicitly creating harm with biased representations. To measure some forms of social bias in language models against protected demographic groups in the US,…

    Submitted 30 September, 2020; originally announced October 2020.

    Comments: EMNLP 2020

  13. arXiv:2005.06108  [pdf, other]

    physics.flu-dyn physics.comp-ph

    The inertial sea wave energy converter (ISWEC) technology: device-physics, multiphase modeling and simulations

    Authors: Kaustubh Khedkar, Nishant Nangia, Ramakrishnan Thirumalaisamy, Amneet Pal Singh Bhalla

    Abstract: In this paper we investigate the dynamics of the inertial wave energy converter (ISWEC) device using fully-resolved computational fluid dynamics (CFD) simulations. Originally prototyped by Polytechnic University of Turin, the device consists of a floating, boat-shaped hull that is slack-moored to the sea bed. Internally, a gyroscopic power take off (PTO) unit converts the wave-induced pitch motion…

    Submitted 12 May, 2020; originally announced May 2020.

    Comments: Figures are compressed to comply with arXiv size requirements

  14. arXiv:1907.01041  [pdf]

    cs.CL cs.LG

    Natural Language Understanding with the Quora Question Pairs Dataset

    Authors: Lakshay Sharma, Laura Graesser, Nikita Nangia, Utku Evci

    Abstract: This paper explores the task Natural Language Understanding (NLU) by looking at duplicate question detection in the Quora dataset. We conducted extensive exploration of the dataset and used various machine learning models, including linear and tree-based models. Our final finding was that a simple Continuous Bag of Words neural network model had the best performance, outdoing more complicated recu…

    Submitted 1 July, 2019; originally announced July 2019.

  15. arXiv:1905.10425  [pdf, other]

    cs.CL cs.AI

    Human vs. Muppet: A Conservative Estimate of Human Performance on the GLUE Benchmark

    Authors: Nikita Nangia, Samuel R. Bowman

    Abstract: The GLUE benchmark (Wang et al., 2019b) is a suite of language understanding tasks which has seen dramatic progress in the past year, with average performance moving from 70.0 at launch to 83.9, state of the art at the time of writing (May 24, 2019). Here, we measure human performance on the benchmark, in order to learn whether significant headroom remains for further progress. We provide a conser…

    Submitted 1 June, 2019; v1 submitted 24 May, 2019; originally announced May 2019.

    Journal ref: ACL 2019

  16. arXiv:1905.00537  [pdf, other]

    cs.CL cs.AI

    SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems

    Authors: Alex Wang, Yada Pruksachatkun, Nikita Nangia, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, Samuel R. Bowman

    Abstract: In the last year, new models and methods for pretraining and transfer learning have driven striking performance improvements across a range of language understanding tasks. The GLUE benchmark, introduced a little over one year ago, offers a single-number metric that summarizes progress on a diverse set of such tasks, but performance on the benchmark has recently surpassed the level of non-expert h…

    Submitted 12 February, 2020; v1 submitted 1 May, 2019; originally announced May 2019.

    Comments: NeurIPS 2019, super.gluebenchmark.com updating acknowledgments

  17. arXiv:1904.04078  [pdf, other]

    math.NA physics.flu-dyn

    Simulating water-entry/exit problems using Eulerian-Lagrangian and fully-Eulerian fictitious domain methods within the open-source IBAMR library

    Authors: Amneet Pal Singh Bhalla, Nishant Nangia, Panagiotis Dafnakis, Giovanni Bracco, Giuliana Mattiazzo

    Abstract: In this paper we employ two implementations of the fictitious domain (FD) method to simulate water-entry and water-exit problems and demonstrate their ability to simulate practical marine engineering problems. In FD methods, the fluid momentum equation is extended within the solid domain using an additional body force that constrains the structure velocity to be that of a rigid body. Using this fo…

    Submitted 3 July, 2019; v1 submitted 4 April, 2019; originally announced April 2019.

    Comments: The current paper builds on arXiv:1901.07892 and re-explains some parts of it for the reader's convenience

  18. arXiv:1901.07892  [pdf, other]

    physics.flu-dyn math.NA physics.comp-ph

    A DLM immersed boundary method based wave-structure interaction solver for high density ratio multiphase flows

    Authors: Nishant Nangia, Neelesh A. Patankar, Amneet Pal Singh Bhalla

    Abstract: We present a robust immersed boundary (IB) method for high density ratio multiphase flows that is capable of modeling complex wave-structure interaction (WSI) problems arising in marine and coastal engineering applications. The IB/WSI methodology is enabled by combining the distributed Lagrange multiplier (DLM) method of Sharma and Patankar (J Comp Phys, 2005) with a robust level set method based…

    Submitted 1 September, 2019; v1 submitted 21 January, 2019; originally announced January 2019.

    Comments: Figures are compressed to comply with arXiv size requirements

  19. arXiv:1809.01008  [pdf, other]

    physics.comp-ph math.NA physics.flu-dyn

    A robust incompressible Navier-Stokes solver for high density ratio multiphase flows

    Authors: Nishant Nangia, Boyce E. Griffith, Neelesh A. Patankar, Amneet Pal Singh Bhalla

    Abstract: This paper presents a robust, adaptive numerical scheme for simulating high density ratio and high shear multiphase flows on locally refined Cartesian grids that adapt to the evolving interfaces and track regions of high vorticity. The algorithm combines the interface capturing level set method with a variable-coefficient incompressible Navier-Stokes solver that is demonstrated to stably resolve m…

    Submitted 1 September, 2019; v1 submitted 31 August, 2018; originally announced September 2018.

    Comments: Figures are compressed to comply with arXiv size requirements

  20. arXiv:1804.06028  [pdf, other]

    cs.CL

    ListOps: A Diagnostic Dataset for Latent Tree Learning

    Authors: Nikita Nangia, Samuel R. Bowman

    Abstract: Latent tree learning models learn to parse a sentence without syntactic supervision, and use that parse to build the sentence representation. Existing work on such models has shown that, while they perform well on tasks like sentence classification, they do not learn grammars that conform to any plausible semantic or syntactic formalism (Williams et al., 2018a). Studying the parsing ability of suc…

    Submitted 16 April, 2018; originally announced April 2018.

    Comments: 8 pages, 4 figures, 3 tables, NAACL-SRW (2018)

  21. arXiv:1707.08172  [pdf, other]

    cs.CL

    The RepEval 2017 Shared Task: Multi-Genre Natural Language Inference with Sentence Representations

    Authors: Nikita Nangia, Adina Williams, Angeliki Lazaridou, Samuel R. Bowman

    Abstract: This paper presents the results of the RepEval 2017 Shared Task, which evaluated neural network sentence representation learning models on the Multi-Genre Natural Language Inference corpus (MultiNLI) recently introduced by Williams et al. (2017). All of the five participating teams beat the bidirectional LSTM (BiLSTM) and continuous bag of words baselines reported in Williams et al. The best sing…

    Submitted 25 July, 2017; originally announced July 2017.

    Comments: 10 pages, 1 figure, 6 tables, in Proceedings of The Second Workshop on Evaluating Vector Space Representations for NLP (RepEval 2017)

  22. arXiv:1704.05426  [pdf, ps, other]

    cs.CL

    A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference

    Authors: Adina Williams, Nikita Nangia, Samuel R. Bowman

    Abstract: This paper introduces the Multi-Genre Natural Language Inference (MultiNLI) corpus, a dataset designed for use in the development and evaluation of machine learning models for sentence understanding. In addition to being one of the largest corpora available for the task of NLI, at 433k examples, this corpus improves upon available resources in its coverage: it offers data from ten distinct genres…

    Submitted 19 February, 2018; v1 submitted 18 April, 2017; originally announced April 2017.

    Comments: 10 pages, 1 figure, 5 tables. v2 corrects a misreported accuracy number for the CBOW model in the 'matched' setting. v3 adds a discussion of the difficulty of the corpus to the analysis section. v4 is the version that was accepted to NAACL 2018

  23. A moving control volume approach to computing hydrodynamic forces and torques on immersed bodies

    Authors: Nishant Nangia, Hans Johansen, Neelesh A. Patankar, Amneet Pal Singh Bhalla

    Abstract: We present a moving control volume (CV) approach to computing hydrodynamic forces and torques on complex geometries. The method requires surface and volumetric integrals over a simple and regular Cartesian box that moves with an arbitrary velocity to enclose the body at all times. The moving box is aligned with Cartesian grid faces, which makes the integral evaluation straightforward in an immerse…

    Submitted 15 June, 2017; v1 submitted 1 April, 2017; originally announced April 2017.