Search | arXiv e-print repository

Showing 1–10 of 10 results for author: Cases, I

  1. arXiv:2407.13729  [pdf, other]

    cs.CL

    Baba Is AI: Break the Rules to Beat the Benchmark

    Authors: Nathan Cloos, Meagan Jens, Michelangelo Naim, Yen-Ling Kuo, Ignacio Cases, Andrei Barbu, Christopher J. Cueva

    Abstract: Humans solve problems by following existing rules and procedures, and also by leaps of creativity to redefine those rules and objectives. To probe these abilities, we developed a new benchmark based on the game Baba Is You where an agent manipulates both objects in the environment and rules, represented by movable tiles with words written on them, to reach a specified goal and win the game. We tes…

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 8 pages, 8 figures

  2. arXiv:2406.14481  [pdf, other]

    cs.LG cs.AI cs.NE q-bio.NC

    Revealing Vision-Language Integration in the Brain with Multimodal Networks

    Authors: Vighnesh Subramaniam, Colin Conwell, Christopher Wang, Gabriel Kreiman, Boris Katz, Ignacio Cases, Andrei Barbu

    Abstract: We use (multi)modal deep neural networks (DNNs) to probe for sites of multimodal integration in the human brain by predicting stereoencephalography (SEEG) recordings taken while human subjects watched movies. We operationalize sites of multimodal integration as regions where a multimodal vision-language model predicts recordings better than unimodal language, unimodal vision, or linearly-integrate…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: ICML 2024; 23 pages, 11 figures

  3. arXiv:2302.14367  [pdf, other]

    cs.LG eess.SP q-bio.NC

    BrainBERT: Self-supervised representation learning for intracranial recordings

    Authors: Christopher Wang, Vighnesh Subramaniam, Adam Uri Yaari, Gabriel Kreiman, Boris Katz, Ignacio Cases, Andrei Barbu

    Abstract: We create a reusable Transformer, BrainBERT, for intracranial recordings bringing modern representation learning approaches to neuroscience. Much like in NLP and speech recognition, this Transformer enables classifying complex concepts, i.e., decoding neural data, with higher accuracy and with much less data by being pretrained in an unsupervised manner on a large corpus of unannotated neural reco…

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: 9 pages, 6 figures, ICLR 2023

  4. arXiv:2112.07066  [pdf, other]

    cs.LG

    Continual Learning In Environments With Polynomial Mixing Times

    Authors: Matthew Riemer, Sharath Chandra Raparthy, Ignacio Cases, Gopeshh Subbaraj, Maximilian Puelma Touzel, Irina Rish

    Abstract: The mixing time of the Markov chain induced by a policy limits performance in real-world continual learning scenarios. Yet, the effect of mixing times on learning in continual reinforcement learning (RL) remains underexplored. In this paper, we characterize problems that are of long-term interest to the development of continual RL, which we call scalable MDPs, through the lens of mixing times. In…

    Submitted 13 October, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: Accepted at NeurIPS 2022

  5. arXiv:1912.13408  [pdf, other]

    cs.LG stat.ML

    On the Role of Weight Sharing During Deep Option Learning

    Authors: Matthew Riemer, Ignacio Cases, Clemens Rosenbaum, Miao Liu, Gerald Tesauro

    Abstract: The options framework is a popular approach for building temporally extended actions in reinforcement learning. In particular, the option-critic architecture provides general purpose policy gradient theorems for learning actions from scratch that are extended in time. However, past work makes the key assumption that each of the components of option-critic has independent parameters. In this work w…

    Submitted 6 February, 2020; v1 submitted 31 December, 2019; originally announced December 2019.

    Comments: AAAI 2020

  6. arXiv:1911.00811  [pdf, ps, other]

    cs.CL

    Posing Fair Generalization Tasks for Natural Language Inference

    Authors: Atticus Geiger, Ignacio Cases, Lauri Karttunen, Chris Potts

    Abstract: Deep learning models for semantics are generally evaluated using naturalistic corpora. Adversarial methods, in which models are evaluated on new examples with known semantic properties, have begun to reveal that good performance at these naturalistic tasks can hide serious shortcomings. However, we should insist that these evaluations be fair: that the models are given data sufficient to support t…

    Submitted 2 November, 2019; originally announced November 2019.

    Journal ref: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing

  7. arXiv:1904.12774  [pdf, other]

    cs.LG cs.NE stat.ML

    Routing Networks and the Challenges of Modular and Compositional Computation

    Authors: Clemens Rosenbaum, Ignacio Cases, Matthew Riemer, Tim Klinger

    Abstract: Compositionality is a key strategy for addressing combinatorial complexity and the curse of dimensionality. Recent work has shown that compositional solutions can be learned and offer substantial gains across a variety of domains, including multi-task learning, language modeling, visual question answering, machine comprehension, and others. However, such models present unique challenges during tra…

    Submitted 29 April, 2019; originally announced April 2019.

  8. arXiv:1810.13033  [pdf, ps, other]

    cs.CL

    Stress-Testing Neural Models of Natural Language Inference with Multiply-Quantified Sentences

    Authors: Atticus Geiger, Ignacio Cases, Lauri Karttunen, Christopher Potts

    Abstract: Standard evaluations of deep learning models for semantics using naturalistic corpora are limited in what they can tell us about the fidelity of the learned representations, because the corpora rarely come with good measures of semantic complexity. To overcome this limitation, we present a method for generating data sets of multiply-quantified natural language inference (NLI) examples in which sem…

    Submitted 30 October, 2018; originally announced October 2018.

  9. arXiv:1810.11910  [pdf, other]

    cs.LG cs.AI stat.ML

    Learning to Learn without Forgetting by Maximizing Transfer and Minimizing Interference

    Authors: Matthew Riemer, Ignacio Cases, Robert Ajemian, Miao Liu, Irina Rish, Yuhai Tu, Gerald Tesauro

    Abstract: Lack of performance when it comes to continual learning over non-stationary distributions of data remains a major challenge in scaling neural network learning to more human realistic settings. In this work we propose a new conceptualization of the continual learning problem in terms of a temporally symmetric trade-off between transfer and interference that can be optimized by enforcing gradient al…

    Submitted 2 May, 2019; v1 submitted 28 October, 2018; originally announced October 2018.

    Comments: ICLR 2019

  10. arXiv:1710.02076  [pdf, other]

    cs.CL

    On the Effective Use of Pretraining for Natural Language Inference

    Authors: Ignacio Cases, Minh-Thang Luong, Christopher Potts

    Abstract: Neural networks have excelled at many NLP tasks, but there remain open questions about the performance of pretrained distributed word representations and their interaction with weight initialization and other hyperparameters. We address these questions empirically using attention-based sequence-to-sequence models for natural language inference (NLI). Specifically, we compare three types of embeddi…

    Submitted 5 October, 2017; originally announced October 2017.

    Comments: This manuscript dates from late Winter 2016