Search | arXiv e-print repository

Showing 1–50 of 163 results for author: Griffiths, T

  1. arXiv:2409.05890  [pdf, other]

    cs.CY physics.soc-ph

    Automating the Practice of Science -- Opportunities, Challenges, and Implications

    Authors: Sebastian Musslick, Laura K. Bartlett, Suyog H. Chandramouli, Marina Dubova, Fernand Gobet, Thomas L. Griffiths, Jessica Hullman, Ross D. King, J. Nathan Kutz, Christopher G. Lucas, Suhas Mahesh, Franco Pestilli, Sabina J. Sloman, William R. Holmes

    Abstract: Automation transformed various aspects of our human civilization, revolutionizing industries and streamlining processes. In the domain of scientific inquiry, automated approaches emerged as powerful tools, holding promise for accelerating discovery, enhancing reproducibility, and overcoming the traditional impediments to scientific progress. This article evaluates the scope of automation within sc…

    Submitted 27 August, 2024; originally announced September 2024.

  2. arXiv:2408.07865  [pdf, other]

    econ.GN cs.GT cs.LG

    Capturing the Complexity of Human Strategic Decision-Making with Machine Learning

    Authors: Jian-Qiao Zhu, Joshua C. Peterson, Benjamin Enke, Thomas L. Griffiths

    Abstract: Understanding how people behave in strategic settings--where they make decisions based on their expectations about the behavior of others--is a long-standing problem in the behavioral sciences. We conduct the largest study to date of strategic decision-making in the context of initial play in two-player matrix games, analyzing over 90,000 human decisions across more than 2,400 procedurally generat…

    Submitted 14 August, 2024; originally announced August 2024.

  3. arXiv:2408.03943  [pdf, other]

    cs.HC cs.AI cs.LG

    Building Machines that Learn and Think with People

    Authors: Katherine M. Collins, Ilia Sucholutsky, Umang Bhatt, Kartik Chandra, Lionel Wong, Mina Lee, Cedegao E. Zhang, Tan Zhi-Xuan, Mark Ho, Vikash Mansinghka, Adrian Weller, Joshua B. Tenenbaum, Thomas L. Griffiths

    Abstract: What do we want from machine intelligence? We envision machines that are not just tools for thought, but partners in thought: reasonable, insightful, knowledgeable, reliable, and trustworthy systems that think with us. Current artificial intelligence (AI) systems satisfy some of these criteria, some of the time. In this Perspective, we show how the science of collaborative cognition can be put to…

    Submitted 21 July, 2024; originally announced August 2024.

  4. arXiv:2407.01687  [pdf, other]

    cs.CL cs.AI

    Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning

    Authors: Akshara Prabhakar, Thomas L. Griffiths, R. Thomas McCoy

    Abstract: Chain-of-Thought (CoT) prompting has been shown to enhance the multi-step reasoning capabilities of Large Language Models (LLMs). However, debates persist about whether LLMs exhibit abstract generalization or rely on shallow heuristics when given CoT prompts. To understand the factors influencing CoT reasoning we provide a detailed case study of the symbolic reasoning task of decoding shift cipher…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 9 pages plus references and appendices

  5. arXiv:2406.17055  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG

    Large Language Models Assume People are More Rational than We Really are

    Authors: Ryan Liu, Jiayi Geng, Joshua C. Peterson, Ilia Sucholutsky, Thomas L. Griffiths

    Abstract: In order for AI systems to communicate effectively with people, they must understand how we make decisions. However, people's decisions are not always rational, so the implicit internal models of human decision-making in Large Language Models (LLMs) must account for this. Previous empirical evidence seems to suggest that these implicit models are accurate -- LLMs offer believable proxies of human…

    Submitted 30 July, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  6. arXiv:2406.04302  [pdf, other]

    cs.LG

    Representational Alignment Supports Effective Machine Teaching

    Authors: Ilia Sucholutsky, Katherine M. Collins, Maya Malaviya, Nori Jacoby, Weiyang Liu, Theodore R. Sumers, Michalis Korakakis, Umang Bhatt, Mark Ho, Joshua B. Tenenbaum, Brad Love, Zachary A. Pardos, Adrian Weller, Thomas L. Griffiths

    Abstract: A good teacher should not only be knowledgeable; but should be able to communicate in a way that the student understands -- to share the student's representation of the world. In this work, we integrate insights from machine teaching and pragmatic communication with the burgeoning literature on representational alignment to characterize a utility curve defining a relationship between representatio…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Preprint

  7. arXiv:2406.03707  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    What Should Embeddings Embed? Autoregressive Models Represent Latent Generating Distributions

    Authors: Liyi Zhang, Michael Y. Li, Thomas L. Griffiths

    Abstract: Autoregressive language models have demonstrated a remarkable ability to extract latent structure from text. The embeddings from large language models have been shown to capture aspects of the syntax and semantics of language. But what {\em should} embeddings represent? We connect the autoregressive prediction objective to the idea of constructing predictive sufficient statistics to summarize the…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 15 pages, 8 figures

    ACM Class: I.2; I.5

  8. arXiv:2406.02268  [pdf, other]

    cs.LG

    Analyzing the Benefits of Prototypes for Semi-Supervised Category Learning

    Authors: Liyi Zhang, Logan Nelson, Thomas L. Griffiths

    Abstract: Categories can be represented at different levels of abstraction, from prototypes focused on the most typical members to remembering all observed exemplars of the category. These representations have been explored in the context of supervised learning, where stimuli are presented with known category labels. We examine the benefits of prototype-based representations in a less-studied domain: semi-s…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 7 pages, 3 figures

    ACM Class: I.2; I.5

  9. arXiv:2406.01860  [pdf, other]

    cs.CL

    Eliciting the Priors of Large Language Models using Iterated In-Context Learning

    Authors: Jian-Qiao Zhu, Thomas L. Griffiths

    Abstract: As Large Language Models (LLMs) are increasingly deployed in real-world settings, understanding the knowledge they implicitly use when making decisions is critical. One way to capture this knowledge is in the form of Bayesian prior distributions. We develop a prompt-based workflow for eliciting prior distributions from LLMs. Our approach is based on iterated learning, a Markov chain Monte Carlo me…

    Submitted 3 June, 2024; originally announced June 2024.

  10. arXiv:2405.19420  [pdf, other]

    cs.LG cs.AI q-bio.NC

    Using Contrastive Learning with Generative Similarity to Learn Spaces that Capture Human Inductive Biases

    Authors: Raja Marjieh, Sreejan Kumar, Declan Campbell, Liyi Zhang, Gianluca Bencomo, Jake Snell, Thomas L. Griffiths

    Abstract: Humans rely on strong inductive biases to learn from few examples and abstract useful information from sensory data. Instilling such biases in machine learning models has been shown to improve their performance on various benchmarks including few-shot learning, robustness, and alignment. However, finding effective training procedures to achieve that goal can be challenging as psychologically-rich…

    Submitted 29 May, 2024; originally announced May 2024.

  11. arXiv:2405.19313  [pdf, other]

    cs.AI cs.CL econ.GN

    Language Models Trained to do Arithmetic Predict Human Risky and Intertemporal Choice

    Authors: Jian-Qiao Zhu, Haijiang Yan, Thomas L. Griffiths

    Abstract: The observed similarities in the behavior of humans and Large Language Models (LLMs) have prompted researchers to consider the potential of using LLMs as models of human cognition. However, several significant challenges must be addressed before LLMs can be legitimately regarded as cognitive models. For instance, LLMs are trained on far more data than humans typically encounter, and may have been…

    Submitted 29 May, 2024; originally announced May 2024.

  12. arXiv:2403.19669  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV

    Analyzing the Roles of Language and Vision in Learning from Limited Data

    Authors: Allison Chen, Ilia Sucholutsky, Olga Russakovsky, Thomas L. Griffiths

    Abstract: Does language help make sense of the visual world? How important is it to actually see the world rather than having it described with words? These basic questions about the nature of intelligence have been difficult to answer because we only had one example of an intelligent system -- humans -- and limited access to cases that isolated language or vision. However, the development of sophisticated…

    Submitted 10 May, 2024; v1 submitted 15 February, 2024; originally announced March 2024.

    Comments: 8 pages, 4 figures

  13. arXiv:2403.12482  [pdf, other]

    cs.AI cs.CL cs.CY cs.MA

    Embodied LLM Agents Learn to Cooperate in Organized Teams

    Authors: Xudong Guo, Kaixuan Huang, Jiale Liu, Wenhui Fan, Natalia Vélez, Qingyun Wu, Huazheng Wang, Thomas L. Griffiths, Mengdi Wang

    Abstract: Large Language Models (LLMs) have emerged as integral tools for reasoning, planning, and decision-making, drawing upon their extensive world knowledge and proficiency in language-related tasks. LLMs thus hold tremendous potential for natural language interaction within multi-agent systems to foster cooperation. However, LLM agents tend to over-report and comply with any instruction, which may resu…

    Submitted 23 May, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

  14. arXiv:2402.18759  [pdf, other]

    cs.RO cs.AI cs.LG

    Learning with Language-Guided State Abstractions

    Authors: Andi Peng, Ilia Sucholutsky, Belinda Z. Li, Theodore R. Sumers, Thomas L. Griffiths, Jacob Andreas, Julie A. Shah

    Abstract: We describe a framework for using natural language to design state abstractions for imitation learning. Generalizable policy learning in high-dimensional observation spaces is facilitated by well-designed state representations, which can surface important features of an environment and hide irrelevant ones. These state representations are typically manually specified, or derived from other labor-i…

    Submitted 6 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: ICLR 2024

  15. arXiv:2402.16668  [pdf, other]

    cs.LG cs.AI

    Program-Based Strategy Induction for Reinforcement Learning

    Authors: Carlos G. Correa, Thomas L. Griffiths, Nathaniel D. Daw

    Abstract: Typical models of learning assume incremental estimation of continuously-varying decision variables like expected rewards. However, this class of models fails to capture more idiosyncratic, discrete heuristics and strategies that people and animals appear to exhibit. Despite recent advances in strategy discovery using tools like recurrent networks that generalize the classic models, the resulting…

    Submitted 26 February, 2024; originally announced February 2024.

  16. arXiv:2402.07282  [pdf, other]

    cs.CL cs.AI cs.LG

    How do Large Language Models Navigate Conflicts between Honesty and Helpfulness?

    Authors: Ryan Liu, Theodore R. Sumers, Ishita Dasgupta, Thomas L. Griffiths

    Abstract: In day-to-day communication, people often approximate the truth - for example, rounding the time or omitting details - in order to be maximally helpful to the listener. How do large language models (LLMs) handle such nuanced trade-offs? To address this question, we use psychological models and experiments designed to characterize human behavior to analyze LLMs. We test a range of LLMs and explore…

    Submitted 13 February, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

  17. arXiv:2402.07035  [pdf, other]

    cs.LG cs.AI

    Distilling Symbolic Priors for Concept Learning into Neural Networks

    Authors: Ioana Marinescu, R. Thomas McCoy, Thomas L. Griffiths

    Abstract: Humans can learn new concepts from a small number of examples by drawing on their inductive biases. These inductive biases have previously been captured by using Bayesian models defined over symbolic hypothesis spaces. Is it possible to create a neural network that displays the same inductive biases? We show that inductive biases that enable rapid concept learning can be instantiated in artificial…

    Submitted 10 February, 2024; originally announced February 2024.

    Comments: 8 pages, 6 figures, 4 tables

  18. arXiv:2402.06992  [pdf, other]

    q-bio.NC cs.AI cs.CL stat.AP

    A Rational Analysis of the Speech-to-Song Illusion

    Authors: Raja Marjieh, Pol van Rijn, Ilia Sucholutsky, Harin Lee, Thomas L. Griffiths, Nori Jacoby

    Abstract: The speech-to-song illusion is a robust psychological phenomenon whereby a spoken sentence sounds increasingly more musical as it is repeated. Despite decades of research, a complete formal account of this transformation is still lacking, and some of its nuanced characteristics, namely, that certain phrases appear to transform while others do not, is not well understood. Here we provide a formal a…

    Submitted 10 February, 2024; originally announced February 2024.

    Comments: 7 pages, 5 figures

  19. arXiv:2402.04203  [pdf, other]

    cs.AI q-bio.NC

    Human-Like Geometric Abstraction in Large Pre-trained Neural Networks

    Authors: Declan Campbell, Sreejan Kumar, Tyler Giallanza, Thomas L. Griffiths, Jonathan D. Cohen

    Abstract: Humans possess a remarkable capacity to recognize and manipulate abstract structure, which is especially apparent in the domain of geometry. Recent research in cognitive science suggests neural networks do not share this capacity, concluding that human geometric abilities come from discrete symbolic structure in human mental representations. However, progress in artificial intelligence (AI) sugges…

    Submitted 6 February, 2024; originally announced February 2024.

  20. arXiv:2402.04105  [pdf, other]

    cs.CY cs.CL

    Measuring Implicit Bias in Explicitly Unbiased Large Language Models

    Authors: Xuechunzi Bai, Angelina Wang, Ilia Sucholutsky, Thomas L. Griffiths

    Abstract: Large language models (LLMs) can pass explicit social bias tests but still harbor implicit biases, similar to humans who endorse egalitarian beliefs yet exhibit subtle biases. Measuring such implicit biases can be a challenge: as LLMs become increasingly proprietary, it may not be possible to access their embeddings and apply existing bias measures; furthermore, implicit biases are primarily a con…

    Submitted 23 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  21. arXiv:2402.03618  [pdf, other]

    cs.AI cs.CL q-bio.NC

    Comparing Abstraction in Humans and Large Language Models Using Multimodal Serial Reproduction

    Authors: Sreejan Kumar, Raja Marjieh, Byron Zhang, Declan Campbell, Michael Y. Hu, Umang Bhatt, Brenden Lake, Thomas L. Griffiths

    Abstract: Humans extract useful abstractions of the world from noisy sensory data. Serial reproduction allows us to study how people construe the world through a paradigm similar to the game of telephone, where one person observes a stimulus and reproduces it for the next to form a chain of reproductions. Past serial reproduction experiments typically employ a single sensory modality, but humans often commu…

    Submitted 5 February, 2024; originally announced February 2024.

  22. arXiv:2402.03081  [pdf, other]

    cs.RO cs.AI cs.LG

    Preference-Conditioned Language-Guided Abstraction

    Authors: Andi Peng, Andreea Bobu, Belinda Z. Li, Theodore R. Sumers, Ilia Sucholutsky, Nishanth Kumar, Thomas L. Griffiths, Julie A. Shah

    Abstract: Learning from demonstrations is a common way for users to teach robots, but it is prone to spurious feature correlations. Recent work constructs state abstractions, i.e. visual representations containing task-relevant features, from language as a way to perform more generalizable learning. However, these abstractions also depend on a user's preference for what matters in a task, which may be hard…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: HRI 2024

  23. arXiv:2401.16657  [pdf, other]

    cs.AI cs.CL

    Recovering Mental Representations from Large Language Models with Markov Chain Monte Carlo

    Authors: Jian-Qiao Zhu, Haijiang Yan, Thomas L. Griffiths

    Abstract: Simulating sampling algorithms with people has proven a useful method for efficiently probing and understanding their mental representations. We propose that the same methods can be used to study the representations of Large Language Models (LLMs). While one can always directly prompt either humans or LLMs to disclose their mental representations introspectively, we show that increased efficiency…

    Submitted 29 January, 2024; originally announced January 2024.

  24. arXiv:2401.16646  [pdf, other]

    cs.CL cs.AI

    Incoherent Probability Judgments in Large Language Models

    Authors: Jian-Qiao Zhu, Thomas L. Griffiths

    Abstract: Autoregressive Large Language Models (LLMs) trained for next-word prediction have demonstrated remarkable proficiency at producing coherent text. But are they equally adept at forming coherent probability judgments? We use probabilistic identities and repeated judgments to assess the coherence of probability judgments made by LLMs. Our results show that the judgments produced by these models are o…

    Submitted 29 January, 2024; originally announced January 2024.

  25. arXiv:2401.08672  [pdf, ps, other]

    cs.LG cs.AI q-bio.NC

    Concept Alignment

    Authors: Sunayana Rane, Polyphony J. Bruna, Ilia Sucholutsky, Christopher Kello, Thomas L. Griffiths

    Abstract: Discussion of AI alignment (alignment between humans and AI systems) has focused on value alignment, broadly referring to creating AI systems that share human values. We argue that before we can even attempt to align values, it is imperative that AI systems and humans align the concepts they use to understand the world. We integrate ideas from philosophy, cognitive science, and deep learning to ex…

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: NeurIPS MP2 Workshop 2023

  26. arXiv:2312.14226  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    Deep de Finetti: Recovering Topic Distributions from Large Language Models

    Authors: Liyi Zhang, R. Thomas McCoy, Theodore R. Sumers, Jian-Qiao Zhu, Thomas L. Griffiths

    Abstract: Large language models (LLMs) can produce long, coherent passages of text, suggesting that LLMs, although trained on next-word prediction, must represent the latent structure that characterizes a document. Prior work has found that internal representations of LLMs encode one aspect of latent structure, namely syntax; here we investigate a complementary aspect, namely the document's topic structure.…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 13 pages, 4 figures

    ACM Class: I.2.6; I.2.7

  27. arXiv:2312.14106  [pdf, other]

    cs.AI cs.LG

    Learning Human-like Representations to Enable Learning Human Values

    Authors: Andrea Wynn, Ilia Sucholutsky, Thomas L. Griffiths

    Abstract: How can we build AI systems that are aligned with human values to avoid causing harm or violating societal standards for acceptable behavior? We argue that representational alignment between humans and AI agents facilitates value alignment. Making AI systems learn human-like representations of the world has many known benefits, including improving generalization, robustness to domain shifts, and f…

    Submitted 12 March, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Paper accepted in Human-Centric Representation Learning workshop at AAAI 2024 (https://hcrl-workshop.github.io/2024/)

  28. arXiv:2312.08519  [pdf]

    q-bio.NC cs.AI

    Reconciling Shared versus Context-Specific Information in a Neural Network Model of Latent Causes

    Authors: Qihong Lu, Tan T. Nguyen, Qiong Zhang, Uri Hasson, Thomas L. Griffiths, Jeffrey M. Zacks, Samuel J. Gershman, Kenneth A. Norman

    Abstract: It has been proposed that, when processing a stream of events, humans divide their experiences in terms of inferred latent causes (LCs) to support context-dependent learning. However, when shared structure is present across contexts, it is still unclear how the "splitting" of LCs and learning of shared structure can be simultaneously achieved. Here, we present the Latent Cause Network (LCNet), a n…

    Submitted 6 June, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  29. arXiv:2311.18644  [pdf, other]

    cs.AI

    Exploring the hierarchical structure of human plans via program generation

    Authors: Carlos G. Correa, Sophia Sanborn, Mark K. Ho, Frederick Callaway, Nathaniel D. Daw, Thomas L. Griffiths

    Abstract: Human behavior is inherently hierarchical, resulting from the decomposition of a task into subtasks or an abstract action into concrete actions. However, behavior is typically measured as a sequence of actions, which makes it difficult to infer its hierarchical structure. In this paper, we explore how people form hierarchically-structured plans, using an experimental paradigm that makes hierarchic…

    Submitted 30 November, 2023; originally announced November 2023.

  30. arXiv:2311.14601  [pdf, other]

    cs.LG cs.NE stat.ML

    A Metalearned Neural Circuit for Nonparametric Bayesian Inference

    Authors: Jake C. Snell, Gianluca Bencomo, Thomas L. Griffiths

    Abstract: Most applications of machine learning to classification assume a closed set of balanced classes. This is at odds with the real world, where class occurrence statistics often follow a long-tailed power-law distribution and it is unlikely that all classes are seen in a single sample. Nonparametric Bayesian models naturally capture this phenomenon, but have significant practical barriers to widesprea…

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: 13 pages, 3 figures. Code available at https://github.com/jakesnell/neural-circuits

  31. Machine Culture

    Authors: Levin Brinkmann, Fabian Baumann, Jean-François Bonnefon, Maxime Derex, Thomas F. Müller, Anne-Marie Nussberger, Agnieszka Czaplicka, Alberto Acerbi, Thomas L. Griffiths, Joseph Henrich, Joel Z. Leibo, Richard McElreath, Pierre-Yves Oudeyer, Jonathan Stray, Iyad Rahwan

    Abstract: The ability of humans to create and disseminate culture is often credited as the single most important factor of our success as a species. In this Perspective, we explore the notion of machine culture, culture mediated or generated by machines. We argue that intelligent machines simultaneously transform the cultural evolutionary processes of variation, transmission, and selection. Recommender algo…

    Submitted 22 November, 2023; v1 submitted 19 November, 2023; originally announced November 2023.

    Journal ref: Nat Hum Behav 7, 1855-1868 (2023)

  32. arXiv:2311.10580  [pdf, other]

    cs.LG eess.SY stat.ML

    Implicit Maximum a Posteriori Filtering via Adaptive Optimization

    Authors: Gianluca M. Bencomo, Jake C. Snell, Thomas L. Griffiths

    Abstract: Bayesian filtering approximates the true underlying behavior of a time-varying system by inverting an explicit generative model to convert noisy measurements into state estimates. This process typically requires either storage, inversion, and multiplication of large matrices or Monte Carlo estimation, neither of which are practical in high-dimensional state spaces such as the weight spaces of arti…

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: Under review at ICLR 2024

  33. arXiv:2311.10206  [pdf, other]

    cs.LG cs.AI

    Bayes in the age of intelligent machines

    Authors: Thomas L. Griffiths, Jian-Qiao Zhu, Erin Grant, R. Thomas McCoy

    Abstract: The success of methods based on artificial neural networks in creating intelligent machines seems like it might pose a challenge to explanations of human cognition in terms of Bayesian inference. We argue that this is not the case, and that in fact these systems offer new opportunities for Bayesian modeling. Specifically, we argue that Bayesian models of cognition and artificial neural networks li…

    Submitted 16 November, 2023; originally announced November 2023.

  34. arXiv:2311.09682  [pdf, other]

    cs.CL cs.AI

    MacGyver: Are Large Language Models Creative Problem Solvers?

    Authors: Yufei Tian, Abhilasha Ravichander, Lianhui Qin, Ronan Le Bras, Raja Marjieh, Nanyun Peng, Yejin Choi, Thomas L. Griffiths, Faeze Brahman

    Abstract: We explore the creative problem-solving capabilities of modern LLMs in a novel constrained setting. To this end, we create MACGYVER, an automatically generated dataset consisting of over 1,600 real-world problems deliberately designed to trigger innovative usage of objects and necessitate out-of-the-box thinking. We then present our collection to both LLMs and humans to compare and contrast their…

    Submitted 27 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: NAACL 2024

  35. arXiv:2311.00687  [pdf, other]

    cs.AI cs.CL cs.HC cs.LG

    Improving Interpersonal Communication by Simulating Audiences with Language Models

    Authors: Ryan Liu, Howard Yen, Raja Marjieh, Thomas L. Griffiths, Ranjay Krishna

    Abstract: How do we communicate with others to achieve our goals? We use our prior experience or advice from others, or construct a candidate utterance by predicting how it will be received. However, our experiences are limited and biased, and reasoning about potential outcomes can be difficult and cognitively challenging. In this paper, we explore how we can leverage Large Language Model (LLM) simulations…

    Submitted 3 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: 16 pages (main paper), 7 tables and figures (main)

  36. arXiv:2310.20059  [pdf, other]

    cs.AI

    Concept Alignment as a Prerequisite for Value Alignment

    Authors: Sunayana Rane, Mark Ho, Ilia Sucholutsky, Thomas L. Griffiths

    Abstract: Value alignment is essential for building AI systems that can safely and reliably interact with people. However, what a person values -- and is even capable of valuing -- depends on the concepts that they are currently using to understand and evaluate what happens in the world. The dependence of values on concepts means that concept alignment is a prerequisite for value alignment -- agents need to…

    Submitted 30 October, 2023; originally announced October 2023.

  37. arXiv:2310.13018  [pdf, other]

    q-bio.NC cs.AI cs.LG cs.NE

    Getting aligned on representational alignment

    Authors: Ilia Sucholutsky, Lukas Muttenthaler, Adrian Weller, Andi Peng, Andreea Bobu, Been Kim, Bradley C. Love, Erin Grant, Iris Groen, Jascha Achterberg, Joshua B. Tenenbaum, Katherine M. Collins, Katherine L. Hermann, Kerem Oktar, Klaus Greff, Martin N. Hebart, Nori Jacoby, Qiuyi Zhang, Raja Marjieh, Robert Geirhos, Sherol Chen, Simon Kornblith, Sunayana Rane, Talia Konkle, Thomas P. O'Connell, et al. (5 additional authors not shown)

    Abstract: Biological and artificial information processing systems form representations that they can use to categorize, reason, plan, navigate, and make decisions. How can we measure the extent to which the representations formed by these diverse systems agree? Do similarities in representations then translate into similar behavior? How can a system's representations be modified to better match those of an…

    Submitted 2 November, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: Working paper, changes to be made in upcoming revisions

  38. arXiv:2310.12994  [pdf]

    q-bio.NC cs.AI

    Dimensions of Disagreement: Unpacking Divergence and Misalignment in Cognitive Science and Artificial Intelligence

    Authors: Kerem Oktar, Ilia Sucholutsky, Tania Lombrozo, Thomas L. Griffiths

    Abstract: The increasing prevalence of artificial agents creates a correspondingly increasing need to manage disagreements between humans and artificial agents, as well as between artificial agents themselves. Considering this larger space of possible agents exposes an opportunity for furthering our understanding of the nature of disagreement: past studies in psychology have often cast disagreement as two a…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: Currently under review

  39. arXiv:2310.02221  [pdf, other]

    cs.LG

    Structurally guided task decomposition in spatial navigation tasks

    Authors: Ruiqi He, Carlos G. Correa, Thomas L. Griffiths, Mark K. Ho

    Abstract: How are people able to plan so efficiently despite limited cognitive resources? We aimed to answer this question by extending an existing model of human task decomposition that can explain a wide range of simple planning problems by adding structure information to the task to facilitate planning in more complex tasks. The extended model was then applied to a more complex planning domain of spatial…

    Submitted 3 October, 2023; originally announced October 2023.

  40. arXiv:2309.17363  [pdf, other]

    q-bio.NC

    Relational Constraints On Neural Networks Reproduce Human Biases towards Abstract Geometric Regularity

    Authors: Declan Campbell, Sreejan Kumar, Tyler Giallanza, Jonathan D. Cohen, Thomas L. Griffiths

    Abstract: Uniquely among primates, humans possess a remarkable capacity to recognize and manipulate abstract structure in the service of task goals across a broad range of behaviors. One illustration of this is in the visual perception of geometric forms. Studies have shown a uniquely human bias toward geometric regularity, with task performance enhanced for more regular and symmetric forms compared to thei…

    Submitted 29 September, 2023; originally announced September 2023.

  41. arXiv:2309.13638  [pdf, other]

    cs.CL cs.AI

    Embers of Autoregression: Understanding Large Language Models Through the Problem They are Trained to Solve

    Authors: R. Thomas McCoy, Shunyu Yao, Dan Friedman, Matthew Hardy, Thomas L. Griffiths

    Abstract: The widespread adoption of large language models (LLMs) makes it important to recognize their strengths and limitations. We argue that in order to develop a holistic understanding of these systems we need to consider the problem that they were trained to solve: next-word prediction over Internet text. By recognizing the pressures that this task exerts we can make predictions about the strategies t…

    Submitted 24 September, 2023; originally announced September 2023.

    Comments: 50 pages plus 11 page of references and 23 pages of appendices

  42. arXiv:2309.02427  [pdf, other]

    cs.AI cs.CL cs.LG cs.SC

    Cognitive Architectures for Language Agents

    Authors: Theodore R. Sumers, Shunyu Yao, Karthik Narasimhan, Thomas L. Griffiths

    Abstract: Recent efforts have augmented large language models (LLMs) with external resources (e.g., the Internet) or internal control flows (e.g., prompt chaining) for tasks requiring grounding or reasoning, leading to a new class of language agents. While these agents have achieved substantial empirical success, we lack a systematic framework to organize existing agents and plan future developments. In thi…

    Submitted 15 March, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: v3 is TMLR camera ready version. 19 pages of main content, 5 figures. The first two authors contributed equally, order decided by coin flip. A CoALA-based repo of recent work on language agents: https://github.com/ysymyth/awesome-language-agents

  43. arXiv:2306.08564  [pdf, other]

    q-bio.NC cs.AI stat.AP

    The Universal Law of Generalization Holds for Naturalistic Stimuli

    Authors: Raja Marjieh, Nori Jacoby, Joshua C. Peterson, Thomas L. Griffiths

    Abstract: Shepard's universal law of generalization is a remarkable hypothesis about how intelligent organisms should perceive similarity. In its broadest form, the universal law states that the level of perceived similarity between a pair of stimuli should decay as a concave function of their distance when embedded in an appropriate psychological space. While extensively studied, evidence in support of the…

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: 36 pages, 6 figures

  44. arXiv:2305.18213  [pdf]

    cs.LG cs.AI

    Gaussian Process Probes (GPP) for Uncertainty-Aware Probing

    Authors: Zi Wang, Alexander Ku, Jason Baldridge, Thomas L. Griffiths, Been Kim

    Abstract: Understanding which concepts models can and cannot represent has been fundamental to many tasks: from effective and responsible use of models to detecting out of distribution data. We introduce Gaussian process probes (GPP), a unified and simple framework for probing and measuring uncertainty about concepts represented by models. As a Bayesian extension of linear probing methods, GPP asks what kin…

    Submitted 6 November, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Journal ref: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  45. arXiv:2305.17262  [pdf, other]

    cs.CV cs.AI

    Im-Promptu: In-Context Composition from Image Prompts

    Authors: Bhishma Dedhia, Michael Chang, Jake C. Snell, Thomas L. Griffiths, Niraj K. Jha

    Abstract: Large language models are few-shot learners that can solve diverse tasks from a handful of demonstrations. This implicit understanding of tasks suggests that the attention mechanisms over word tokens may play a role in analogical reasoning. In this work, we investigate whether analogical reasoning can enable in-context composition over composable elements of visual stimuli. First, we introduce a s…

    Submitted 22 October, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  46. arXiv:2305.14701  [pdf, other]

    cs.CL cs.AI

    Modeling rapid language learning by distilling Bayesian priors into artificial neural networks

    Authors: R. Thomas McCoy, Thomas L. Griffiths

    Abstract: Humans can learn languages from remarkably little experience. Developing computational models that explain this ability has been a major challenge in cognitive science. Bayesian models that build in strong inductive biases - factors that guide generalization - have been successful at explaining how humans might generalize from few examples in controlled settings but are usually too restrictive to…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 21 pages plus references; 4 figures

  47. arXiv:2305.10601  [pdf, other]

    cs.CL cs.AI cs.LG

    Tree of Thoughts: Deliberate Problem Solving with Large Language Models

    Authors: Shunyu Yao, Dian Yu, Jeffrey Zhao, Izhak Shafran, Thomas L. Griffiths, Yuan Cao, Karthik Narasimhan

    Abstract: Language models are increasingly being deployed for general problem solving across a wide range of tasks, but are still confined to token-level, left-to-right decision-making processes during inference. This means they can fall short in tasks that require exploration, strategic lookahead, or where initial decisions play a pivotal role. To surmount these challenges, we introduce a new framework for…

    Submitted 3 December, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 camera ready version. Code repo with all prompts: https://github.com/princeton-nlp/tree-of-thought-llm

  48. arXiv:2303.11373  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE cs.RO

    Neural Constraint Satisfaction: Hierarchical Abstraction for Combinatorial Generalization in Object Rearrangement

    Authors: Michael Chang, Alyssa L. Dayan, Franziska Meier, Thomas L. Griffiths, Sergey Levine, Amy Zhang

    Abstract: Object rearrangement is a challenge for embodied agents because solving these tasks requires generalizing across a combinatorially large set of configurations of entities and their locations. Worse, the representations of these entities are unknown and must be inferred from sensory percepts. We present a hierarchical abstraction approach to uncover these underlying entities and achieve combinatori…

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: 19 pages, 11 figures, Published as a conference paper at the International Conference on Learning Representations 2023

  49. Superhuman Artificial Intelligence Can Improve Human Decision Making by Increasing Novelty

    Authors: Minkyu Shin, Jin Kim, Bas van Opheusden, Thomas L. Griffiths

    Abstract: How will superhuman artificial intelligence (AI) affect human decision making? And what will be the mechanisms behind this effect? We address these questions in a domain where AI already exceeds human performance, analyzing more than 5.8 million move decisions made by professional Go players over the past 71 years (1950-2021). To address the first question, we use a superhuman AI program to estima…

    Submitted 14 April, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: This paper is published in PNAS: https://www.pnas.org/doi/10.1073/pnas.2214840120. Minor edits to v1 include the addition of a watermark and a link to the published paper in the footer.

    MSC Class: 68T01; 68T05; 68T35; 68T99 ACM Class: I.2.0; I.2.1; I.2.6; I.2.m

    Journal ref: Proceedings of the National Academy of Sciences, 120 (12), e2214840120 (2023)

  50. arXiv:2302.08013  [pdf, other]

    physics.flu-dyn math-ph

    Revisiting boundary layer flows of viscoelastic fluids

    Authors: L. J. Escott, P. T. Griffiths

    Abstract: In this article we reconsider high Reynolds number boundary layer flows of fluids with viscoelastic properties. We show that a number of previous studies that have attempted to address this problem are, in fact, incomplete. We correctly reformulate the problem and solve the governing equations using a Chebyshev collocation scheme. By analysing the decay of the solutions to the far-field we determi…

    Submitted 15 February, 2023; originally announced February 2023.

    Journal ref: J. Non-Newt. Fluid Mech. 312, 104976 (2023)