(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 196 results for author: Agarwal, R

.
  1. arXiv:2407.04622  [pdf, other

    cs.LG

    On scalable oversight with weak LLMs judging strong LLMs

    Authors: Zachary Kenton, Noah Y. Siegel, János Kramár, Jonah Brown-Cohen, Samuel Albanie, Jannis Bulian, Rishabh Agarwal, David Lindner, Yunhao Tang, Noah D. Goodman, Rohin Shah

    Abstract: Scalable oversight protocols aim to enable humans to accurately supervise superhuman AI. In this paper we study debate, where two AI's compete to convince a judge; consultancy, where a single AI tries to convince a judge that asks questions; and compare to a baseline of direct question-answering, where the judge just answers outright without the AI. We use large language models (LLMs) as both AI a… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 15 pages (53 including appendices)

  2. arXiv:2406.18537  [pdf, other

    cs.CV cs.AI cs.GR cs.RO

    AddBiomechanics Dataset: Capturing the Physics of Human Motion at Scale

    Authors: Keenon Werling, Janelle Kaneda, Alan Tan, Rishi Agarwal, Six Skov, Tom Van Wouwe, Scott Uhlrich, Nicholas Bianco, Carmichael Ong, Antoine Falisse, Shardul Sapkota, Aidan Chandra, Joshua Carter, Ezio Preatoni, Benjamin Fregly, Jennifer Hicks, Scott Delp, C. Karen Liu

    Abstract: While reconstructing human poses in 3D from inexpensive sensors has advanced significantly in recent years, quantifying the dynamics of human motion, including the muscle-generated joint torques and external forces, remains a challenge. Prior attempts to estimate physics from reconstructed human poses have been hampered by a lack of datasets with high-quality pose and force data for a variety of m… ▽ More

    Submitted 16 May, 2024; originally announced June 2024.

    Comments: 15 pages, 6 figures, 4 tables

  3. arXiv:2406.15025  [pdf, other

    cs.LG

    SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement Learning

    Authors: Matthias Weissenbacher, Rishabh Agarwal, Yoshinobu Kawahara

    Abstract: An open challenge in reinforcement learning (RL) is the effective deployment of a trained policy to new or slightly different situations as well as semantically-similar environments. We introduce Symmetry-Invariant Transformer (SiT), a scalable vision transformer (ViT) that leverages both local and global data patterns in a self-supervised manner to improve generalisation. Central to our approach… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 9 main pages, accepted to ICML2024

  4. arXiv:2405.18513  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.str-el

    Strong Chirality Suppression in 1-D correlated Weyl Semimetal (TaSe4)2I

    Authors: Utkarsh Khandelwal, Harshvardhan Jog, Shupeng Xu, Yicong Chen, Kejian Qu, Chengxi Zhao, Eugene Mele, Daniel P. Shoemaker, Ritesh Agarwal

    Abstract: The interaction of light with correlated Weyl semimetals (WSMs) provides a unique platform for exploring non-equilibrium phases and fundamental properties such as chirality. Here, we investigate the structural chirality of (TaSe4)2I, a correlated WSM, under weak optical pumping using Circular Photogalvanic Effect (CPGE) measurements and Raman spectroscopy. Surprisingly, we find that there is a los… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 21 pages, 4 figures

  5. arXiv:2404.14448  [pdf

    cs.SE

    Object-Oriented Architecture: A Software Engineering-Inspired Shape Grammar for Durands Plates

    Authors: Rohan Agarwal

    Abstract: Addressing the challenge of modular architectural design, this study presents a novel approach through the implementation of a shape grammar system using functional and object-oriented programming principles from computer science. The focus lies on the modular generation of plates in the style of French Neoclassical architect Jean-Nicolas-Louis Durand, known for his modular rule-based method to ar… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  6. arXiv:2404.11018  [pdf, other

    cs.LG cs.AI cs.CL

    Many-Shot In-Context Learning

    Authors: Rishabh Agarwal, Avi Singh, Lei M. Zhang, Bernd Bohnet, Luis Rosias, Stephanie Chan, Biao Zhang, Ankesh Anand, Zaheer Abbas, Azade Nova, John D. Co-Reyes, Eric Chu, Feryal Behbahani, Aleksandra Faust, Hugo Larochelle

    Abstract: Large language models (LLMs) excel at few-shot in-context learning (ICL) -- learning from a few examples provided in context at inference, without any weight updates. Newly expanded context windows allow us to investigate ICL with hundreds or thousands of examples -- the many-shot regime. Going from few-shot to many-shot, we observe significant performance gains across a wide variety of generative… ▽ More

    Submitted 22 May, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  7. arXiv:2404.04903  [pdf, other

    cs.LG cs.AI

    Online Learning under Haphazard Input Conditions: A Comprehensive Review and Analysis

    Authors: Rohit Agarwal, Arijit Das, Alexander Horsch, Krishna Agarwal, Dilip K. Prasad

    Abstract: The domain of online learning has experienced multifaceted expansion owing to its prevalence in real-life applications. Nonetheless, this progression operates under the assumption that the input feature space of the streaming data remains constant. In this survey paper, we address the topic of online learning in the context of haphazard inputs, explicitly foregoing such an assumption. We discuss,… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  8. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  9. arXiv:2403.03950  [pdf, other

    cs.LG cs.AI stat.ML

    Stop Regressing: Training Value Functions via Classification for Scalable Deep RL

    Authors: Jesse Farebrother, Jordi Orbay, Quan Vuong, Adrien Ali Taïga, Yevgen Chebotar, Ted Xiao, Alex Irpan, Sergey Levine, Pablo Samuel Castro, Aleksandra Faust, Aviral Kumar, Rishabh Agarwal

    Abstract: Value functions are a central component of deep reinforcement learning (RL). These functions, parameterized by neural networks, are trained using a mean squared error regression objective to match bootstrapped target values. However, scaling value-based RL methods that use regression to large networks, such as high-capacity Transformers, has proven challenging. This difficulty is in stark contrast… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  10. arXiv:2402.15514  [pdf

    cs.CL cs.AI

    Large Scale Generative AI Text Applied to Sports and Music

    Authors: Aaron Baughman, Stephen Hammer, Rahul Agarwal, Gozde Akay, Eduardo Morales, Tony Johnson, Leonid Karlinsky, Rogerio Feris

    Abstract: We address the problem of scaling up the production of media content, including commentary and personalized news stories, for large-scale sports and music events worldwide. Our approach relies on generative AI models to transform a large volume of multimodal data (e.g., videos, articles, real-time scoring feeds, statistics, and fact sheets) into coherent and fluent text. Based on this approach, we… ▽ More

    Submitted 27 February, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

    Comments: 9 pages, 8 figures, 5 tables

  11. arXiv:2402.09665  [pdf, other

    cond-mat.mes-hall physics.optics

    Simple realization of a fragile topological lattice with quasi flat-bands in a microcavity array

    Authors: Yuhui Wang, Shupeng Xu, Liang Feng, Ritesh Agarwal

    Abstract: Topological flat bands (TFBs) are increasingly recognized as an important paradigm to study topological effects in the context of strong correlation physics. As a representative example, recently it has been theoretically proposed that the topological non-triviality offers a unique contribution to flat-band superconductivity, which can potentially lead to a higher critical temperature of supercond… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  12. arXiv:2402.09371  [pdf, other

    cs.LG cs.AI cs.CL

    Transformers Can Achieve Length Generalization But Not Robustly

    Authors: Yongchao Zhou, Uri Alon, Xinyun Chen, Xuezhi Wang, Rishabh Agarwal, Denny Zhou

    Abstract: Length generalization, defined as the ability to extrapolate from shorter training sequences to longer test ones, is a significant challenge for language models. This issue persists even with large-scale Transformers handling relatively straightforward tasks. In this paper, we test the Transformer's ability of length generalization using the task of addition of two integers. We show that the succe… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  13. arXiv:2402.06457  [pdf, other

    cs.LG cs.AI cs.CL

    V-STaR: Training Verifiers for Self-Taught Reasoners

    Authors: Arian Hosseini, Xingdi Yuan, Nikolay Malkin, Aaron Courville, Alessandro Sordoni, Rishabh Agarwal

    Abstract: Common self-improvement approaches for large language models (LLMs), such as STaR (Zelikman et al., 2022), iteratively fine-tune LLMs on self-generated solutions to improve their problem-solving ability. However, these approaches discard the large amounts of incorrect solutions generated during this process, potentially neglecting valuable information in such solutions. To address this shortcoming… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  14. arXiv:2312.10954  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci physics.optics

    Opto-twistronic Hall effect in a three-dimensional spiral lattice

    Authors: Zhurun Ji, Yuzhou Zhao, Yicong Chen, Ziyan Zhu, Yuhui Wang, Wenjing Liu, Gaurav Modi, Eugene J. Mele, Song Jin, Ritesh Agarwal

    Abstract: Studies of moire systems have elucidated the exquisite effect of quantum geometry on the electronic bands and their properties, leading to the discovery of new correlated phases. However, most experimental studies have been confined to a few layers in the 2D limit. The extension of twistronics to its 3D limit, where the twist is extended into the third dimension between adjacent layers, remains un… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  15. arXiv:2312.06585  [pdf, other

    cs.LG

    Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

    Authors: Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J. Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron Parisi, Abhishek Kumar, Alex Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin Elsayed, Hanie Sedghi, Igor Mordatch, Isabelle Simpson, Izzeddin Gur, Jasper Snoek, Jeffrey Pennington, Jiri Hron , et al. (16 additional authors not shown)

    Abstract: Fine-tuning language models~(LMs) on human-generated data remains a prevalent practice. However, the performance of such models is often limited by the quantity and diversity of high-quality human data. In this paper, we explore whether we can go beyond human data on tasks where we have access to scalar feedback, for example, on math problems where one can verify correctness. To do so, we investig… ▽ More

    Submitted 17 April, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted to TMLR. Camera-ready version. First three authors contributed equally

  16. arXiv:2311.17894  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci cs.LG

    Learning and Controlling Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy

    Authors: Max Schwarzer, Jesse Farebrother, Joshua Greaves, Ekin Dogus Cubuk, Rishabh Agarwal, Aaron Courville, Marc G. Bellemare, Sergei Kalinin, Igor Mordatch, Pablo Samuel Castro, Kevin M. Roccapriore

    Abstract: We introduce a machine learning approach to determine the transition dynamics of silicon atoms on a single layer of carbon atoms, when stimulated by the electron beam of a scanning transmission electron microscope (STEM). Our method is data-centric, leveraging data collected on a STEM. The data samples are processed and filtered to produce symbolic representations, which we use to train a neural n… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  17. arXiv:2311.11958  [pdf, ps, other

    math.GM

    Existence and multiplicity for fractional Dirichlet problem with $γがんま(ξくしー)$-Laplacian equation and Nehari manifold

    Authors: J. Vanterler da C. Sousa, D. S. Oliveira, Ravi P. Agarwal

    Abstract: This paper is divided in two parts. In the first part, we prove coercivity results and minimization of the Euler energy functional. In the second part, we focus on the existence and multiplicity of a positive solution of fractional Dirichlet problem involving the $γがんま(ξくしー)$-Laplacian equation with non-negative weight functions in $\mathcal{H}^{αあるふぁ,βべーた;χかい}_{γがんま(ξくしー)}(Λらむだ,\mathbb{R})$ using some variational techni… ▽ More

    Submitted 3 October, 2023; originally announced November 2023.

    Comments: 14 pages

    MSC Class: 26A33; 35B38; 35D05; 35J60; 35J70; 58E05

  18. arXiv:2310.20144  [pdf, other

    cs.CL cs.AI cs.LG

    EELBERT: Tiny Models through Dynamic Embeddings

    Authors: Gabrielle Cohn, Rishika Agarwal, Deepanshu Gupta, Siddharth Patwardhan

    Abstract: We introduce EELBERT, an approach for compression of transformer-based models (e.g., BERT), with minimal impact on the accuracy of downstream tasks. This is achieved by replacing the input embedding layer of the model with dynamic, i.e. on-the-fly, embedding computations. Since the input embedding layer accounts for a significant fraction of the model size, especially for the smaller BERT variants… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023, Industry Track 9 pages, 2 figures, 5 tables

    MSC Class: 68T07 ACM Class: I.2.7; I.2.6

  19. arXiv:2310.08710  [pdf, other

    cs.RO cs.LG

    Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research

    Authors: Cole Gulino, Justin Fu, Wenjie Luo, George Tucker, Eli Bronstein, Yiren Lu, Jean Harb, Xinlei Pan, Yan Wang, Xiangyu Chen, John D. Co-Reyes, Rishabh Agarwal, Rebecca Roelofs, Yao Lu, Nico Montali, Paul Mougin, Zoey Yang, Brandyn White, Aleksandra Faust, Rowan McAllister, Dragomir Anguelov, Benjamin Sapp

    Abstract: Simulation is an essential tool to develop and benchmark autonomous vehicle planning software in a safe and cost-effective manner. However, realistic simulation requires accurate modeling of nuanced and complex multi-agent interactive behaviors. To address these challenges, we introduce Waymax, a new data-driven simulator for autonomous driving in multi-agent scenes, designed for large-scale simul… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  20. arXiv:2310.08461  [pdf, other

    cs.CL cs.AI cs.LG

    DistillSpec: Improving Speculative Decoding via Knowledge Distillation

    Authors: Yongchao Zhou, Kaifeng Lyu, Ankit Singh Rawat, Aditya Krishna Menon, Afshin Rostamizadeh, Sanjiv Kumar, Jean-François Kagy, Rishabh Agarwal

    Abstract: Speculative decoding (SD) accelerates large language model inference by employing a faster draft model for generating multiple tokens, which are then verified in parallel by the larger target model, resulting in the text generated according to the target model distribution. However, identifying a compact draft model that is well-aligned with the target model is challenging. To tackle this issue, w… ▽ More

    Submitted 30 March, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  21. arXiv:2309.16675  [pdf, ps, other

    math.GM math.FA

    Uncertainty principles associated with the short time quaternion coupled fractional Fourier transform

    Authors: Bivek Gupta, Amit K. Verma, Ravi P. Agarwal

    Abstract: In this paper, we extend the coupled fractional Fourier transform of a complex valued functions to that of the quaternion valued functions on $\mathbb{R}^4$ and call it the quaternion coupled fractional Fourier transform (QCFrFT). We obtain the sharp Hausdorff-Young inequality for QCFrFT and obtain the associated Rènyi uncertainty principle. We also define the short time quaternion coupled fractio… ▽ More

    Submitted 3 July, 2023; originally announced September 2023.

    MSC Class: 11R52; 42B10; 42A05

  22. arXiv:2309.08698  [pdf, other

    cs.AI cs.LG

    Modelling Irregularly Sampled Time Series Without Imputation

    Authors: Rohit Agarwal, Aman Sinha, Dilip K. Prasad, Marianne Clausel, Alexander Horsch, Mathieu Constant, Xavier Coubez

    Abstract: Modelling irregularly-sampled time series (ISTS) is challenging because of missing values. Most existing methods focus on handling ISTS by converting irregularly sampled data into regularly sampled data via imputation. These models assume an underlying missing mechanism leading to unwanted bias and sub-optimal performance. We present SLAN (Switch LSTM Aggregate Network), which utilizes a pack of L… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  23. arXiv:2309.04607  [pdf

    cs.CL cs.AI

    Linking Symptom Inventories using Semantic Textual Similarity

    Authors: Eamonn Kennedy, Shashank Vadlamani, Hannah M Lindsey, Kelly S Peterson, Kristen Dams OConnor, Kenton Murray, Ronak Agarwal, Houshang H Amiri, Raeda K Andersen, Talin Babikian, David A Baron, Erin D Bigler, Karen Caeyenberghs, Lisa Delano-Wood, Seth G Disner, Ekaterina Dobryakova, Blessen C Eapen, Rachel M Edelstein, Carrie Esopenko, Helen M Genova, Elbert Geuze, Naomi J Goodrich-Hunsaker, Jordan Grafman, Asta K Haberg, Cooper B Hodges , et al. (57 additional authors not shown)

    Abstract: An extensive library of symptom inventories has been developed over time to measure clinical symptoms, but this variety has led to several long standing issues. Most notably, results drawn from different settings and studies are not comparable, which limits reproducibility. Here, we present an artificial intelligence (AI) approach using semantic textual similarity (STS) to link symptoms and scores… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

  24. arXiv:2308.02317  [pdf, other

    cs.AI

    A Controllable Co-Creative Agent for Game System Design

    Authors: Rohan Agarwal, Zhiyu Lin, Mark Riedl

    Abstract: Many advancements have been made in procedural content generation for games, and with mixed-initiative co-creativity, have the potential for great benefits to human designers. However, co-creative systems for game generation are typically limited to specific genres, rules, or games, limiting the creativity of the designer. We seek to model games abstractly enough to apply to any genre, focusing on… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: Thesis

  25. arXiv:2306.13649  [pdf, other

    cs.LG cs.AI cs.CL

    On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes

    Authors: Rishabh Agarwal, Nino Vieillard, Yongchao Zhou, Piotr Stanczyk, Sabela Ramos, Matthieu Geist, Olivier Bachem

    Abstract: Knowledge distillation (KD) is widely used for compressing a teacher model to reduce its inference cost and memory footprint, by training a smaller student model. However, current KD methods for auto-regressive sequence models suffer from distribution mismatch between output sequences seen during training and those generated by the student during inference. To address this issue, we introduce Gene… ▽ More

    Submitted 16 January, 2024; v1 submitted 23 June, 2023; originally announced June 2023.

    Comments: Accepted at ICLR 2024. First two authors contributed equally

  26. arXiv:2306.10171  [pdf, other

    cs.LG cs.AI stat.ML

    Bootstrapped Representations in Reinforcement Learning

    Authors: Charline Le Lan, Stephen Tu, Mark Rowland, Anna Harutyunyan, Rishabh Agarwal, Marc G. Bellemare, Will Dabney

    Abstract: In reinforcement learning (RL), state representations are key to dealing with large or continuous state spaces. While one of the promises of deep learning algorithms is to automatically construct features well-tuned for the task they try to solve, such a representation might not emerge from end-to-end training of deep RL agents. To mitigate this issue, auxiliary objectives are often incorporated i… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: ICML 2023

  27. arXiv:2306.05974  [pdf, other

    physics.optics

    Taxonomy of hybridly polarized Stokes vortex beams

    Authors: Gauri Arora, Ankit Butola, Ruchi Rajput, Rohit Agarwal, Krishna Agarwal, Alexander Horsch, Dilip K Prasad, Paramasivam Senthilkumaran

    Abstract: Structured beams carrying topological defects, namely phase and Stokes singularities, have gained extensive interest in numerous areas of optics. The non-separable spin and orbital angular momentum states of hybridly polarized Stokes singular beams provide additional freedom for manipulating optical fields. However, the characterization of hybridly polarized Stokes vortex beams remains challenging… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

  28. arXiv:2305.19452  [pdf, other

    cs.LG cs.AI

    Bigger, Better, Faster: Human-level Atari with human-level efficiency

    Authors: Max Schwarzer, Johan Obando-Ceron, Aaron Courville, Marc Bellemare, Rishabh Agarwal, Pablo Samuel Castro

    Abstract: We introduce a value-based RL agent, which we call BBF, that achieves super-human performance in the Atari 100K benchmark. BBF relies on scaling the neural networks used for value estimation, as well as a number of other design choices that enable this scaling in a sample-efficient manner. We conduct extensive analyses of these design choices and provide insights for future work. We end with a dis… ▽ More

    Submitted 13 November, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: ICML 2023, revised version

  29. arXiv:2305.17222  [pdf, other

    cs.OS

    Karma: Resource Allocation for Dynamic Demands

    Authors: Midhul Vuppalapati, Giannis Fikioris, Rachit Agarwal, Asaf Cidon, Anurag Khandelwal, Eva Tardos

    Abstract: We consider the problem of fair resource allocation in a system where user demands are dynamic, that is, where user demands vary over time. Our key observation is that the classical max-min fairness algorithm for resource allocation provides many desirable properties (e.g., Pareto efficiency, strategy-proofness, and fairness), but only under the strong assumption of user demands being static over… ▽ More

    Submitted 7 July, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Full version of paper accepted to USENIX OSDI 2023 with proofs of theoretical guarantees

  30. arXiv:2305.14356  [pdf

    q-bio.NC

    Creativity as Variations on a Theme: Formalizations, Evidence, and Engineered Applications

    Authors: Rohan Agarwal

    Abstract: There are many philosophies and theories on what creativity is and how it works, but one popular idea is that of variations on a theme and intersection of concepts. This literature review explores philosophical proposals of how creativity emerges from variations on a theme, and how formalizations of these proposals in human subject studies and computational methods result in creativity. Specifical… ▽ More

    Submitted 7 May, 2023; originally announced May 2023.

  31. arXiv:2305.10201  [pdf

    cs.AI cs.CY

    Echoes of Biases: How Stigmatizing Language Affects AI Performance

    Authors: Yizhi Liu, Weiguang Wang, Guodong Gordon Gao, Ritu Agarwal

    Abstract: Electronic health records (EHRs) serve as an essential data source for the envisioned artificial intelligence (AI)-driven transformation in healthcare. However, clinician biases reflected in EHR notes can lead to AI models inheriting and amplifying these biases, perpetuating health disparities. This study investigates the impact of stigmatizing language (SL) in EHR notes on mortality prediction us… ▽ More

    Submitted 12 June, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: 54 pages, 9 figures

  32. arXiv:2305.07465  [pdf, other

    cs.AI

    Beyond Prompts: Exploring the Design Space of Mixed-Initiative Co-Creativity Systems

    Authors: Zhiyu Lin, Upol Ehsan, Rohan Agarwal, Samihan Dani, Vidushi Vashishth, Mark Riedl

    Abstract: Generative Artificial Intelligence systems have been developed for image, code, story, and game generation with the goal of facilitating human creativity. Recent work on neural generative systems has emphasized one particular means of interacting with AI systems: the user provides a specification, usually in the form of prompts, and the AI system generates the content. However, there are other con… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted by ICCC'23

    Journal ref: Proceedings of 14th International Conference on Computational Creativity (2023), 64-73

  33. arXiv:2304.14250  [pdf, ps, other

    math.FA

    Discrete Rubio de Francia extrapolation theorem via factorization of weights and iterated algorithms

    Authors: S. H. Saker, A. I. Saied, R. P. Agarwal

    Abstract: In this paper, we prove a discrete Rubio de Francia extrapolation theorem via factorization of discrete Muckenhoupt weights and discrete iterated Rubio de Francia algorithm and its duality.

    Submitted 27 April, 2023; originally announced April 2023.

  34. arXiv:2304.13170  [pdf

    cond-mat.str-el cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.other

    Optically induced symmetry breaking due to nonequilibrium steady state formation in charge density wave material 1T-TiSe2

    Authors: Harshvardhan Jog, Luminita Harnagea, Dibyata Rout, Takashi Taniguchi, Kenji Watanabe, Eugene J. Mele, Ritesh Agarwal

    Abstract: The strongly correlated charge density wave (CDW) phase of 1T-TiSe$_2$ is being extensively researched to verify the claims of a unique chiral order due to the presence of three equivalent Fermi wavevectors involved in the CDW formation. Characterization of the symmetries is therefore critical to understand the origin of their intriguing properties but can be complicated by the coupling of the ele… ▽ More

    Submitted 19 November, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Journal ref: Nano Letters 2023, 23, 20, 9634-9640

  35. arXiv:2304.12567  [pdf, other

    cs.LG cs.AI stat.ML

    Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks

    Authors: Jesse Farebrother, Joshua Greaves, Rishabh Agarwal, Charline Le Lan, Ross Goroshin, Pablo Samuel Castro, Marc G. Bellemare

    Abstract: Auxiliary tasks improve the representations learned by deep reinforcement learning agents. Analytically, their effect is reasonably well understood; in practice, however, their primary use remains in support of a main learning objective, rather than as a method for learning representations. This is perhaps surprising given that many auxiliary tasks are defined procedurally, and hence can be treate… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: ICLR 2023. Code and models are available at https://github.com/google-research/google-research/tree/master/pvn 22 pages, 8 figures

  36. arXiv:2304.11695  [pdf, ps, other

    math.CV

    Hankel determinant for a general subclass of m-fold symmetric bi-univalent functions defined by Ruscheweyh operator

    Authors: Pishtiwan Othman Sabir, Ravi P. Agarwal, Shabaz Jalil MohammedFaeq, Pshtiwan Othman Mohammed, Nejmeddine Chorfi, Thabet Abdeljawad

    Abstract: Making use of the Hankel determinant and the Ruscheweyh derivative, in this work, we consider a general subclass of m-fold symmetric normalized bi-univalent functions defined in the open unit disk. Moreover, we investigate the bounds for the second Hankel determinant of this class and some consequences of the results are presented. In addition, to demonstrate the accuracy on some functions and con… ▽ More

    Submitted 30 August, 2023; v1 submitted 23 April, 2023; originally announced April 2023.

    Comments: 16 pages, 7 figures

    MSC Class: 30C45; 30C50; 26A51; 26B05; 15A15

  37. arXiv:2304.09948  [pdf

    cs.CL cs.AI

    Catch Me If You Can: Identifying Fraudulent Physician Reviews with Large Language Models Using Generative Pre-Trained Transformers

    Authors: Aishwarya Deep Shukla, Laksh Agarwal, Jie Mein, Goh, Guodong, Gao, Ritu Agarwal

    Abstract: The proliferation of fake reviews of doctors has potentially detrimental consequences for patient well-being and has prompted concern among consumer protection groups and regulatory bodies. Yet despite significant advancements in the fields of machine learning and natural language processing, there remains limited comprehension of the characteristics differentiating fraudulent from authentic revie… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

  38. arXiv:2304.07598  [pdf, other

    cs.CR

    Understanding Rug Pulls: An In-Depth Behavioral Analysis of Fraudulent NFT Creators

    Authors: Trishie Sharma, Rachit Agarwal, Sandeep Kumar Shukla

    Abstract: The explosive growth of non-fungible tokens (NFTs) on Web3 has created a new frontier for digital art and collectibles, but also an emerging space for fraudulent activities. This study provides an in-depth analysis of NFT rug pulls, which are fraudulent schemes aimed at stealing investors' funds. Using data from 758 rug pulls across 10 NFT marketplaces, we examine the structural and behavioral pro… ▽ More

    Submitted 15 April, 2023; originally announced April 2023.

  39. arXiv:2303.17912  [pdf, other

    cs.CV cs.GR

    CIRCLE: Capture In Rich Contextual Environments

    Authors: Joao Pedro Araujo, Jiaman Li, Karthik Vetrivel, Rishi Agarwal, Deepak Gopinath, Jiajun Wu, Alexander Clegg, C. Karen Liu

    Abstract: Synthesizing 3D human motion in a contextual, ecological environment is important for simulating realistic activities people perform in the real world. However, conventional optics-based motion capture systems are not suited for simultaneously capturing human movements and complex scenes. The lack of rich contextual 3D human motion datasets presents a roadblock to creating high-quality generative… ▽ More

    Submitted 31 March, 2023; originally announced March 2023.

  40. arXiv:2303.12617  [pdf, other

    cond-mat.mtrl-sci physics.optics

    Absence of topological protection of the interface states in $\mathbb{Z}_2$ photonic crystals

    Authors: Shupeng Xu, Yuhui Wang, Ritesh Agarwal

    Abstract: Inspired from electronic systems, topological photonics aims to engineer new optical devices with robust properties. In many cases, the ideas from topological phases protected by internal symmetries in fermionic systems are extended to those protected by crystalline symmetries. One such popular photonic crystal model was proposed by Wu and Hu in 2015 for realizing a bosonic $\mathbb{Z}_2$ topologi… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

    Journal ref: Phys. Rev. Lett. 131, 053802 (2023)

  41. arXiv:2303.09103  [pdf

    eess.IV cs.CV cs.LG

    Machine learning based biomedical image processing for echocardiographic images

    Authors: Ayesha Heena, Nagashettappa Biradar, Najmuddin M. Maroof, Surbhi Bhatia, Rashmi Agarwal, Kanta Prasad

    Abstract: The popularity of Artificial intelligence and machine learning have prompted researchers to use it in the recent researches. The proposed method uses K-Nearest Neighbor (KNN) algorithm for segmentation of medical images, extracting of image features for analysis by classifying the data based on the neural networks. Classification of the images in medical imaging is very important, KNN is one suita… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: 10 figures 4 tables

    MSC Class: Computers

  42. arXiv:2303.08533  [pdf, other

    physics.acc-ph hep-ex hep-ph

    Towards a Muon Collider

    Authors: Carlotta Accettura, Dean Adams, Rohit Agarwal, Claudia Ahdida, Chiara Aimè, Nicola Amapane, David Amorim, Paolo Andreetto, Fabio Anulli, Robert Appleby, Artur Apresyan, Aram Apyan, Sergey Arsenyev, Pouya Asadi, Mohammed Attia Mahmoud, Aleksandr Azatov, John Back, Lorenzo Balconi, Laura Bandiera, Roger Barlow, Nazar Bartosik, Emanuela Barzi, Fabian Batsch, Matteo Bauce, J. Scott Berg , et al. (272 additional authors not shown)

    Abstract: A muon collider would enable the big jump ahead in energy reach that is needed for a fruitful exploration of fundamental interactions. The challenges of producing muon collisions at high luminosity and 10 TeV centre of mass energy are being investigated by the recently-formed International Muon Collider Collaboration. This Review summarises the status and the recent advances on muon colliders desi… ▽ More

    Submitted 27 November, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 118 pages, 103 figures

  43. arXiv:2303.05155  [pdf, other

    cs.LG cs.AI

    Aux-Drop: Handling Haphazard Inputs in Online Learning Using Auxiliary Dropouts

    Authors: Rohit Agarwal, Deepak Gupta, Alexander Horsch, Dilip K. Prasad

    Abstract: Many real-world applications based on online learning produce streaming data that is haphazard in nature, i.e., contains missing features, features becoming obsolete in time, the appearance of new features at later points in time and a lack of clarity on the total number of input features. These challenges make it hard to build a learnable system for such applications, and almost no work exists in… ▽ More

    Submitted 31 May, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

    Comments: Accepted at Transactions on Machine Learning Research (TMLR). Link: https://openreview.net/pdf?id=R9CgBkeZ6Z

    Journal ref: Transactions on Machine Learning Research, 2023

  44. arXiv:2303.03050  [pdf, other

    cs.CV cs.AI cs.IR

    MABNet: Master Assistant Buddy Network with Hybrid Learning for Image Retrieval

    Authors: Rohit Agarwal, Gyanendra Das, Saksham Aggarwal, Alexander Horsch, Dilip K. Prasad

    Abstract: Image retrieval has garnered growing interest in recent times. The current approaches are either supervised or self-supervised. These methods do not exploit the benefits of hybrid learning using both supervision and self-supervision. We present a novel Master Assistant Buddy Network (MABNet) for image retrieval which incorporates both learning mechanisms. MABNet consists of master and assistant bl… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted at International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2023

  45. arXiv:2302.12902  [pdf, other

    cs.LG

    The Dormant Neuron Phenomenon in Deep Reinforcement Learning

    Authors: Ghada Sokar, Rishabh Agarwal, Pablo Samuel Castro, Utku Evci

    Abstract: In this work we identify the dormant neuron phenomenon in deep reinforcement learning, where an agent's network suffers from an increasing number of inactive neurons, thereby affecting network expressivity. We demonstrate the presence of this phenomenon across a variety of algorithms and environments, and highlight its effect on learning. To address this issue, we propose a simple and effective me… ▽ More

    Submitted 13 June, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: Oral at ICML 2023

  46. arXiv:2302.08474  [pdf, other

    cs.CV cs.AI

    Efficient 3D Object Reconstruction using Visual Transformers

    Authors: Rohan Agarwal, Wei Zhou, Xiaofeng Wu, Yuhan Li

    Abstract: Reconstructing a 3D object from a 2D image is a well-researched vision problem, with many kinds of deep learning techniques having been tried. Most commonly, 3D convolutional approaches are used, though previous work has shown state-of-the-art methods using 2D convolutions that are also significantly more efficient to train. With the recent rise of transformers for vision tasks, often outperformin… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

  47. arXiv:2302.00141  [pdf, other

    cs.LG cs.AI stat.ML

    Revisiting Bellman Errors for Offline Model Selection

    Authors: Joshua P. Zitovsky, Daniel de Marchi, Rishabh Agarwal, Michael R. Kosorok

    Abstract: Offline model selection (OMS), that is, choosing the best policy from a set of many policies given only logged data, is crucial for applying offline RL in real-world settings. One idea that has been extensively explored is to select policies based on the mean squared Bellman error (MSBE) of the associated Q-functions. However, previous work has struggled to obtain adequate OMS performance with Bel… ▽ More

    Submitted 6 June, 2023; v1 submitted 31 January, 2023; originally announced February 2023.

    Comments: Published in ICML 2023

    ACM Class: I.2.8; I.6.4

    Journal ref: In ICML (pp. 43369-43406). PMLR (2023)

  48. arXiv:2212.06409  [pdf, ps, other

    math.FA

    A New Shrinking projection Algorithm for an infinite family of Bregman weak relatively nonexpansive mappings in a Banach Space

    Authors: Bijan Orouji, Ebrahim Soori, Donal O'Regan, Ravi P. Agarwal

    Abstract: In this paper, using a new shrinking projection method and generalized resolvents of maximal monotone operators and generalized projections, we consider the strong convergence for finding a common point of the fixed points of a Bregman quasi-nonexpansive mapping, and common fixed points of a infinite family of Bregman weak relatively nonexpansive mappings, and common zero points of a finite family… ▽ More

    Submitted 20 April, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: 28 pages. arXiv admin note: substantial text overlap with arXiv:2107.13254

    MSC Class: 47H10

  49. arXiv:2212.04025  [pdf, other

    cs.LG cs.AI stat.ML

    A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces

    Authors: Charline Le Lan, Joshua Greaves, Jesse Farebrother, Mark Rowland, Fabian Pedregosa, Rishabh Agarwal, Marc G. Bellemare

    Abstract: Many machine learning problems encode their data as a matrix with a possibly very large number of rows and columns. In several applications like neuroscience, image compression or deep reinforcement learning, the principal subspace of such a matrix provides a useful, low-dimensional representation of individual data. Here, we are interested in determining the $d$-dimensional principal subspace of… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: 8 pages in main content, 2 pages of bibliography and 5 pages in Appendix

  50. arXiv:2211.15144  [pdf, other

    cs.LG

    Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes

    Authors: Aviral Kumar, Rishabh Agarwal, Xinyang Geng, George Tucker, Sergey Levine

    Abstract: The potential of offline reinforcement learning (RL) is that high-capacity models trained on large, heterogeneous datasets can lead to agents that generalize broadly, analogously to similar advances in vision and NLP. However, recent works argue that offline RL methods encounter unique challenges to scaling up model capacity. Drawing on the learnings from these works, we re-examine previous design… ▽ More

    Submitted 17 April, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: Accepted at ICLR 2023. Project website: https://sites.google.com/view/scaling-offlinerl/home