(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–18 of 18 results for author: Hruschka, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.00584  [pdf, other

    cs.DB cs.AI

    A Blueprint Architecture of Compound AI Systems for Enterprise

    Authors: Eser Kandogan, Sajjadur Rahman, Nikita Bhutani, Dan Zhang, Rafael Li Chen, Kushan Mitra, Sairam Gurajada, Pouya Pezeshkpour, Hayate Iso, Yanlin Feng, Hannah Kim, Chen Shen, Jin Wang, Estevam Hruschka

    Abstract: Large Language Models (LLMs) have showcased remarkable capabilities surpassing conventional NLP challenges, creating opportunities for use in production use cases. Towards this goal, there is a notable shift to building compound AI systems, wherein LLMs are integrated into an expansive software infrastructure with many components like models, retrievers, databases and tools. In this paper, we intr… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Compound AI Systems Workshop at the Data+AI Summit 2024

  2. arXiv:2404.00211  [pdf, other

    cs.CL cs.LG

    Multi-Conditional Ranking with Large Language Models

    Authors: Pouya Pezeshkpour, Estevam Hruschka

    Abstract: Utilizing large language models (LLMs) to rank a set of items has become a common approach in recommendation and retrieval systems. Typically, these systems focus on ordering a substantial number of documents in a monotonic order based on a given query. However, real-world scenarios often present a different challenge: ranking a comparatively smaller set of items, but according to a variety of div… ▽ More

    Submitted 9 August, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

  3. arXiv:2402.03291  [pdf, other

    cs.HC cs.DB

    Knowledge Acquisition and Integration with Expert-in-the-loop

    Authors: Sajjadur Rahman, Frederick Choi, Hannah Kim, Dan Zhang, Estevam Hruschka

    Abstract: Constructing and serving knowledge graphs (KGs) is an iterative and human-centered process involving on-demand programming and analysis. In this paper, we present Kyurem, a programmable and interactive widget library that facilitates human-in-the-loop knowledge acquisition and integration to enable continuous curation a knowledge graph (KG). Kyurem provides a seamless environment within computatio… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  4. arXiv:2402.01108  [pdf, other

    cs.CL cs.LG

    Reasoning Capacity in Multi-Agent Systems: Limitations, Challenges and Human-Centered Solutions

    Authors: Pouya Pezeshkpour, Eser Kandogan, Nikita Bhutani, Sajjadur Rahman, Tom Mitchell, Estevam Hruschka

    Abstract: Remarkable performance of large language models (LLMs) in a variety of tasks brings forth many opportunities as well as challenges of utilizing them in production settings. Towards practical adoption of LLMs, multi-agent systems hold great promise to augment, integrate, and orchestrate LLMs in the larger context of enterprise platforms that use existing proprietary data and models to tackle comple… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  5. arXiv:2311.06414  [pdf, other

    cs.LG cs.CL

    Knowledge Graphs are not Created Equal: Exploring the Properties and Structure of Real KGs

    Authors: Nedelina Teneva, Estevam Hruschka

    Abstract: Despite the recent popularity of knowledge graph (KG) related tasks and benchmarks such as KG embeddings, link prediction, entity alignment and evaluation of the reasoning abilities of pretrained language models as KGs, the structure and properties of real KGs are not well studied. In this paper, we perform a large scale comparative study of 29 real KG datasets from diverse domains such as the nat… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: Accepted at NeurIPS 2023 New Frontiers in Graph Learning Workshop (NeurIPS GLFrontiers 2023)

  6. arXiv:2311.06383  [pdf, other

    cs.CL cs.LG

    Distilling Large Language Models using Skill-Occupation Graph Context for HR-Related Tasks

    Authors: Pouya Pezeshkpour, Hayate Iso, Thom Lake, Nikita Bhutani, Estevam Hruschka

    Abstract: Numerous HR applications are centered around resumes and job descriptions. While they can benefit from advancements in NLP, particularly large language models, their real-world adoption faces challenges due to absence of comprehensive benchmarks for various HR tasks, and lack of smaller models with competitive capabilities. In this paper, we aim to bridge this gap by introducing the Resume-Job Des… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  7. arXiv:2311.05085  [pdf, other

    cs.CL cs.AI

    Characterizing Large Language Models as Rationalizers of Knowledge-intensive Tasks

    Authors: Aditi Mishra, Sajjadur Rahman, Hannah Kim, Kushan Mitra, Estevam Hruschka

    Abstract: Large language models (LLMs) are proficient at generating fluent text with minimal task-specific supervision. Yet, their ability to provide well-grounded rationalizations for knowledge-intensive tasks remains under-explored. Such tasks, like commonsense multiple-choice questions, require rationales based on world knowledge to support predictions and refute alternate options. We consider the task o… ▽ More

    Submitted 31 January, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

  8. arXiv:2309.07382  [pdf, other

    cs.CL

    Less is More for Long Document Summary Evaluation by LLMs

    Authors: Yunshu Wu, Hayate Iso, Pouya Pezeshkpour, Nikita Bhutani, Estevam Hruschka

    Abstract: Large Language Models (LLMs) have shown promising performance in summary evaluation tasks, yet they face challenges such as high computational costs and the Lost-in-the-Middle problem where important information in the middle of long documents is often overlooked. To address these issues, this paper introduces a novel approach, Extract-then-Evaluate, which involves extracting key sentences from a… ▽ More

    Submitted 18 January, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: EACL (main)

  9. arXiv:2308.13676  [pdf, other

    cs.CL cs.AI cs.LG

    Rethinking Language Models as Symbolic Knowledge Graphs

    Authors: Vishwas Mruthyunjaya, Pouya Pezeshkpour, Estevam Hruschka, Nikita Bhutani

    Abstract: Symbolic knowledge graphs (KGs) play a pivotal role in knowledge-centric applications such as search, question answering and recommendation. As contemporary language models (LMs) trained on extensive textual data have gained prominence, researchers have extensively explored whether the parametric knowledge within these models can match up to that present in knowledge graphs. Various methodologies… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

  10. arXiv:2308.11483  [pdf, other

    cs.CL cs.AI cs.LG

    Large Language Models Sensitivity to The Order of Options in Multiple-Choice Questions

    Authors: Pouya Pezeshkpour, Estevam Hruschka

    Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities in various NLP tasks. However, previous works have shown these models are sensitive towards prompt wording, and few-shot demonstrations and their order, posing challenges to fair assessment of these models. As these models become more powerful, it becomes imperative to understand and address these limitations. In this paper, we… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  11. arXiv:2301.03656  [pdf, other

    cs.DB cs.AI cs.HC

    Towards Multifaceted Human-Centered AI

    Authors: Sajjadur Rahman, Hannah Kim, Dan Zhang, Estevam Hruschka, Eser Kandogan

    Abstract: Human-centered AI workflows involve stakeholders with multiple roles interacting with each other and automated agents to accomplish diverse tasks. In this paper, we call for a holistic view when designing support mechanisms, such as interaction paradigms, interfaces, and systems, for these multifaceted workflows.

    Submitted 9 January, 2023; originally announced January 2023.

    Comments: Workshop on Human-Centered AI at NeurIPS 2022

  12. arXiv:2301.03095  [pdf, other

    cs.HC cs.CL

    MEGAnno: Exploratory Labeling for NLP in Computational Notebooks

    Authors: Dan Zhang, Hannah Kim, Rafael Li Chen, Eser Kandogan, Estevam Hruschka

    Abstract: We present MEGAnno, a novel exploratory annotation framework designed for NLP researchers and practitioners. Unlike existing labeling tools that focus on data labeling only, our framework aims to support a broader, iterative ML workflow including data exploration and model development. With MEGAnno's API, users can programmatically explore the data through sophisticated search and automated sugges… ▽ More

    Submitted 8 January, 2023; originally announced January 2023.

    Comments: Data Science with Human-in-the-loop (DaSH) @ EMNLP 2022. Demo: https://meganno.github.io

  13. arXiv:2212.10708  [pdf, other

    cs.CL

    Zero-shot Triplet Extraction by Template Infilling

    Authors: Bosung Kim, Hayate Iso, Nikita Bhutani, Estevam Hruschka, Ndapa Nakashole, Tom Mitchell

    Abstract: The task of triplet extraction aims to extract pairs of entities and their corresponding relations from unstructured text. Most existing methods train an extraction model on training data involving specific target relations, and are incapable of extracting new relations that were not observed at training time. Generalizing the model to unseen relations typically requires fine-tuning on synthetic t… ▽ More

    Submitted 20 September, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: IJCNLP-AACL 2023 (main)

  14. arXiv:2106.12767  [pdf, other

    cs.CL cs.DB cs.HC cs.LG

    TagRuler: Interactive Tool for Span-Level Data Programming by Demonstration

    Authors: Dongjin Choi, Sara Evensen, Çağatay Demiralp, Estevam Hruschka

    Abstract: Despite rapid developments in the field of machine learning research, collecting high-quality labels for supervised learning remains a bottleneck for many applications. This difficulty is exacerbated by the fact that state-of-the-art models for NLP tasks are becoming deeper and more complex, often increasing the amount of training data required even for fine-tuning. Weak supervision methods, inclu… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: WWW'21 Demo

  15. arXiv:1910.00752  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Ward2ICU: A Vital Signs Dataset of Inpatients from the General Ward

    Authors: Daniel Severo, Flávio Amaro, Estevam R. Hruschka Jr, André Soares de Moura Costa

    Abstract: We present a proxy dataset of vital signs with class labels indicating patient transitions from the ward to intensive care units called Ward2ICU. Patient privacy is protected using a Wasserstein Generative Adversarial Network to implicitly learn an approximation of the data distribution, allowing us to sample synthetic data. The quality of data generation is assessed directly on the binary classif… ▽ More

    Submitted 1 October, 2019; originally announced October 2019.

  16. arXiv:1211.2304  [pdf, other

    cs.LG stat.ML

    Probabilistic Combination of Classifier and Cluster Ensembles for Non-transductive Learning

    Authors: Ayan Acharya, Eduardo R. Hruschka, Joydeep Ghosh, Badrul Sarwar, Jean-David Ruvini

    Abstract: Unsupervised models can provide supplementary soft constraints to help classify new target data under the assumption that similar objects in the target set are more likely to share the same class label. Such models can also help detect possible differences between training and target distributions, which is useful in applications where concept drift may take place. This paper describes a Bayesian… ▽ More

    Submitted 10 November, 2012; originally announced November 2012.

  17. arXiv:1206.0994  [pdf, ps, other

    cs.LG

    An Optimization Framework for Semi-Supervised and Transfer Learning using Multiple Classifiers and Clusterers

    Authors: Ayan Acharya, Eduardo R. Hruschka, Joydeep Ghosh, Sreangsu Acharyya

    Abstract: Unsupervised models can provide supplementary soft constraints to help classify new, "target" data since similar instances in the target set are more likely to share the same class label. Such models can also help detect possible differences between training and target distributions, which is useful in applications where concept drift may take place, as in transfer learning settings. This paper de… ▽ More

    Submitted 19 April, 2012; originally announced June 2012.

    ACM Class: I.5.2; I.5.3; I.5.4

  18. arXiv:1204.4521  [pdf, ps, other

    cs.LG cs.CV stat.ML

    A Privacy-Aware Bayesian Approach for Combining Classifier and Cluster Ensembles

    Authors: Ayan Acharya, Eduardo R. Hruschka, Joydeep Ghosh

    Abstract: This paper introduces a privacy-aware Bayesian approach that combines ensembles of classifiers and clusterers to perform semi-supervised and transductive learning. We consider scenarios where instances and their classification/clustering results are distributed across different data sites and have sharing restrictions. As a special case, the privacy aware computation of the model when instances of… ▽ More

    Submitted 19 April, 2012; originally announced April 2012.

    ACM Class: I.5.4