(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–6 of 6 results for author: Rassin, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16048  [pdf, other

    cs.IR

    Evaluating D-MERIT of Partial-annotation on Information Retrieval

    Authors: Royi Rassin, Yaron Fairstein, Oren Kalinsky, Guy Kushilevitz, Nachshon Cohen, Alexander Libov, Yoav Goldberg

    Abstract: Retrieval models are often evaluated on partially-annotated datasets. Each query is mapped to a few relevant texts and the remaining corpus is assumed to be irrelevant. As a result, models that successfully retrieve false negatives are punished in evaluation. Unfortunately, completely annotating all texts for every query is not resource efficient. In this work, we show that using partially-annotat… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Our dataset can be downloaded from https://D-MERIT.github.io

  2. arXiv:2406.10210  [pdf, other

    cs.CV cs.AI cs.GR

    Make It Count: Text-to-Image Generation with an Accurate Number of Objects

    Authors: Lital Binyamin, Yoad Tewel, Hilit Segev, Eran Hirsch, Royi Rassin, Gal Chechik

    Abstract: Despite the unprecedented success of text-to-image diffusion models, controlling the number of depicted objects using text is surprisingly hard. This is important for various applications from technical documents, to children's books to illustrating cooking recipes. Generating object-correct counts is fundamentally challenging because the generative model needs to keep a sense of separate identity… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Project page is at https://make-it-count-paper.github.io/

  3. arXiv:2311.17946  [pdf, other

    cs.CV cs.AI cs.CL

    DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback

    Authors: Jiao Sun, Deqing Fu, Yushi Hu, Su Wang, Royi Rassin, Da-Cheng Juan, Dana Alon, Charles Herrmann, Sjoerd van Steenkiste, Ranjay Krishna, Cyrus Rashtchian

    Abstract: Despite their wide-spread success, Text-to-Image models (T2I) still struggle to produce images that are both aesthetically pleasing and faithful to the user's input text. We introduce DreamSync, a model-agnostic training algorithm by design that improves T2I models to be faithful to the text input. DreamSync builds off a recent insight from TIFA's evaluation framework -- that large vision-language… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  4. arXiv:2306.08877  [pdf, other

    cs.CL cs.CV

    Linguistic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment

    Authors: Royi Rassin, Eran Hirsch, Daniel Glickman, Shauli Ravfogel, Yoav Goldberg, Gal Chechik

    Abstract: Text-conditioned image generation models often generate incorrect associations between entities and their visual attributes. This reflects an impaired mapping between linguistic binding of entities and modifiers in the prompt and visual binding of the corresponding elements in the generated image. As one notable example, a query like "a pink sunflower and a yellow flamingo" may incorrectly produce… ▽ More

    Submitted 23 January, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted to NeurIPS 2023 (oral). Our code is publicly available at https://github.com/RoyiRa/Syntax-Guided-Generation

  5. arXiv:2305.16740  [pdf, other

    cs.CL

    Conjunct Resolution in the Face of Verbal Omissions

    Authors: Royi Rassin, Yoav Goldberg, Reut Tsarfaty

    Abstract: Verbal omissions are complex syntactic phenomena in VP coordination structures. They occur when verbs and (some of) their arguments are omitted from subsequent clauses after being explicitly stated in an initial clause. Recovering these omitted elements is necessary for accurate interpretation of the sentence, and while humans easily and intuitively fill in the missing information, state-of-the-ar… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  6. arXiv:2210.10606  [pdf, other

    cs.CL cs.LG

    DALLE-2 is Seeing Double: Flaws in Word-to-Concept Mapping in Text2Image Models

    Authors: Royi Rassin, Shauli Ravfogel, Yoav Goldberg

    Abstract: We study the way DALLE-2 maps symbols (words) in the prompt to their references (entities or properties of entities in the generated image). We show that in stark contrast to the way human process language, DALLE-2 does not follow the constraint that each word has a single role in the interpretation, and sometimes re-use the same symbol for different purposes. We collect a set of stimuli that refl… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 5 pages, BlackboxNLP @ EMNLP 2022