(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–1 of 1 results for author: Diachkov, D

.
  1. arXiv:2311.05778  [pdf, other

    cs.CV cs.AI

    DONUT-hole: DONUT Sparsification by Harnessing Knowledge and Optimizing Learning Efficiency

    Authors: Azhar Shaikh, Michael Cochez, Denis Diachkov, Michiel de Rijcke, Sahar Yousefi

    Abstract: This paper introduces DONUT-hole, a sparse OCR-free visual document understanding (VDU) model that addresses the limitations of its predecessor model, dubbed DONUT. The DONUT model, leveraging a transformer architecture, overcoming the challenges of separate optical character recognition (OCR) and visual semantic understanding (VSU) components. However, its deployment in production environments an… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.