Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
-
Updated
Nov 12, 2024 - Python
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
Document Layout Analysis resources repos for development with PdfPig.
hand-written dictionaries from the FreeDict project
The main TEI Publisher app
Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project
Kitodo.Presentation is a feature-rich framework for building a METS- or IIIF-based digital library. It is part of the Kitodo Digital Library Suite.
PhiloLogic4
Lili Elbe Digital Archive practicum - learning markup via an engaged markdown community. Visit our wiki!
a repository to help introduce and orient students to the GitHub collaboration environment, and to support DH classes.
[LREC 2020] EtymDB, an Etymological DataBase (v2.1)
A highly customizable plugin for setting up and activating remote-driven autocompletions of attribute values in the oXygen XML Editor.
a repository for materials related to teaching and writing on technologies of up-conversion and project development with the XML family of languages, featuring regex, XPath, XQuery, XSLT, and Schematron.
Diachronic Spanish Sonnet Corpus. Canonical and minor authors in Spanish (Europe, America and Asia): 15th to 20th century
Add a description, image, and links to the tei topic page so that developers can more easily learn about it.
To associate your repository with the tei topic, visit your repo's landing page and select "manage topics."