A scraper which extracts data from the German Wiktionary HTML dump.
-
Updated
Jul 19, 2022 - Python
A scraper which extracts data from the German Wiktionary HTML dump.
Wiktionary Parser written in Ruby
🇫🇷 Source code for frenchhomophones website. [inactive]
A parser for the Kurdish Wiktionary (NPM package)
A simple TypeScript client + types for parsed Wiktionary data from Kaikki.org, parsed with wiktextract
Selected data processing scripts including language agnostic multilingual wiktionary parser
Extract hyphenation from Italian Wiktionary
Parses the Russian Wiktionary HTML dumps into JSON and generates ereader dictionaries
This repository contains a python script for parsing an xml dump of the Italian Wiktionary (Wikizionario); it also contains the parsed dictionary in a JSON file and a ONLI (italian database of neologisms) scraper with the scraped data in a CSV file
Web interface for parsing Wiktionary for results in specific languages
Simple and memory-efficient word extractor for Wiktionary
English-Deutsch (Sorted by Frequency)
Extraction of the Russian word forms and their segmentation from the Russian Wiktionary
Prototype of an interface to use Wiktionary translations
Light Wiki parser and renderer developed in Java and Lua, from wiktionary xml dump to html
A library for parsing the french wiktionary
[LREC 2020] EtymDB, an Etymological DataBase (v2.1)
Code for the paper: Wikinflection: Massive semi-supervised generation of multilingual inflectional corpus from Wiktionary (Metheniti and Neumann, 2018)
Add a description, image, and links to the wiktionary-parser topic page so that developers can more easily learn about it.
To associate your repository with the wiktionary-parser topic, visit your repo's landing page and select "manage topics."