EmDComF_raw

Publication type
Publication status
Published
Author
Debaene, F
Publisher
Hugging Face

Abstract

Raw text extracted from 466 early modern Dutch comedies and farces, split into sentences with the NLTK sentence tokenizer and labeled with author indications. The gold data comes from DBNL and CENETON; the OCR data is the raw output of Transkribus Print M1 run on Google Books scans.