Skip to content

DiBiLit (Digitale Bibliothek, Literatur) is a corpus of more than 2,000 literary and scientific texts from the 15th to the 20th century by renowned authors. The texts are available under the CC-BY-SA 4.0 license.

License

Notifications You must be signed in to change notification settings

deutschestextarchiv/DiBiLit-Korpus

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DiBiLit-Corpus

DiBiLit is a corpus being created in the BMBF-funded project CLARIAH-DE by homogenising various derivatives of texts from the "Digital Library" and extensively enriching them with (bibliographical) metadata. The more than 2,000 texts come from renowned authors, are DTABf-encoded and were made accessible within the DTA infrastructure under a Creative Commons-licence. Thus, the text collection originally published by DirectMedia Publishing can be researched using the DDC search engine integrated in the DTA as well as other DTA tools for linguistic analysis.

Content

The repository contains different directories:

  • data [contains all text assigned to genre-based subdirectories]
    • drama
    • erzaehlungen
    • essays
    • fabel
    • libretti
    • lyrik
    • prosa
    • roman
    • sagen_maerchen
    • wissenschaft
  • metadata [contains two subdirectories related to metadata]
    • bibl [contains the bibliographical metadata being the basis of the DTABf-Headers]
    • headers [contains the DTABf-headers of all texts]
  • publications [contains the documentation of the workflow]

Presentation

About

DiBiLit (Digitale Bibliothek, Literatur) is a corpus of more than 2,000 literary and scientific texts from the 15th to the 20th century by renowned authors. The texts are available under the CC-BY-SA 4.0 license.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published