Search Mailing List Archives

Limit search to: Subject & Body Subject Author
Sort by: Reverse Sort
Limit to: All This Week Last Week This Month Last Month
Select Date Range     through    

[multilingual-dh] Fresh off the press: multilingual (European) text collections

Worthey, Glen Cameron gworthey at
Thu Nov 21 06:28:39 PST 2019

This new  full-text collection should open some wonderful possibilities to folks in our group!  

On 11/21/19, 2:40 AM, "Humanist" <humanist at> wrote:

                      Humanist Discussion Group, Vol. 33, No. 426.
                Department of Digital Humanities, King's College London
                       Hosted by King's Digital Lab
                    Submit to: humanist at

            Date: 2019-11-20 16:09:33+00:00
            From: Justin Tonra <justin.tonra at>
            Subject: Distant Reading Novel Collections Released in ELTeC Version 0.5.0
    One of the principal aims of COST Action Distant Reading for European Literary
    History is to build a multilingual European Literary Text Collection (ELTeC),
    ultimately containing around 2,500 full-text novels in at least 10 different
    languages. Today, we are pleased to announce the first public release of ELTeC,
    with nine language collections included!
    The ultimate aim is for ELTeC is to provide multiple collections of 100 novels
    published between 1840 and 1920 in their original language. Work to add more
    novels to ELTeC is ongoing, as we aim to build a corpus which will aid us in our
    task to develop the resources necessary to change the way European literary
    history is written. Progress towards our goal and current statistics about the
    collections can be found on this ELTeC Summary page:
    Language collections in this first release are in German, English, French,
    Italian, Norwegian (Bokmål and Nynorsk), Portuguese, Romanian, Serbian, and
    Slovenian. The collections can be downloaded here:
    Each novel is encoded in compliance with the guidelines of the Text Encoding
    Initiative (TEI), and we have published details about the criteria for corpus
    composition and encoding guidelines here:, and
    about our TEI schemas here:
    As work progresses on ELTeC, we invite you to use our collections. Feedback from
    the community would be very welcome as we improve our collections and work
    towards future releases. Please feel free to write to
    me<mailto:justin.tonra at> or Action Chair, Christof
    Schöch<mailto:schoech at> with your comments. If you are interested in
    becoming a member of our Action, find out more here: https://www.distant-
    Thanks to all of our colleagues from across Europe (and beyond) who have helped
    us to reach this important milestone!
    Learn more about our Action:
    Learn more about ELTeC:
    ELTeC Summary page:
    ELTeC umbrella repository referencing all releases included in today's release:
    Zenodo community containing archive copies:
    Dr Justin Tonra
    Lecturer in English
    School of English & Creative Arts
    National University of Ireland Galway

    Unsubscribe at:
    List posts to: humanist at
    List info and archives at at:
    Listmember interface at:
    Subscribe at:

More information about the multilingual-dh mailing list