a project of the Natural Language Processing Group at the Institute of Computer Science at Leipzig University.
The international corpora portal offers access to more than 400 corpora of the Leipzig Corpora Collection (LCC) in more than 250 languages.
To the corpora portalOn this website you can contribute to corpus collection for under-resourced languages by simply entering a URL.
To the CURL portal
The words of the day based on a selection of newspaper and news services. Daily at 7 am and available as RSS!
RSS 2.0
The Wortschatz's CLARIN corpora portal offers access to all corpora of the Leipzig Corpora Collection (LCC) that we already integrated into the CLARIN infrastructure.
To the LCC's CLARIN corpora portalThe ASV Toolbox is a modular collection of tools for the exploration of written language data.
To the online toolboxThe corpus and language statistics contain analyses about various aspects of natural language based on our corpora.
To the corpus statisticsOur REST web services allow direct access to our corpora by using any software. Currently, these services are still in the beta phase.
To the RESTful webservicesData is automatically collected from carefully selected public sources. The example sentences are automatically selected and are not expression of this project. The authors are solely responsible for the content and opinions contained therein.