Search in more than 26 million sentences of German newspaper material

Welcome to the Leipzig Corpora Collection / Deutscher Wortschatz

a project of the Natural Language Processing Group at the Institute of Computer Science at Leipzig University.

The international corpora portal offers access to more than 230 corpora of the Leipzig Corpora Collection (LCC) in more than 200 languages.

To the Leipzig Corpora Collection

The words of the day are based on a selection of newspapers and news services. They are updated daily at 7 am and are available as an RSS 2.0 feed.

To the words of the day
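
Because the words of the day are offered as RSS 2.0, they can be read with any standard XML parser. The following is a minimal sketch in Python, not the project's own tooling. The sample feed, its titles, and the function name `item_titles` are assumptions for illustration, since the real feed URL and item layout are not given here. It relies only on the `channel`/`item`/`title` structure that RSS 2.0 defines.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample feed: the words and dates are invented for illustration.
# Only the rss/channel/item/title nesting comes from the RSS 2.0 format itself.
SAMPLE_FEED = """<rss version="2.0">
  <channel>
    <title>Words of the day (sample)</title>
    <item>
      <title>Beispielwort</title>
      <pubDate>Mon, 01 Jan 2024 07:00:00 +0100</pubDate>
    </item>
    <item>
      <title>Zweitwort</title>
      <pubDate>Mon, 01 Jan 2024 07:00:00 +0100</pubDate>
    </item>
  </channel>
</rss>"""


def item_titles(feed_xml):
    """Return the <title> text of every <item> in an RSS 2.0 document."""
    root = ET.fromstring(feed_xml)
    # iter("item") walks the whole tree, so the channel's own title is skipped.
    return [item.findtext("title", default="") for item in root.iter("item")]


if __name__ == "__main__":
    for word in item_titles(SAMPLE_FEED):
        print(word)
```

To use this against the live feed, the XML text would first have to be downloaded from the feed address linked on the words of the day page and then passed to `item_titles`.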

On this website you can contribute to corpus collection for under-resourced languages by simply entering a URL.

To the CURL portal

The corpus and language statistics contain analyses of various aspects of natural language, based on our corpora.

To the corpus statistics

Our REST web services allow direct access to our corpora from any software. These services are currently still in the beta phase.

To the RESTful web services

Some of our tools and large parts of our data are available for download.

To the download page

Feedback from users about the different services of the LCC portal.

To the feedback page

Data is collected automatically from carefully selected public sources. The example sentences are selected automatically and do not express the views of this project. The original authors are solely responsible for the content and opinions contained therein.