Go back to main download site
To download a corpus, select a corpus size (given in number of sentences) and download the corresponding data file.
Community
Year  Country     Downloads
2017  -           10K  30K  100K  300K  1M  All
2021  -           10K  30K  100K  300K  1M  All
Newscrawl
Year  Country     Downloads
2011  -           10K  30K  100K  300K  1M  All
Web
Year  Country     Downloads
2015  Tajikistan  10K  30K  100K  300K  1M  All
2015  Uzbekistan  10K  30K  100K  300K  1M  All
2016  Tajikistan  10K  30K  100K  300K  1M  All
Wikipedia
Year  Country     Downloads
2010  -           10K  30K  100K  300K  1M  All
2014  -           10K  30K  100K  300K  1M  All
2016  -           10K  30K  100K  300K  1M  All