Go back to main download site
To download a corpus, select a corpus size (given in number of sentences) and download the corresponding data file.
News
Year       Country  Downloads
2007-2009           10K  30K  100K  300K  1M
2020                10K  30K  100K  300K  1M
Newscrawl
Year  Country  Downloads
2011  trad     10K  30K  100K  300K  1M
Web
Year  Country                      Downloads
2014  simp                         10K  30K  100K  300K  1M
2015  China, People's Republic of  10K  30K  100K  300K  1M
2015  Macau                        10K  30K  100K  300K  1M
2016  Macau                        10K  30K  100K  300K  1M
Wikipedia
Year  Country  Downloads
2014           10K  30K  100K  300K  1M
2018           10K  30K  100K  300K  1M