Select a Danish corpus (ca. 207 million words in all): Corrected (400.000 words K90/2000) Korpus2000 (22.1 mill. words) Korpus90 (21.6 mill. words) Korpus2010 (42.4 mill. words, with verb frames & semantic roles) Information 1996-2008 (88 mill. words) Europarl (21.2 mill. words) Wikipedia (56.2 mill. words, with verb frames & semantic roles) Twitter Complete (372 mill. words, passwd) Facebook Complete (32 mill. words, passwd) Leipzig internet corpus (1.6 mill. words) dfk-folketing (7 mill. words) dfk-skalk (600.000 words) Munk-korpus (500.000 words) dfk-barelgazel (600.000 words) dfk-loke (1 mill. words) Firma_1 (66.000 words, passwd) Firma_2 (11.105 words,passwd) Firma_3 (6.527 words, passwd) Smik SpUni (70.000 words) Stereotype Interviews (22.500 words, passwd) Youtube (11.000 words, passwd) VIMU (99.000 words) FBmin v.1 (2.4 mill. words, passwd) FBmin v.2 (21.6 mill. words, passwd) FBmin-neg (520.000 words, passwd) FBmin DR/TV2 (600.000 words, passwd) FBmin DR/TV2 v.2 (8.8 mill. words, passwd) FBmin news (16.3 mill. words, passwd) FBmin-neg DR/TV2 (122.000 words, passwd) TWmin v.1 (2.2 mill. words, passwd) TWmin v.2 (50 mill. words, passwd) TWmin-neg (247.000 words, passwd) Twitter v.3 (140 mill. words, passwd) Twitter v.4 (194 mill. words, passwd) Twitter Corona (29.5 mill. words, passwd) TWmin DR/TV2 (16.000 words, passwd) TWmin-neg DR/TV2 (2.000 words, passwd)
Case insensitive Diacritics insensitive