Portuguese Corpus Page

CG-tagged corpora: CETEMPúblico (1,000,000 words, no password required)
speech data (50,000 words, no password required)
historical texts (50,000 words)
modern texts (100,000 words)

Search manual (separate window).
Portuguese CG tag set

When searching tagged text, enclose word forms in double quotes and base forms in single quotes. Separate tags with a blank space and words with an underscore. Use '_._' for one dummy word, '_.?_' for one optional dummy word, '_.*_' for any number of optional dummy words (zero or more), and '_.+_' for one or more obligatory dummy words. Sentence start is marked by '- - -' in untagged corpora and by ">>>" (in word-form quotes) in tagged corpora.
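
As an illustration of the quoting and separator rules above, here is a minimal Python sketch that assembles a search string for a tagged corpus. The helper names (`word_form`, `base_form`, `build_query`) are hypothetical and not part of the search system, the tag `V` is used only as a placeholder, and placing tags directly after the quoted word is an assumption; consult the search manual for the exact syntax.

```python
# Hypothetical helpers for composing a tagged-corpus search string.
# They only follow the rules stated on this page; the real search
# system may accept or require more than this.

ANY_WORD = "."        # one dummy word        -> appears as _._
OPTIONAL_WORD = ".?"  # one optional dummy    -> appears as _.?_
ANY_WORDS = ".*"      # optional dummy words  -> appears as _.*_
SOME_WORDS = ".+"     # obligatory dummy words -> appears as _.+_
SENTENCE_START = '">>>"'  # sentence start in tagged corpora (word-form quotes)


def word_form(form, *tags):
    """Word form in double quotes, followed by space-separated tags (assumed order)."""
    return " ".join([f'"{form}"', *tags])


def base_form(lemma, *tags):
    """Base form in single quotes, followed by space-separated tags (assumed order)."""
    return " ".join([f"'{lemma}'", *tags])


def build_query(*tokens):
    """Join the word-level tokens with underscores."""
    return "_".join(tokens)


query = build_query(
    SENTENCE_START,
    ANY_WORDS,
    base_form("ser", "V"),  # 'V' is an illustrative tag only
    word_form("feliz"),
)
print(query)  # ">>>"_.*_'ser' V_"feliz"
```

The resulting string would be pasted into the search field; whether the corpus returns hits for it depends on the actual tag set and corpus content.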

Enter search string:
Enter password:
Corpus sources and copyright


The search system was designed by Eckhard Bick for VISL. More information on the project, live grammatical analysis, and a number of grammar teaching tools are available at the VISL main site.