|
|
../ ARCHER -
American Representative Corpus of Historical English Registers |
| Developed by: |
 |
Original coordinator: Prof. Douglas Biber at the University of Northern Arizona in Flagstaff; currently cooperating with Prof. Mair and Kortmann from University of Freiburg |
| Size: |
|
1.7 millon words |
| Contents: |
|
various textual genres in British and American English; ~ 1650s - ? |
| Access: |
|
|
| Notes: |
|
|
| |
|
 |
|
|
|
|
../ Corpus of Early English Correspondence (CEEC) |
| Developed by: |
 |
Sociolinguistics and Language History project team at the Department of English, University of Helsinki |
| Size: |
|
2.7 million words |
| Contents: |
|
Personal letters written in England between c.1410 and 1680 |
| Access: |
|
Restricted to research; refer to CEECS for general access |
| Notes: |
|
Since 2000 work has been under way to extend the corpus to the eighteenth century. Tagging of the corpus was finished in 2004 and parsing is continuing into 2005. |
| |
|
 |
|
|
| |
|
../ CME - Corpus of Middle English Prose and Verse |
| Developed by: |
 |
HTI - University of Michigan, U.S. |
| Size: |
|
54 texts |
| Contents: |
|
Collection of Middle English texts provided by the University of Michigan and the Oxford Text Archive |
| Access: |
|
Free; Search possible in individual or groups of books; Conduct simple/boolean/proximity searches |
| Notes: |
|
SGML Markup according to the TEI guidelines |
| |
|
 |
|
|
|
|
../ Corpus del Español |
| Developed by: |
 |
Mark Davies, Brigham Young University |
| Size: |
|
100 million words |
| Contents: |
|
heterogeneous; corpus includes fictional, non-fictional and oral texts; 1200s to 1900s |
| Access: |
|
An elaborate online search is available that allows complex search patterns in individual parts of the corpus or in the entire corpus |
| Notes: |
|
This service is freely available. |
| |
|
 |
|
|
| |
| |
|
../ A Historical Corpus of the Welsh language 1500-1850 |
| Developed by: |
 |
Department of Linguistics, University of Cambridge |
| Size: |
|
420,000 words |
| Contents: |
|
heterogeneous; 30 texts, sample size ~15,000 words; 1500-1850 |
| Access: |
|
The texts are available for download; An online search function is also provided |
| Notes: |
|
XML encoded |
| |
|
 |
|
|
| ../ MEMEM - Michigan Early Modern English Materials |
| Developed by: |
 |
Richard W. Bailey, Jay L. Robinson, James W. Downer, with Patricia V. Lehman. |
| Size: |
|
50,000 records |
| Contents: |
|
"The Materials consist of citations collected for the modal verbs and certain other English words for the Early Modern English Dictionary." |
| Access: |
|
Free online access through a search interface is available |
| Notes: |
|
|
| |
|
 |
|
|
|
../ Newdigate Newsletters |
| Developed by: |
 |
Philip Hines, JR. |
| Size: |
|
2100 manuscript newsletters in the Newdigate series |
| Contents: |
|
"A multi-genre corpus, with a thousand-year timespan from the earliest documents to 1710."; 13 January 1673/4 to 29 September 1715 |
| Access: |
|
The Diachronic part is available on the ICAME CD-Rom; check out the online sample. |
| Notes: |
|
|
| |
|
 |
|
|
|
../ Penn-Helsinki Parsed Corpus of Middle English - PPCME2 |
| Developed by: |
 |
University of Pennsylvania, U.S. |
| Size: |
|
1.3 million words; 55 text samples |
| Contents: |
|
prose text samples of Middle English; |
| Access: |
|
Both the corpus and the software 'Corpus Search' are available on CD-Rom |
| Notes: |
|
syntactic annotation |
| |
|
 |
|
|
| |