The University of Pennsylvania Linguistics Department is home of a long-running project to create syntactically annotated (parsed) corpora of historical English. The project is directed by Anthony Kroch, Professor of Linguistics, and the research associate in charge of corpus annotation is Dr. Beatrice Santorini. The Middle English corpus was constructed by Dr. Ann Taylor, now research associate in charge of corpus annotation at the University of York, England.
The following corpora are available from the Penn project:
The corpora are available on CD-ROM, together. The price of the CD, which contains both currently released corpora, is US $300. Further information is available on our order page.
The Penn corpora are distributed with the search program CorpusSearch 2, written by Beth Randall and released under the Mozilla 1.1 Public License as open source software. In addition to being included on the Penn corpus CD-ROMs, CorpusSearch is freely downloadable from sourceforge.net.