1. Conditions of use

All users of the Penn-Helsinki corpora must accept the following conditions of use. If you are not willing to accept these conditions, you must return the CD to the party that provided it to you (the vendor, the library, the instructor, etc.) without using the corpus or making copies of any files on this CD.

Penn-Helsinki Corpus Conditions of Use

  1. Users must accept that the PPCME2, the PPCEME and the Helsinki Corpus of Historical English are subject to copyright restrictions. They must agree to abide by them and understand that violations of copyright restrictions may result in legal liability.
  2. Users may make no commercial use of the PPCME2, the PPCEME, or the Helsinki Corpus without prior permission.
  3. Users may not redistribute the PPCME2, the PPCEME, or the Helsinki Corpus to others except in limited passages under the ordinary standards of scholarly citation.
  4. Users must agree to acknowledge the PPCME2, the PPCEME, and the Helsinki Corpus in any written work or oral presentations based on research using these materials.
  5. Users must accept that the distributor of the PPCME2 and the PPCEME makes no warranties, express or implied, concerning the PPCME2 or the PPCEME, including but not limited to their ownership, merchantability, or fitness for a particular purpose. The distributor shall not be liable for any direct, consequential, punitive, or other damages suffered by user or any other person resulting from the use of the distributed materials.

2. Orientation

The two corpora included in this CD, the PPCME2 and the PPCEME, are located in the following two directories: The files of text samples in plain text, part-of-speech tagged text, and parsed text form are located in the "txt," "pos" and "psd" subdirectories, respectively. In addition the files of the PPCEME, but not the PPCME2, are divided into three equal sized subdirectories labeled "helsinki," "penn1," and "penn2."

All documentation for the two corpora and for the search program CorpusSearch is accessible from the Penn Corpora Home Page ("index.html") in the "PENN-CORPORA" folder on the CD.

Please copy the "PENN-CORPORA" folder to your hard drive and open the "index.html" file in your web browser to start exploring the Penn-Helsinki Parsed Corpora of Historical English. The corpus files themselves and the output of any searches using CorpusSearch can be read in a text editor. We recommend an editor like emacs, vi, or pico rather than a word processing program to avoid the danger of changing the files from simple text format to the more complex format of a word processor. CorpusSearch only searches files in text format.

To use CorpusSearch, follow the installation instructions below.

3. How to Install CorpusSearch


Last modified: Wed Jul 12 18:31:10 EDT 2006