Welcome to the second edition of the ICAME CD-ROM
=================================================

Check cond.htm for the condition of usage and credit.htm for a list of the
compilers of the different corpora and the authors of the enclosed
programs.

Also visit http://www.hit.uib.no/icame/cd for up-to-date information on the
use of the CD-ROM. From this address it is also be possible to search some
of the ICAME corpora with your Web-browser by using the WordSmith name and
code as user id and password.

The available manuals are enclosed on the CD-ROM.

The CD-ROM contains 20 different corpora with more than 17 million words.

The disc is in the ISO 9660 format and is therefore readable on a wide
range of computer systems (DOS, Windows, Macintosh and Unix). The enclosed
software is for Windows/DOS, apart from Qwick, which may be used on
machines with a Java runtime system.

The disc have the following directories:

 Lexa       LEXA programs, written by Raymond Hickey, Essen University

 Lingfont   Linguafont programs, written by Raymod Hickey, Essen
            University

 Manuals    The available manuals for the corpora and the software

 Qwick      Qwick program developed at Birmingham University, used on FLOB
            corpus

 Tact       TACT program from University of Toronto, used on the COLT
            corpus

 Texts      The corpora in their original formats

 WC         WordCruncher retrieval program and indexed versions of most of
            the corpora

 Wsmith     Wordsmith program written by Mike Scott at Liverpool
            University

The TEXTS directory has the following subdirectories:

 ACE         Australian Corpus of English (written)

 Brown1      Brown Corpus, format 1 (written)

 Brown2      Brown Corpus, format 2 (written)

 Browntag    Brown Corpus, tagged version (written)

 CEECS       Corpus of Early English Correspondence Sampler (written)

 COLT        Corpus of London Teenage Language (spoken)

 FLOB        Freiburg-LOB Corpus of British English (written)

 Frown       Freiburg-Brown Corpus of American English (written)

 Helsinki    Helsinki Corpus of English Texts, Diachronic part (written)

 ICE_EA      International Corpus of English, East-African component
             (written/spoken)

 Innsbruc    Innsbruck Computer-Archive of Machine-Readable English Texts
             (ICAMET)

 Kolhapur    Kolhapur Corpus of Indian English (written)

 Lampeter    Lampeter Corpus of Early Modern English Tracts (written)

 LLC         London-Lund Corpus (spoken)

 LOB         Lancaster-Bergen-Oslo Corpus (written)

 LOBTAG      Lancaster-Bergen-Oslo Corpus, tagged version (written)

 Newdigat    Newdigate Newsletters (written)

 Old_Scot    Helsinki Corpus of Older Scots (written)

 POW         Polytechnic of Wales Corpus (spoken)

 SEC         Lancaster/IBM Spoken English Corpus

 WC          Wellington Corpus of Written New Zealand English

 WSC         Wellington Corpus of Spoken New Zealand English

Some of the directories have futher sub-directories.

Installation of software
========================


WordSmith
=========

The program is installed to the hard disk by running Setup.Exe in the
WSmith directory.

The program is installed in the directory C:\WSMITH by default, this name
may be changed by the user.

After the program has been installed on the hard disk, start WSHELL.EXE to
use the program. Update the demo version to the full version by choosing
Adjust Settings and Update from Demo.

When "Updating from Demo", please type in the details EXACTLY as you see
them inside the CD-ROM cover. The first (8 letters or numbers) is the
"Name", the longer one is the "Registration". You can put any other
information in "Other Details" if you wish. Please see the "readme.txt"
file for any further details.

WordSmith can be used with the texts in the TEXTS directory. It may also be
used with texts in the WC directory (files with extention .BYB).

The WordSmith manual is found in the WSMITH directory as a Word document
(manual.doc) or as an Acrobat file in the MANUALS directory (wsmanual.pdf)

WordSmith is commercial software and you are only permitted to run the
software on the number of computers you have bought licences for.

For support on using the WordSmith program with ICAME texts, contact
Knut.Hofland@hit.uib.no.


WordCruncher
============

The retrieval component of WordCruncher (for DOS) is installed by opening a
MS-DOS promt/window from the Start menu. Select the letter for the CD-ROM
(usualy D:) and write COPY_WC. The program files are then copied to C:\WCS
and the program is started in the "menu mode". To finish the program, press
SHIFT+F10. The next time you want to start the ICAME CD, click WCVLTD.EXE
in the C:\WCS directory (or make a shortcut to this file from your
desktop).

WordCruncher can also be run in "bookshelf mode". Change to this mode by
running NOMENUCD.BAT, if you want to go back to "menu mode", choose
MENUCD.BAT. Some of the options of WordCruncher are not available in the
"menu mode" (CONCORD option from Main Menu and the Frequency Distribution
from the reference list), use "bookshelf mode" to access these.

Will this installation all the corpora are used from the CD-ROM. For
improved speed when working with the texts, copy the relevant corpora (all
the files in a directory) to your hard disk or network disk. In "bookshelf
mode", press "Insert" to put these corpora on the bookshelf.

A scanned version of the <a href=manuals\wc\index.htm>"Learning WCView"</a>
manual is included on the
CD-ROM. In "menu mode" there is a short introduction to the features of the
WCView program.

You are allowed to install WordCruncher on an unlimited numbers of
computers.


TACT
====

Install TACT to your hard disk by running INSTALL.EXE from the TACT
directory. After TACT has been installed, copy the Colt textbases
(COLT*.TDB) from TACT\TACT214 to the place where TACT was installed
(usually C:\TACT214).

To work with the COLT files, start USEBASE.EXE from the directory
C:\TACT214 and press return to choose a suitable database name:

COLTORT1 orthographic version, everything is indexed

COLTORT2 orthographic version, text in <> is not indexed

COLT_TAG tagged version, tag is connected to word with undescore (word_tag)

COLTPRO1 prosodic version (only 150 of 377files), everything is indexed

Press the Spacebar to access the menu line, select a display (from the
Displays menu) and then search from the Select menu. F1 gives (context
sensitive) help and F10 exits the program.


Qwick
=====

To install Qwick with the pre-indexed FLOB corpus, you first have to
install the Java runtime for the platform you are working on. For Windows
95/98/NT run Java_win.exe in the QWICK directory on the CD-ROM. Please make
a note of where this program is installed. Then unzip the file Qwick.zip to
the directory C:\QWICK. If you do not have WinZip, use the demo copy
included and install this from WinZip70.exe.

If you are running a non-English version of Windows, you have to edit the
file QWICK.BAT in C:\QWICK (see comments in this file).

Start Qwick by running QWIC.BAT

For documentation click the file index.html in the qwick-1.0\doc directory

or visit the Qwick site at University of Birmingham


LEXA
====

To copy the Lexa programs to your hard disk (C:\LEXA), run the file
LEXA\COPYLEXA.BAT or use Windows Explorer to copy the directory LEXA with
sub-directories. For documentation see RTF-files in the LEXA\DOCUMENT
directory.


Linguafont
==========

Use Windows Explorer to copy the directory LINGFONT to your hard disk.

Use LTEXT from the LEXA programs to view the documentation in the
LINGFONT\DOCUMENT directory.


Questions to the ICAME CD-ROM and the software can be directed to

Knut Hofland
HIT Centre
University of Bergen
Allegt. 27
N-5007 Bergen
Norway

Tel. +47 5558 9463
Fax. +47 5558 9470
E-mail: Knut.Hofland@hit.uib.no
