;
; This is the initialization file for the main programme LEXA. It is a
; normal ASCII text file which you can edit with any text editor (such
; as Lexa Text).
;
; Note that the programme only reads information from those
; lines which do NOT begin with a semi-colon.
;
; To indicate a positive value for a parameter you may type either
; the word "yes" or the number "1" (without inverted commas). A
; negative value is symbolised by either "no" or "0".
;
; The backslash is a so-called escape character. By this is meant
; that the character immediately following it is interpreted at its
; literal value, despite the fact that it is a special character for
; the programme in question. For instance "\;" is interpreted as a
; simple semi-colon, i.e. not as a comment.
;
; If you change any parameters then be very careful to keep
; to the pre-determined format as outlined in the comments below.
;
; LEXA only reads the setup file on starting. To operate the programme
; with altered parameters you must exit, edit this initialisation file
; and start LEXA again.
;
;
;---------------------------------------------------------------------------;
; (I) Information referring to the INPUT text for an analysis
;
; 1: Name of directory to be used for system input files
; (Leave empty if you wish to use the Lexa home directory)
;

;
; 2: Name of directory to be used for text input files
; (Leave empty if you wish to use the current DOS directory)
;

;
; 3: Name of default input file
;
; Note: You may enter a so-called file template here; this is a partial
; file specification e.g. *.txt, my_*.doc, etc. However, if more than
; a single file is found which matches the template then a second one is
; only processed if LEXA is loaded in the batch mode, otherwise the first
; file found is examined and all other potential input ignored.
;
; Pay attention to the fact that if LEXA is called in the batch mode then
; all file names (i.e.
; those for lemmatised texts, frequency lists, databases, etc.) are
; derived from the following entry by appending the appropriate file
; extension (see section VI of the present file).
;
BEOWULF.INP
;
; 4: Character which marks a comment line
;
\;
;
; 5: Character which marks a comment to the end of the current line
;
|
;
; 6: The line of the input text at which analysis is to begin
;
0
;
; 7: The line of the input text at which analysis is to end
;
; Note: Here you either enter an actual value or 0 if you wish LEXA
; to process any input file until the end of the file is reached.
; Legal values range from 1 to 1000000.
;
0
;
; 8: Name of the stop words file to be used (leave this line
; empty if you do not want any to be used)
;
STOP.LST
;
; 9: Type of string used when creating a concordance and when searching
; current text in memory; the value can be changed from the desktop
; with the F6 key
;
; Options: 1 = Only examine entire words
;          0 = Examine parts of words
;
1
;
; 10: Specify whether case differences in source file(s) are to be ignored
;
yes
;
;
; 11: What characters are to count as punctuation?
;
\;:'",.!?
;
;
; 12: Remove punctuation from input text file?
;
no
;
; 13: What character is to function as a delimiter between a word and
; the tag attached to it (default = underscore)
;
_
;
; 14: What character is to function as a word delimiter when lemmatising
; (either at the beginning or the end of a word). This is the character
; used in the lemma definition file
;
#
;
; 15: Ignore COCOA-style comments?
;
yes
;
; 16: Ignore Helsinki Corpus comments?
;
yes
;
; 17: Save text with line numbering
;
;yes
no
;
;
;---------------------------------------------------------------------------;
; (II) Information referring to the LEMMATISATION of a text file
;
; 18: Name of lemma definition file
;
LX_BEOW.DEF
;
; 19: Name of lemmatised output file
;
BEOWULF.LEM
;
; 20: Name of manual tag file
;
LOB.TAG
;
; 21: Name of Alt macro file
;
LEXA.ALT
;
;
;---------------------------------------------------------------------------;
; (III) Information concerning the CONVERSION of a text to a DATABASE
;
; 22: Name of delimited text file (to be read into database system)
; N.B. The extension for this file must be .TXT!
;
BEOW_DB.TXT
;
; 23: Remove punctuation when generating delimited file?
;
yes
;
; 24: Deposit reverse order spelling of each token in text file/database?
;
yes
;
;
; 25: Ignore case differences when generating lexical database?
;
yes
;
; 26: What character is to function as a field delimiter?
;
|
;
;---------------------------------------------------------------------------;
; (IV) Information concerning the generation of FREQUENCY LISTS
;
; 27: Name of token density file
;
BEOW_TOK.DOC
;
; 28: Name of type density file
;
BEOW_TYP.DOC
;
;
;---------------------------------------------------------------------------;
; (V) Information concerning the generation of a CONCORDANCE
;
; 29: Name of concordance output file
;
BEOWULF.CON
;
; 30: Line column at which to insert keyword in context (KWIC)
;
30
;
; 31: Only use a selection of forms for concordance?
;
no
;yes
;
; 32: Name of file with list of forms for concordance
;
EXCERPT.WDS
;
;
; 33: Generate a KWIC concordance as a delimited text?
;
;no
yes
;
; 34: Character to be used as a delimiter
;
|
;
; 35: Name of concordance delimited file
;
MY_CONC.TXT
;
;---------------------------------------------------------------------------;
; (VI) Information necessary for operating LEXA in the BATCH mode
;
; 36: Generate a lemmatised output file?
;
yes
;
; 37: Extension to be used for lemmatised output file (up to 3 characters)
;
LEM
;
; 38: Extension to be used for lemma definition file (up to 3 characters)
;
DEF
;
; 39: Use the same definition file during a batch (otherwise LEXA derives
; a new name from the input file template (above) by adding the definition
; file extension)
;
yes
;
; 40: Tag each file using several lemma definition files (i.e. make several
; runs on each input text file)?
;
; Note: For this option to function correctly you must also specify a
; file which in its turn consists of a list of the lemma definition
; files. Lexa then attempts tagging each file with the information
; in each of these files in the order in which they occur in the
; list file
;
yes
;no
;
; 41: Text file with list of lemma definition files
;
;LEM_DEF.LST
;
;
; 42: Generate a type profile file?
;
yes
;
; 43: Extension to be used for type profile file (up to 3 characters)
;
TYP
;
; 44: Generate a token profile file?
;
yes
;
; 45: Extension to be used for token profile file (up to 3 characters)
;
TOK
;
; 46: Generate a KWIC concordance file?
;
yes
;
; 47: Extension to be used for KWIC file (up to 3 characters)
;
KWI
;
; 48: Generate a KWOC concordance file?
;
yes
;
; 49: Extension to be used for KWOC file (up to 3 characters)
;
KWO
;
; 50: Generate a delimited text file?
;
; N.B. This file MUST have the extension TXT. The name is derived from the
; input file template (Item 3 in this initialization file)
;
yes
;
; 51: Generate a database file?
;
; N.B. This file MUST have the extension DBF. The name is derived from the
; input file template (Item 3 in this initialization file)
;
yes
;
; 52: Automatically overwrite files when in batch mode?
;
; N.B. Extreme caution is required with this parameter as it may well lead
; to overwriting valuable files when Lexa is run automatically.
;
yes
;
;
;---------------------------------------------------------------------------;
; (VII) Information used when handling very large files
;
; 53: Process a large text in slices?
;
yes
;no
;
; 54: Append each slice to the previous one when storing to disk?
;
; Note: If you do not choose the append mode here then each slice is stored
; to disk with the name of the input file and a numerical extension
; beginning at .001 and continuing until all slices have been processed
; or the figure of 256 has been reached.
;
yes
;
; 55: Number of lines per slice
;
; Note: Any number can be entered here as Lexa does not know in advance how
; long each line is and consequently how large an entire file is.
; Basically you should keep to files of about 100K maximum (per slice)
; when lemmatising.
;
500
;
;
;---------------------------------------------------------------------------;
; (VIII) User's text editor and database manager
;
; 56: Name of user's text editor
;
LTEXT
;
; 57: Name of user's database manager
;
DBTXT
;
;----------------------------------------------------------------------------;
; 58: Call batch file on loading? (leave empty for none)
;
GER_KEYB.BAT
;
;
;----------------------------------------------------------------------------;
; 59: Name of sorting and search file (.CEF file)
;
HELSINKI.CEF
;
;
; End of LEXA setup file
;
;