fsa_morph

Langue: en

Version: 111320 (mandriva - 01/05/08)

Section: 1 (Commandes utilisateur)

NAME

fsa_morph - perform morphological analysis of a word

SYNOPSIS

fsa_morph [ options ] [ <infile ] [ >outfile ]

DESCRIPTION

fsa_morph reads lines from the input. Each line contains one word. For each (inflected) word, its morphological analysis is printed. It consists of one or more pairs (lexeme, tag).

OPTIONS

-d dictionary
use that dictionary. Several dictionaries may be given. At least one dictionary must be specified. Dictionaries are automata built using fsa_ubuild or fsa_build. The data for them must be prepared in a special way. Each line of data consists of the inflected form, followed by a separator, followed by a code, followed by the ending of the lexeme, followed by the separator, followed by tags. The code specifies how many characters from the end of the inflected form are different from last characters of the lexeme (i.e. how many characters constitute the inflected word ending). 'A' means that no characters are to be rejected, 'B' - 1, 'C' - 2, and so on. Though the data is different from the normal word list format, the automaton is the same (its content is different), so the magic number is the same as with the normal data.
-i input_file
specifies an input file - file that contains words that should be analysed. More than one file can be specified (i.e. the option can be used more than once). In absence of -i option, standard input is used.
-l language_file
specifies a file that hold language specific information, i.e. (for now) characters that form words, and pairs of (lowercase, uppercase) characters for case conversion. If the option is not specified, latin letters with standard case conversions wil be used.

Note: whether this information will be used or not depends on the module used for I/O handling (one_word_io or text_io).

-P
indicates that the dictionary contains coded prefixes (see fsa_morph(5)).
-I
indicates that the dictionary contains coded infixes (see fsa_morph(5)).
-v
print version details, including compile options used to build the program.

EXIT STATUS

0
OK
1
Invalid options, or lack of a required option.
2
Dictionary file could not be opened.
4
Not enough memory.

SEE ALSO

fsa_morph(5), fsa_accent(1), fsa_build(1), fsa_guess(1), fsa_hash(1), fsa_morph(5), fsa_prefix(1), fsa_spell(1), fsa_ubuild(1), fsa_visual(1).

BUGS

Send bug reports to the author: Jan Daciuk, jandac@pg.gda.pl.