NAME
udicm - user dictionary maintainance tool for Kana-Kanji
conversion
SYNOPSIS
udicm [ command ] [ arguments ]
AVAILABILITY
SUNWjc0u
DESCRIPTION
udicm maintenances an user dictionary for cs00 Kana-Kanji
conversion. It has also some functions for a main dictionary
which is used for maintenance of an user dictionary.
Started with command, udicm runs in non-interactive mode.
Started without command, udicm shows the prompt udicm> and
runs in interactive mode. With `quit' command, you can ter-
minate udicm. See "COMMANDS and ARGUMENTS" section below for
more detail.
COMMANDS and ARGUMENTS
Each function of udicm will be performed by specifying a
command and some arguments. udicm supports the following
commands;
mshow, show, add, del, ladd, ldel, cat, create,
extract, quit, hinshi, ?, help
How to specify each command is listed below. In the follow-
ing, main_dict means a main dictionary, and user_dict, an
user dictionary. If you run a command without specifying any
main dictionary, the specified user dictionary is assumed to
match with the default main dictionary (
/usr/lib/mle/ja/cs00/cs00_m.dic ). See "File Formats" sec-
tion below about synopsis, word, reading and part-of speech
symbols of a words-list file.
mshow main_dict [ -s start ] [ -e end ] [ -f filename ] [ -
k ]
Display contents of the main dictionary. Read the
words registered in specified area, format them in a
words-list, and put it to the specified file. Argu-
ments are as follows.
-s start
Specify the reading of a word at which start to
list. Without this argument, from the first word
it will be started.
-e end
Specify the reading of a word at which end to
list. Without this argument, at the last word,
it will be done.
-f filename
Specify the words-list file to which the results
are put. Without this argument, it will be put
into standard output.
-k Print the part of speech information in the name
of part of speech.
See udicm(1) in ja locale for an example.
show user_dict [ -s start ] [ -e end ] [ -f filename ] [ -
k ]
Display contents of the main dictionary. Read the
words registered in specified area, format them in a
list, and put it to the specified file. Arguments are
as follows.
-s start
Specify the reading of a word at which start to
list. Without this argument, from the first word
it will be started.
-e end
Specify the reading of a word at which end to
list. Without this argument, at the last word,
it will be done.
-f filename
Specify the words-list file to which the results
are put. Without this argument, it will be put
into standard output.
-k Print the part of speech information in the name
of part of speech. Without this argument, it is
printed in Part-of-Speech-Symbols.
add user_dict reading word part-of-speech-symbols
Add words specified as arguments to an user diction-
ary. Arguments are as follows.
reading
Specify the reading.
word Specify the word.
part-of-speech-symbols
Needed part-of-speech symbol are enamerated.
del user_dict reading word
Delete words specified as arguments from an user dic-
tionary. Arguments are as follows.
reading
Specify the reading.
word Specify the word.
ladd user_dict filename [ -l logfile ]
Add multiple words at one time to an user dictionary.
Arguments are specified as follows.
-l logfile
Put the results of registration to logfile. No
logfile is created without this argument.
filename
Specify a file in which words to add are listed.
ldel user_dict filename [ -l logfile ]
Delete multiple words at one time from an user dic-
tionary. Arguments are as follows.
-l logfile
Put the results of registration to logfile. No
logfile is created without this argument.
filename
Specify a file in which words to delete are
listed.
cat user_dict filename [ -l logfile ]
Merge one user dictionary to another user dictionary.
Arguments are as follows.
-l logfile
Put the result to the logfile specified as log-
file. No logfile is created without this argu-
ment.
filename
Specify the dictionary's filename which is to be
merged to the main dictionary.
create main_dict new_user_dict filename
Create an user dictionary from a words-list.
main_dict
Specify a main dictionary which is going to be a
pair to an user dictionary.
new_user_dict
Specify an user dictionary to be created.
filename
Specify a words-list file.
extract user_dict filename
Create a words-list file from the contents of an user
dictionary.
filename
Specify a words-list file.
quit Quit udicm (interactive mode).
hinshi
Show the list of part-of-speech-symbols to standard
out.
?, help
Help. Show the command reference to stdout.
File Formats
Words-list
The format of the words-list for input/output of each
command.
Comments
Lines starting with "#" are comments.
Data Consists of three fields. The first and second
fields are for reading and word, respectively.
And the last one is the part-of-speech informa-
tion, described as an enumerate of part-of-
speech-symbol. These fields are separated by
half-size Katakana (Hankaku), white spaces or
tabs. An example is shown on udicm(1) for locale
ja(japanese).
Reading
12 Hiragana characters defined in Japanese EUC
Codeset 1 can be used. However, "you-on" (such
as 'xya'), 'wi', 'we', 'wo' and 'nn' aren't per-
mitted, as the first character. For the second
or subsequent characters, "cho-on" ('-' in
Japanese EUC Codeset 1) can be used in addition
to all Hiragana characters. "daku-on" and
"handaku-on" (such as 'da' and 'pa') are treated
as two characters.
Word Eight characters defined in Japanese EUC Codeset
1 can be used.
Part-of-Speech-Symbols
The part-of-speech information consists of the
following part ot speech symbols
Symbols Part of speech Remarks
:N1 noun1 general noun
:N2 noun2 pronoun
:M1 person's name1 family-name
:M2 person's name2 first-name
:T1 place name1
:T2 place name2 Names of prefectures
:NM numeral
:NN supplemental numeral Mai(pieces),
Kai(times),
Nen(years), etc.
:PR prefix
:SF suffix
:AD adverb
:CN conjunction
:RT participial adjective
:AJ adjective
:AV adjective verb
:SH S-series irregular con-
jugation verb (Sahen-
Doushi)
:ZH Z-series irregular con-
jugation verb (Zahen-
Doushi)
:1V Single conjugation verb
:KV K-series five conjuga-
tion verb (Kagyou-
Godan-katsuyou-Doushi)
:GV G-series five conjuga-
tion verb (Gagyou-
Godan-katsuyou-Doushi)
:SV S-series five conjuga-
tion verb (Sagyou-
Godan-katsuyou-Doushi)
:TV T-series five conjuga-
tion verb (Tagyou-
Godan-katsuyou-Doushi)
:NV N-series five conjuga-
tion verb (Nagyou-
Godan-katsuyou-Doushi)
:BV B-series five conjuga-
tion verb (Bagyou-
Godan-katsuyou-Doushi)
:MV M-series five conjuga-
tion verb (Magyou-
Godan-katsuyou-Doushi)
:RV R-series five conjuga-
tion verb (Ragyou-
Godan-katsuyou-Doushi)
:WV W-series five conjuga-
tion verb (Wagyou-
Godan-katsuyou-Doushi)
:UN No Classification
:TK single kanji
:BS clause
FILES
/usr/bin/udicm
SEE ALSO
mdicm(1), cs00(1M)
|
Закладки на сайте Проследить за страницей |
Created 1996-2025 by Maxim Chirkov Добавить, Поддержать, Вебмастеру |