Copyright agreement is provided in first line of the dictionary file:
You may download original DOS-version of the Mueller dictionary (7 Edition) for DOS from www-page of Sergei Starostin. According to the extracurial compromise between company "ABBYY" and publishing house "Russky Yazyk", publishing house "Russky Yazyk" has copyrights on editions of Mueller dictionary published after 1961 only. Thus the content of Mueller dictionary published before 1961 is in public domain. S.Starostin, as the author of the first electronic version of Mueller dictionary, kindly allowed me to use his code one for any purpose.
The Russian Scientific-Technical Center "Informregistr" has registered the Electronic Version of Mueller Dictionary (7 Edition) on February 29, 2000. The number of State registration is 0320000030.
Short description of dictionary format is provided in the second line of the dictionary file:
This dictionary is arranged as a single, plain-text file. After multiple requests, I have inserted a delimiter in word article -- tandem of two blank symbols. This delimiter subdivides a word article into two parts where the first part is the word itself, and the other is the translation of the word. Besides, I employ the auxiliary symbol "_" because it was not used in the original text of the dictionary and it plays no specific role in Unix regular expressions.
There are some comment about functional labels. A English label word tags the translated word as grammar "part of speech", a Russian one --- as used in the different sphere of knowledge. All label words are ended with point or colon. List of label words and abbreviations are added to the dictionary file beginning after second line. Latin numerals signifying different meanings of the translated word are began from symbol "_" also. Word article may be subdivided on the sense section with "number+point" or/and "Russian letter+>" (I change ")" on ">" for more accurate work of formating).
In the primary S.Starostin's version of dictionary, accents in Russian words are encoded by special way. In my version all stressed letters in Russian words are turned on capital letters. There is version of my dictionary without accents in Russian words also.
Transcription is rounded by square brackets. Transcription symbols
correspond to the IPA (International Phonetic Alphabet) standard.
Basic English phonetic symbols are:
"a" from "man" --- Q, 81
"w" --- W
"a" from "past" --- A, 65
":" from a: in "past" --- 249, 0xF9
"e" from "her" --- 171, 0xAB
"e" first from diphthong in "care" --- E, 69
"o" from "wash" --- 141, 0x8D
"a" from "son" --- 195, 0xC3
"i" short "i", from "ink" --- I
"i" long "i", from "machine" --- i
"'" stress by the voice --- 200, 0xC8
"," fall of the voice --- 199, 0xC7
"k" --- H
"z" --- Z, 90
"ng" --- N, 78
"sh" --- S, 83
"th" with voice --- D, 68
"th" without voice --- T, 84
The majority of small English letters don't change place in
coding table. The main trouble of IPA coding table is impossible
to make common font for Russian and English and phonetic symbols
(may be UNICODE only).
In other hand, unfortunately, IPA has some other symbols on
place "-", "(", ")" and it's necessary to exclude ones from transcription
(although, they are used in mix with transcription symbols in the
ordinary paper dictionaries).
You may download the dictionary (version 1.2) altogether with hash file: in tar.bzip2 file, and tar-bzip2ed file with accent in Russian words. The size of each packet is 1.9 MB.
recode_dic.c.gz --- C program for recoding dictionary between various russian coding tables. Transcription in square brackets keeps changeless.
Please, send notes and suggestions to Eugene Cymbalyuk by e-mail: [email protected].