-
-
Notifications
You must be signed in to change notification settings - Fork 398
/
license
240 lines (217 loc) · 10.3 KB
/
license
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
====================================================
Interlingua - Spell Check Dictionary and Hyphenation
====================================================
Version: 2014-05-30
===================
Lincence: GPL 3
===============
This spell check dictionary for Interlingua is licensed under GPL.
A copy should be enclosed.
Package:
=======
This package contains the latest version of Interlingua spell check dictionary.
Word list, affix and hyphenation rules compiled by Artek Dzialek.
(c) 2009-2013 Artek Dzialek(artur<point>dzialek<at>gazeta.pl)
For the OpenOffice/LibreOffice extenstion please go to:
http://extensions.services.openoffice.org/en/project/dict-ia
Affix and Composition Rules:
============================
Affix rules based on "A Grammar of Interlingua", Alexander Gode & Hugh Blair, IALA 1951,
(http://members.optus.net/~ado_hall/interlingua/gi/home/entry_page.html)
Composition rules generate ordinal numerals (3tie, 12nde, 1000me etc.)
Naming of 'big numbers' follows a nomenclature described by Conway and Wechsler.
(http://www.miakinen.net/vrac/zillions).
Wordlist:
=========
Words extracted from following dictionaries:
--- IED: The Interlingua English Dictionary, IALA 1951, (26354)
http://interlingua.wikia.com/wiki/Interlingua-English_Dictionary
--- DIN: Dictionario Interlingua-Nederlandese, Piet Cleij (26228)
http://interlingua.wikia.com/wiki/Interlingua-Nederlandese,
--- ADD: Addenda del Lista Supplementari de Vocabulos International, (6)
http://www.interlingua.com/addenda
--- ERR: Ha Le IED Errores? Stanley A. Mulaik (111)
http://lecorde.com/interlinguaus/pakupaku/uploads/HaleIEDerrores_.pdf
--- DFI: Dictionario francese-interlingua, Piet Cleij (2149)
http://www.interlingua.com/fr/dict_fr_ia
--- CEID: A Concise English-Interlingua Dictionary, Woodruff W. Byrne (41)
http://www.interlingua.com/an/ceid
--- BIO: Terminos biologic in anglese e interlingua, Piet Cleij (2452)
http://www.interlingua.com/e-libros-biologia
--- ANN: Annotationes al margine del „Sancte Libro” (9)
http://archive.org/details/annotationes
--- SUP: Lista Supplementari de Vocabulos International, Piet Cleij
http://www.lulu.com/shop/piet-cleij/lista-supplementari-de-vocabulos-international/paperback/product-20520123.html
Errors noticed and corrected in these sources can be found in Errata.txt
Inflection Flags:
================
A - Adjectives of two forms descended from the Latin first/second declension.
B - Continuation flag for derivative adjectives of first/second declension
C - Invariable adjectives or nouns used attributively
D - Adverbs (not formed from adjectives)
E - Adjectives of one form descended from the Latin third declension ('purist list') and ending in -ista, -ota, -ita
F - Continuation flag for derivative adjectives of third declension
G - Proper names or nouns in singular only
H - Flag allowing affixes to be inside compounds
I - First part of compound
J - Last part of compound
K - Only in compounds
L - Nouns in plural only and "guest" plural form of nouns
M - Feminine nouns derived from masculine nouns
N - Ordinal numerals
O - Continuation flag for ordinal numerals
P - Continuation flag for ronouns
Q - Pronouns of two forms (first/second declension)
R - Pronouns of one form (third declension)
S - Nouns
T - Greek nouns with alternative sg on -is/-x, Latin nouns in -um, Greak nouns in -on, German nouns
U - English nouns, French nouns in -au, -eau and -s
V - One-stem verbs in -ar, -er and -ir
W - Double-stem verbs ending in -er or -ere
X - Flag for 'big' numbers
Y - First part of ordinal number
Z - Second part of ordinal number
] - Metrix prefixes, long form
[ - Metrix prefixes, abbreviated
) - Binary prefixes, long form
( - Binary prefixes, abbreviated
= - Prefixes separated from base with hyphen
Derivation Flags:
================
a -ista sb
b -abile, -ibile adj
c -abilitate, ibilitate sb
d -antia, -entia, -ientia sb
e -eria sb
f -o- adj used in compounds
g -age sb
h -ena sb
i -atori, -itori, -ori adj
j -atorio, -itorio, -orio sb
k -itate, -etate sb
l -illion, -illiardo sb
m -amento, -imento sb
n -ation, -ition, -ion
o -ismo sb
p -uple adj
q -ic adj
r -icitate sb
s -istic adj
t -ator, -atrice, -itor, -itrice, -or, -rice sb
u -atura, -itura, -ura sb
v -ative, -itive, -ive adj
w -ativitate, -itivitate, -ivitate sb
x - cardinal and ordinal suffixes of unit's digits
y - ordinal suffixes of decimal digits
z - prefixes of unit's digits
Hyphenation Rules:
=================
The rules have been built from scratch using the method described in:
--- "How to make a foreign language pattern file: Romanian",
http://www.tug.org/TUGboat/Articles/tb16-1/tb46bec2.pdf.
The hyphenation rules were adopted mainly from Italian, Spanish, Romanian and German.
1. a single consonant between two vowels belongs to the second syllable
2. digraphs bh, ch, dh, gh, kh, ph, rh, sh, th and zh are considered a single consonant
3. two consonants are divided between vowels unless an l or an r follows b, c, d, f, k, g, p, t, v
the same rules applies to a group of three consonans starting with ch, gh, kh, ph, sh, th
4. initial or final cluster of consonants are not devided
5. three consonants are divided after the first one if the group ends with
a) l or r following b, c, d, f, g, k, p, t or v
d) bh, ch, dh, gh, kh, ph, rh, sh, th and zh
c) sc, sk, sp or st
d) otherwise, they are divided after the second consonant
6. four consonants are divided after the first one if the group ends with
a) l or r following sc, sk, sp or st
b) sch, sph or sth
and they are devided after the second one
c) if l or r does not follow sc, sk, sp or st
d) or the group ends with ch, ph, sh or th
e) in all other cases a group of four consonst is devided after the third consonant
7. never devide before or after an apostrophe
8. patterns adjusted for compound words, "guest" words or etymology
Revision History:
================
2014-05-30
-- Unnecessary install.js file removed after feedback from Mozilla Add-ons
2014-05-18
-- Flag reorganized: B, F, N, M, f
-- Added words from "Lista Supplementari de Vocabulos International"
-- Updated affixes in A, E, F, P, T, U, k
-- Adjectives in -ista, -ota, -ita, -tor, -sor, -xor flagged with E
-- Flag U extended to cover French nouns in -s, English nouns in -man, -lf, affixes for Greek nouns in -is removed
-- Greek nouns with alternative sg in -is/-x moved to flag T, sg in -e used as base
-- Flag T redesigned and extended to cover German nouns, Latin nouns in -um, Greek nouns in -on
-- Updated compatibility for LibreOffice 4.1 release
-- Added basic and derived SI and non-SI units and units of information
-- Flags to add metrix and binary prefixes: [, ], ( and )
-- Flag generates hyphenated forms to mark hiatus, e.g. anti-inflation
-- Added missing names of countries with related adjectives
-- Added flag f and compunding rules to generate compound adj/sb of type: franco-german, nord-americano
-- Added keep case flag | and compound flag +
-- Added conversion of ' to and from ’
2013-12-06
-- Minor fixes to affix rules
-- Added missing names of chemical elements
2013-11-27
-- Affixes redesigned. Verbs devided into one-stem and double-stem classes (V vs. W)
-- New flags to handle suffixes -ista sb, -istic adj and -ismo sb
-- Flag for -ada sb removed due to low usage
-- Flags for -ic adj and -icitate extended
-- Added all names of countries and their capitals
2013-09-02
-- Added words from "Terminos biologic in anglese e interlingua"
-- Hyphenation rules reworked and extended
-- Most of corrections suggested in "Annotationes..." added
-- Around 200 additonal corrections added
-- Updated compatibility info for new Apache OpenOffice release
2013-04-30
-- Adjectives ended on -phile moved out of 'purist list'.
-- Added words from DFI N-Z
-- Handful of typos detectd in DIN corrected
2013-04-19
-- Adjectives ended on -gine moved to 'purist list'.
2013-04-13:
-- Flag for following words was corrected: compound, indoor, microcomputer, super,
west-nord-west, out, antidumping, antifading, antiskating, antitank, non-stop,
underground, antiballas, antidrogas, antigas, antipulices, antirugas, cis-trans,
topless, unisex, waterproof
-- Added words extracted from DFI letters A-M
-- Dictionary, affixes and hyphenation rules converted to Unicode
-- Replacement and mapping rules added.
-- Conversion rules for combining accents and hyphenation points
2010-11-21:
-- Following non-international words were added back: an, ella, esque, num, utrum
-- Following international words were added: calceria, diffusivitate, merceria, natal
-- Some duplicate entries were removed
2010-11-27:
-- Minor changes to naming of flags
-- Some duplicate entries were removed
2010-12-23
-- review of inflection flags, new flags M, J, T, U, correction to V, A, E
-- new continuation flags: B, F
-- review of derivation flags, changes to flag naming, new flag for -ic adjectives
-- review of entries from DIN
-- typo corrections: hydropatha, telepatha, cardiopatha, diphteria,
coproducteur, zoomer, dischristanisar, deruraliser, dissyllabic,
microcalorimétrie, kératoplastie, mille-pedes, kératose, kératotoma,
kératotomie, binationalité, Gethsémani, protéinogramme, caprifigo,
réplicative, tétanisate, protéinogramme, étiopathia, mille-flores,
mille-folios, altoparlator, urugayan, lipovaccin, trompe-d'oeil,
ethniciser, ostreicola, asinitate, audibitate
-- modernized spelling of English compunds (without hyphens)
-- regularized capitalization: sinn fein, Pompeiano, quechua
-- added -ic adjectives: -ologic, -ophilic, -ophobic, -ochimic
-- all duplicates entries removed
2011-01-01
-- added nosplit suggestions option
-- added 'hidden entries' in DIN: Abraham, Bernardo, Celsius,
Gotthard, Laurentio, antitempesta, corps diplomatique, erignatho,
horologic, incogniscite, japonic, maladive, onyce, oppositifolie,
polypoide, premiscer, ptarmic, robur, roccella, sebacic, semifundite,
sexuate, suprahuman, thapsiforme
2011-01-16
-- removed adjectives ending with -ari (correct forms end with -ar):
alveolari, axillari, biliari, ganglionari, interstellari, mamillari,
nucleari, pupillari, tumulari, tutelari, bacilari, cilari, supercilari
-- removed adjectives incorrectly ending with -ar, correct form ends with ari:
valetudinar, jubilar