docs/dictionary-data.md
12 additions & 1 deletion
@@ -419,7 +419,7 @@ that for you, so it could be as simple as `./run.js --help`.
 ## Cree `linguistInfo`
 
 For the Plains Cree dictionary, the following `linguistInfo` fields are
-used to display linguistic info in search results, and for showing emoji:
+used to display linguistic info in search results, to provide semantic class information, and for showing emoji:
 
 - `inflectional_category`, String: The inflectional category for an
 entry, with hyphen, e.g., `NI-1`. (CW's `\ps`)
@@ -428,6 +428,15 @@ used to display linguistic info in search results, and for showing emoji:
 If we were naming this today following our glossary, we would call it
 the *general word class*.
 
+- `rw_domains`, list of String: The RapidWords semantic classification domain names for this entry, in the canonical form defined in both rapidwords.net and semdom.org: e.g. `[ "Sleep" ]`
+
+- `rw_indices`, dictionary mapping String to a list of String: For each of the sources in the entry (using the same short abbreviations as in `source`), we provide the list of indices for the RapidWords semantic classification domains for the entry, in the canonical form defined in both rapidwords.net and semdom.org: e.g.:
+```
+{
+  "CW": [ "5.7.1" ]
+}
+```
+
 - `stem`, String: The FST stem for this entry.
 
 For Plains Cree specifically, there are two variants of linguistic
@@ -459,6 +468,8 @@ used to display linguistic info in search results, and for showing emoji:
 of the minimal CW stem were some day added to morphodict, that would of
 course require the dictionary data to include that data at that time.
 
+- `wn_domains`, a list of String: The WordNet semantic classifications for this entry, using the same format as in the Altlab wordnet server, e.g., `[ "(v) sleep#1", "(adv) together#4" ]`.
+
 - `wordclass`, String: The word class for this entry (`VTA` / `VAI` / etc.).
 At one time our glossary called this a *specific word class*.
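
To make the shapes of the three semantic-class fields concrete, here is a minimal Python sketch that checks a `linguistInfo` dictionary against the types described in this change. It is not part of morphodict: `check_semantic_fields`, `is_str_list`, and `WN_DOMAIN` are hypothetical names, the sketch assumes all three fields are optional, and the `wn_domains` pattern is inferred only from the two examples given (`(v) sleep#1`, `(adv) together#4`), so the real format may well be broader.

```python
import re
from typing import Any, List

# Assumption: pattern inferred only from the documented examples,
# e.g. "(v) sleep#1" -> part of speech, lemma, sense number.
WN_DOMAIN = re.compile(r"^\((?P<pos>[a-z]+)\) (?P<lemma>.+)#(?P<sense>\d+)$")


def is_str_list(value: Any) -> bool:
    """True if value is a list whose items are all strings."""
    return isinstance(value, list) and all(isinstance(v, str) for v in value)


def check_semantic_fields(linguist_info: dict) -> List[str]:
    """Return a list of problems; an empty list means the fields look fine.

    Hypothetical helper: treats rw_domains, rw_indices, and wn_domains
    as optional and only checks their shapes when present.
    """
    problems = []

    # rw_domains: list of String, e.g. ["Sleep"]
    rw_domains = linguist_info.get("rw_domains")
    if rw_domains is not None and not is_str_list(rw_domains):
        problems.append("rw_domains must be a list of strings")

    # rw_indices: source abbreviation -> list of String, e.g. {"CW": ["5.7.1"]}
    rw_indices = linguist_info.get("rw_indices")
    if rw_indices is not None:
        if not isinstance(rw_indices, dict):
            problems.append("rw_indices must be a dictionary")
        else:
            for source, indices in rw_indices.items():
                if not isinstance(source, str) or not is_str_list(indices):
                    problems.append(f"rw_indices[{source!r}] must map to a list of strings")

    # wn_domains: list of String, e.g. ["(v) sleep#1", "(adv) together#4"]
    wn_domains = linguist_info.get("wn_domains")
    if wn_domains is not None:
        if not is_str_list(wn_domains):
            problems.append("wn_domains must be a list of strings")
        else:
            for item in wn_domains:
                if WN_DOMAIN.match(item) is None:
                    problems.append(f"wn_domains entry {item!r} does not look like '(pos) lemma#N'")

    return problems


example = {
    "rw_domains": ["Sleep"],
    "rw_indices": {"CW": ["5.7.1"]},
    "wn_domains": ["(v) sleep#1", "(adv) together#4"],
}
problems = check_semantic_fields(example)
```

Returning a list of problems rather than raising on the first one is a design choice for this sketch: when checking a whole dictionary import, it is usually more convenient to report every malformed field of an entry at once.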