Skip to content

Commit 3d3e203

Browse files
committed
Changed dictionary data docs to update to current semantic classification requirements
1 parent a212159 commit 3d3e203

File tree

1 file changed

+12
-1
lines changed

1 file changed

+12
-1
lines changed

docs/dictionary-data.md

Lines changed: 12 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -419,7 +419,7 @@ that for you, so it could be as simple as `./run.js --help`.
419419
## Cree `linguistInfo`
420420

421421
For the Plains Cree dictionary, the following `linguistInfo` fields are
422-
used to display linguistic info in search results, and for showing emoji:
422+
used to display linguistic info in search results, to provide semantic class information, and for showing emoji:
423423

424424
- `inflectional_category`, String: The inflectional category for an
425425
entry, with hyphen, e.g., `NI-1`. (CW's `\ps`)
@@ -428,6 +428,15 @@ used to display linguistic info in search results, and for showing emoji:
428428
If we were naming this today following our glossary, we would call it
429429
the *general word class*.
430430

431+
- `rw_domains`, list of String: The RapidWords semantic classification domain names for this entry, in the canonical form defined in both rapidwords.net and semdom.org: e.g. `[ "Sleep" ]`
432+
433+
- `rw_indices`, dictionary mapping String to a list of String: For each of the sources in the entry (using the same short abbreviations as in `source`), we provide the list of indices for the RapidWords semantic classification domains for the entry, in the canonical form defined in both rapidwords.net and semdom.org: e.g.:
434+
```
435+
{
436+
"CW": [ "5.7.1" ]
437+
}
438+
```
439+
431440
- `stem`, String: The FST stem for this entry.
432441
433442
For Plains Cree specifically, there are two variants of linguistic
@@ -459,6 +468,8 @@ used to display linguistic info in search results, and for showing emoji:
459468
of the minimal CW stem were some day added to morphodict, that would of
460469
course require the dictionary data to include that data at that time.
461470
471+
- `wn_domains`, a list of String: The WordNet semantic classifications for this entry, using the same format as in the Altlab wordnet server, e.g., `[ "(v) sleep#1", "(adv) together#4" ]`.
472+
462473
- `wordclass`, String: The word class for this entry (`VTA` / `VAI` / etc.).
463474
At one time our glossary called this a *specific word class*.
464475

0 commit comments

Comments
 (0)