Tweak the ranking of the Cree entries within the senses. Currently we use corpus-based lemma frequencies where they exist, but we might also want to factor in the glossary counts and the dictionary-morpheme-based entry frequencies. [This needs an update of the source files with the frequencies, by @aarppe]
The corpus-based lemma frequencies cover only a part of all the Cree entries in CW and the other dictionary resources, and they are skewed due to the corpora that we have. The following would be options to consider:
1. Include the glossary-based rankings. This will ensure that core vocabulary is ranked up (some 3 thousand entries).
2. Include dictionary-based morpheme aggregate rankings. This will ensure that all entries in CW (over 30k) will receive a ranking (which will cover most of the other sources as well).
3. Include the extent of matches of English search terms (the lexical parts remaining after English phrase analysis) with the English definitions of the Cree entries under each sense.
4. Include an improved form of vector similarity between the English search terms and the English definitions of the Cree entries.
5. A ranked combination of 1-4 above.
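As a rough illustration of what a ranked combination might look like, here is a minimal Python sketch. Everything in it is an assumption made for discussion, not existing project code: the `Entry` fields, the function names, the signal names in `weights`, and the weighted sum of per-signal rank positions are all hypothetical. Entries that lack a value for a signal (for example, no corpus frequency) share the last rank for that signal, so they are penalised there but can still be ranked by the other signals. Vector similarity is left out; it could be added as one more signal in the same way.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Entry:
    # Hypothetical record; field names are assumptions, not the real schema.
    lemma: str                              # assumed unique within one sense
    definition: str                         # English definition text
    corpus_freq: Optional[int] = None       # None if the lemma is absent from the corpora
    glossary_count: Optional[int] = None    # None if the lemma is not in any glossary
    morpheme_score: Optional[float] = None  # dictionary-based morpheme aggregate


def definition_overlap(search_terms, definition):
    """Fraction of search terms that occur as whole words in the definition."""
    words = set(definition.lower().replace(",", " ").replace(";", " ").split())
    terms = [t.lower() for t in search_terms]
    if not terms:
        return 0.0
    return sum(t in words for t in terms) / len(terms)


def rank_positions(entries, key):
    """Map lemma -> 0-based rank by descending signal value.

    Entries with no value for the signal share the last rank.
    Ties among scored entries are broken by input order (stable sort).
    """
    scored = [e for e in entries if key(e) is not None]
    scored.sort(key=key, reverse=True)
    positions = {e.lemma: i for i, e in enumerate(scored)}
    last = len(scored)
    return {e.lemma: positions.get(e.lemma, last) for e in entries}


def combined_ranking(entries, search_terms, weights):
    """Order entries by a weighted sum of per-signal rank positions (lower is better)."""
    signals = {
        "corpus": lambda e: e.corpus_freq,
        "glossary": lambda e: e.glossary_count,
        "morpheme": lambda e: e.morpheme_score,
        "overlap": lambda e: definition_overlap(search_terms, e.definition),
    }
    totals = {e.lemma: 0.0 for e in entries}
    for name, key in signals.items():
        ranks = rank_positions(entries, key)
        for lemma, position in ranks.items():
            totals[lemma] += weights.get(name, 0.0) * position
    return sorted(entries, key=lambda e: totals[e.lemma])
```

Combining rank positions rather than raw values sidesteps the problem that the signals are on different scales (counts versus fractions), at the cost of discarding how large the differences between entries are. The weights would need tuning against real search results.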
Originally posted by @aarppe in #1138 (comment)