Skip to content

Runnable morphological analysis tools from the UniMorph project

Notifications You must be signed in to change notification settings

unimorph/analyzers

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

analyzers

Runnable Morphological Analysis Tools from the UniMorph Project

Warning

This software is at an alpha stage.

Prerequisites

  1. DirecTL+

  2. CTranslate

Installation

Due to the size of DirecTL+ models, we do not include them here; please contact gnicola2 AT jhu DOT edu for pre-trained models. Uncompress DTL models into models/DTL directory. See the Releases tab above to download.

tar -xvzf DTLModel.tgz

Set environment variables to point to required binaries.

export DTL=<location of DTL binary>
export CTRANSLATE=<location of ctranslate binary>

Usage

python src/analyze.py -i input.wordlist -o output.analyses -l language

For example:

python analyze.py -i Welsh.toAnalyze -o Welsh.out -l welsh

Supported Languages

adyghe, albanian, amharic, arabic, armenian, asturian, azeri
bashkir, basque, belarusian, bengali, breton, bulgarian, 
catalan, cornish, crimean-tatar, czech, danish, dutch,
english, estonian, faroese, finnish, french, friulian, galician,
georgian, german, greek, greenlandic, haida, hebrew, hindi, 
hungarian, icelandic, indonesian, ingrian, irish, italian, 
kabardian, kannada, karelian, kashubian, kazakh, khakas,
khaling, kurmanji, ladin, latin, latvian, lithuanian, livonian, 
lower-sorbian, macedonian, maltese, mapudungun, middle-high-german,
middle-low-german, murrinhpatha, navajo, neapolitan, norman,
northern-sami, norwegian-bokmaal, norwegian-nynorsk, occitan,
old-armenian, old-french, old-irish, old-saxon, pastho, persian,
polish, portuguese, quechua, romanian, russian, sanskrit, 
scottish-gaelic, serbo-croatian, slovak, slovene, sorani,
spanish, swahili, swedish, tagalog, tamil, tatar, telugu, 
tibetan, turkish, turkmen, ukrainian, urdu, uzbek, venetian, welsh, 
west-frisian, yiddish, zulu, ...

About

Runnable morphological analysis tools from the UniMorph project

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages