Skip to content

Usage Examples

Luiz Rocha edited this page Apr 22, 2016 · 5 revisions

Creating a new stemmer instance:

stemmer = OrengoStemmer() # or PorterStemmer() or SavoyStemmer()

Simple usage:

print stemmer.getWordStem("extremamente")

Ignoring stopwords and known entities:

stemmer.enableCaching(1000)
stemmer.ignore(PTStemmerUtilities.fileToSet("data/stopwords.txt"))
stemmer.ignore(PTStemmerUtilities.fileToSet("data/namedEntities.txt")) 

stem = stemmer.getWordStem("ciências") print PTStemmerUtilities.removeDiacritics(stem)
Clone this wiki locally