Skip to content

Latest commit

 

History

History
13 lines (11 loc) · 422 Bytes

README.md

File metadata and controls

13 lines (11 loc) · 422 Bytes

Text normalization tool (supports russian language)

Using

text = 'Пример текста для нормализации. Пример текста для нормализации'
text_normalizer = TextNormalizer()
result = text_normalizer.normalize_text(
    text=texts, 
    split_words=True, 
    split_sentences=True, 
    stop_words_ignore=True, 
    split_docs=False
)