strings-regular-expressions-and-text-data-analysis

While built-in string methods have limited flexibility and regular expressions have limited expressive power, both can still be leveraged in creative ways to implement scalable workflows that process and analyze text data. This article explores these tools and introduces a few useful peripheral techniques within the context of a use case involving a large text data corpus: the set of article abstracts found in the English-language edition of Wikipedia.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

strings-regular-expressions-and-text-data-analysis

Files

README.md

Latest commit

History

README.md

File metadata and controls

strings-regular-expressions-and-text-data-analysis