The code is a collection of NLP analyses, including text cleaning, most common words, n-grams generation, co-occurrence matrix generation, wordcloud generation, topic modeling (using Latent Dirichlet Allocation), and general text statistics.
Analysis and operations being performed:
- Text cleaning
- Generation of n-grams
- Word frequency analysis
- Wordcloud visualization
- Topic modeling
- General text statistics