Welcome to the coding task for the Team Process Mapping project! Before you begin, please make sure to read through the full instructions here.
The packages required for you to complete the task are listed in requirements.txt
. You can use a virtual environment for managing the dependencies associated with this project.
To complete this assignment, you will need to modify 4 different files:
- features/word_count.py:
count_words(text)
- features/type_token_ratio.py:
get_word_ttr(text)
- features/politeness_features.py:
get_politeness_strategies(text)
- utils/calculate_chat_level_features.py (For calling the functions; you only need to modify
apply_politeness()
).
At the end of Part 2, we will need the following:
- A link to your GitHub cloned repository. This should contain:
- Your Python code;
- Your Part 2 Reflection (directly edit this README!).
- A copy of your chat-level CSV that contains columns for the features you generated. Note: this should be reproducible! We should be able to get the same results by running your code from the GitHub link you submit.
Please answer the following four questions:
Open up your output CSV and look at the columns you generated for each of the three features. Do the values “make sense” intuitively? Why or why not?
[Your Answer here]
How would you implement tests for these features?
[Your Answer here]
Please provide an overall reflection of your experience. How did you approach this task? What challenge(s) did you encounter? Is there anything you would be curious to explore in the future, if you had more time?
[Your Answer here]
How much time did it take you to complete Parts 1 and 2? (Please be honest; we are looking for feedback to make sure the tasks are scoped appropriately.)
[Your Answer here]