People type with typos or unordered words inevitably. But surprisingly, we still get to understand these passages with minor errors.
e.g. 研表究明,漢字的序順並不定一影閱響讀,比如當你看完這句話後,才發這現裡的字全是都亂的。
In this research, we focused on misordered characters in Mandarin and aim to answer the following research questions:
- What types of "mess" can we tolerate in reading comprehension?
- How "messy" can an understandable sentence be?
For the research details and experiment results, please refer to the slides.
Feel free to try out the experiment webpage at https://linguisticsexperiment.z7.web.core.windows.net/.
Note that the API for data export is currently disabled, so your data won't be collected.
If you are curious about the data analysis results, feel free to reproduce them with the following commands.
cd data_analysis
python3 analysis.py