Skip to content
This repository was archived by the owner on Apr 10, 2021. It is now read-only.

Master Thesis: Detection of historical spelling variations using cluster analysis and string similarity.

Notifications You must be signed in to change notification settings

martialblog/master-thesis-code

Folders and files

NameName
Last commit message
Last commit date

Latest commit

3a9fb68 · Feb 10, 2021

History

68 Commits
Aug 25, 2020
Jan 8, 2021
Jan 8, 2021
Oct 29, 2020
Aug 25, 2020
Feb 10, 2021
Feb 10, 2021
Aug 29, 2020

Repository files navigation

Detection of historical spelling variations using cluster analysisand string similarity

This repository contains the source code for the master thesis: Detection of historical spelling variations using cluster analysis and string similarity.

Jupyter Setup

The Jupyter Notebooks provide an overview of the experiments and preprocessing.

python3 -m venv
source .venv/bin/activate

# (optional) pip update, if errors occur during installation
pip3 install -U pip

pip3 install -r requirements.txt
jupyter labextension install jupyterlab-plotly

jupyter lab

Hint: Some Jupyter graphics require Node.js

Stacked Tree Setup

The Stacked Tree Visualisation is an interactive tool to explore the cluster analysis.

# Requires a local webserver to load the HTML/Javascript.
# Examples:

python3 -m http.server
php -S localhost:8000
docker run -ti --rm -v $(pwd):/usr/share/nginx/html -p 8000:80 nginx:stable

About

Master Thesis: Detection of historical spelling variations using cluster analysis and string similarity.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published