Language Classifier

This project focuses on training an algorithm to correctly identify the language that the user types in.

The algorithms I used were:

K Nearest Neighbors
Multinomial Naive Bayes
Random Forest

Out of these three algorithms, Random Forest performed the best (although Multinomial Nayes Bayes was a close second).

Note: In case you're wondering how I chose the parameters for each algorithm: I used RandomizedSearchCV from the sci-kit learn library to arrive at those parameters. I did not include the code for this because it took a long time to run each time.

How to Run the Application

There are two ways to run the Streamlit web application:

1. Visit the website

The simplest way to view the web app is by visiting the following link: https://share.streamlit.io/johng034/language-classifier/app.py

2. Run locally

If you wish to run the application on your machine, then complete the following steps:

Clone the repository (click here for instructions on how to clone a GitHub repository)
Open the folder of this repository in your editor of choice (or in the terminal/command prompt)
In the terminal, install the packages with pip install requirements.txt (you may want to install using a virtual environment for this)
Once the packages are installed, you can run the streamlit application by typing streamlit run app.py in the terminal

Improvements

I am currently considering adding data from wikipedia pages or tweets to train the algorithm on a wider range of data.

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
.ipynb_checkpoints		.ipynb_checkpoints
.vscode		.vscode
__pycache__		__pycache__
saved-items		saved-items
wili-2018		wili-2018
1. Data Preparation.ipynb		1. Data Preparation.ipynb
2. Modeling.ipynb		2. Modeling.ipynb
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Language Classifier

How to Run the Application

1. Visit the website

2. Run locally

Improvements

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

johng034/Language-Classifier

Folders and files

Latest commit

History

Repository files navigation

Language Classifier

How to Run the Application

1. Visit the website

2. Run locally

Improvements

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages