.gitignore
README.md
bert_pred.py
bert_train.py
conlleval.py
datamodule.py
lstm.py
output.txt
pred.py
preprocessing.py
train.py
train_pred_eval.py
utils.py
w2v.py

CoNLL-2000 Chunking Shared Task

Dataset: https://www.clips.uantwerpen.be/conll2000/chunking/
conlleval.py: https://github.com/sighsmile/conlleval/blob/master/conlleval.py
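
The CoNLL-2000 files are plain text with one token per line (word, POS tag, chunk tag, separated by spaces) and a blank line between sentences. A minimal sketch of reading that format follows; the function name `read_conll` is hypothetical and this is not necessarily how preprocessing.py does it.

```python
from typing import Iterable, List, Tuple

Token = Tuple[str, str, str]  # (word, POS tag, chunk tag)


def read_conll(lines: Iterable[str]) -> List[List[Token]]:
    """Group 'word POS chunk' lines into sentences (hypothetical helper)."""
    sentences: List[List[Token]] = []
    current: List[Token] = []
    for line in lines:
        line = line.strip()
        if not line:
            # A blank line marks the end of a sentence.
            if current:
                sentences.append(current)
                current = []
            continue
        word, pos, chunk = line.split()
        current.append((word, pos, chunk))
    if current:
        # Flush the last sentence if the input has no trailing blank line.
        sentences.append(current)
    return sentences


sample = """He PRP B-NP
reckons VBZ B-VP
the DT B-NP
current JJ I-NP

It PRP B-NP
""".splitlines()

sentences = read_conll(sample)
print(len(sentences), sentences[0][1])
```

To read a real file, pass an open file handle instead of `sample`, for example `read_conll(open(path, encoding="utf-8"))`, where `path` points at the unzipped training or test file.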

Download and unzip the dataset.
Set the dataset path in preprocessing.py.
Set the hyperparameters in train.py.
Change any other settings as needed.
Run python train.py; to train with BERT, run python bert_train.py instead.
Set the model path and run python pred.py (or python bert_pred.py for BERT).
Run python conlleval.py.
You can monitor the loss curve with TensorBoard.
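
The steps above can be sketched as a small helper that lists the commands in order. This is an illustration only: the function name `pipeline_commands` is hypothetical, it assumes the scripts are run from the repository root with no extra arguments, and it assumes bert_pred.py is the prediction script for the BERT model.

```python
from typing import List


def pipeline_commands(use_bert: bool = False) -> List[List[str]]:
    """Return the train / predict / evaluate commands in order (hypothetical helper)."""
    # Assumption: the bert_* scripts mirror train.py and pred.py for the BERT model.
    train_script = "bert_train.py" if use_bert else "train.py"
    pred_script = "bert_pred.py" if use_bert else "pred.py"
    return [
        ["python", train_script],
        ["python", pred_script],
        ["python", "conlleval.py"],
    ]


# Print the commands rather than running them, so the sketch has no side effects.
for cmd in pipeline_commands(use_bert=True):
    print(" ".join(cmd))
```

Each command could be passed to `subprocess.run(cmd, check=True)` to execute the steps in sequence, once the dataset path, hyperparameters, and model path have been set by hand as described.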

Trained models: https://drive.google.com/drive/folders/11uyVskbp9oLQVsj7A5lfIsKhhPJOoFKr?usp=sharing

Current scores:
LSTM-CRF: 92.6
LSTM-w2v-CRF: 92.49
BERT-CRF: 96.38 (the current output.txt)
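
Assuming the scores above are chunk-level F1 values as reported by conlleval-style evaluation (the README does not say so explicitly), the idea can be sketched as follows. A predicted chunk counts as correct only if its type and both boundaries match the gold chunk. The function names are hypothetical, and this simplified version handles a single sentence of BIO tags; it is not a replacement for conlleval.py.

```python
from typing import List, Set, Tuple

Chunk = Tuple[str, int, int]  # (chunk type, start index, end index exclusive)


def extract_chunks(tags: List[str]) -> Set[Chunk]:
    """Collect chunks from a BIO tag sequence (hypothetical helper)."""
    chunks: Set[Chunk] = set()
    start, ctype = None, None
    for i, tag in enumerate(tags + ["O"]):  # the sentinel "O" closes the last chunk
        prefix, _, label = tag.partition("-")
        # An open chunk ends on O, on any B- tag, or on an I- tag of another type.
        if ctype is not None and (prefix != "I" or label != ctype):
            chunks.add((ctype, start, i))
            start, ctype = None, None
        # A chunk starts on B-, or on an I- tag that has no chunk to continue.
        if prefix == "B" or (prefix == "I" and ctype is None):
            start, ctype = i, label
    return chunks


def chunk_f1(gold: List[str], pred: List[str]) -> float:
    """Chunk-level F1 between gold and predicted tags for one sentence."""
    gold_chunks, pred_chunks = extract_chunks(gold), extract_chunks(pred)
    correct = len(gold_chunks & pred_chunks)
    precision = correct / len(pred_chunks) if pred_chunks else 0.0
    recall = correct / len(gold_chunks) if gold_chunks else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


gold = ["B-NP", "I-NP", "B-VP", "B-NP", "O"]
pred = ["B-NP", "I-NP", "B-VP", "O", "O"]
score = chunk_f1(gold, pred)
print(round(100 * score, 2))
```

Here the prediction finds two of the three gold chunks and proposes no wrong ones, so precision is 1.0 and recall is 2/3. A corpus-level score would sum the chunk counts over all sentences before computing precision and recall, rather than averaging per-sentence values.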
