CyberClassic-trainer

This is the training environment for the models of the CyberClassic collection. The current training pipeline consists of three steps: fine-tuning a GPT-2 model, fine-tuning a T5 model, and reinforcement learning of the text generator. The environment uses two separate datasets (a loading sketch follows the list):

  • True dataset. Size: 13,048 rows. Column: Text, a single sentence from the texts of Dostoevsky F.M.
  • False dataset. Size: 5,771 rows. Column: Text, a single sentence from the texts of Kuprin A.I. or a sentence generated with RuGPT3.
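
The snippet below is a minimal sketch of loading the two datasets with pandas; the file names true_dataset.csv / false_dataset.csv and the lower-case column name text are assumptions, not taken from this repository.

```python
import pandas as pd

# Assumed file names and a lower-case "text" column; adjust to the actual dataset files.
true_df = pd.read_csv("true_dataset.csv")    # 13,048 Dostoevsky sentences
false_df = pd.read_csv("false_dataset.csv")  # 5,771 Kuprin / RuGPT3 sentences

print(true_df.shape, false_df.shape)
```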

Fine-tune GPT-2 model

In this step the base GPT-2 model is fine-tuned on the true dataset.
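
The sketch below shows one way to run this step with the Hugging Face Trainer API. The checkpoint name sberbank-ai/rugpt3small_based_on_gpt2, the file name true_dataset.csv, and the hyperparameters are assumptions, not the repository's actual configuration.

```python
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

model_name = "sberbank-ai/rugpt3small_based_on_gpt2"  # assumed Russian GPT-2 checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token             # GPT-2 has no pad token by default
model = AutoModelForCausalLM.from_pretrained(model_name)

dataset = load_dataset("csv", data_files={"train": "true_dataset.csv"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=128)

tokenized = dataset.map(tokenize, batched=True, remove_columns=["text"])
collator = DataCollatorForLanguageModeling(tokenizer, mlm=False)  # causal LM objective

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="gpt2-dostoevsky", num_train_epochs=3),
    train_dataset=tokenized["train"],
    data_collator=collator,
)
trainer.train()
```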

Fine-tune T5 model

In this step we take 6,000 rows from the true dataset, add a new column "labels" with the value 1 for the true dataset and 0 for the false dataset, and concatenate the two. The result is a model that classifies whether a text sequence belongs to the style of Dostoevsky F.M.
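
A minimal sketch of this step is shown below, assuming the ai-forever/ruT5-base checkpoint, the dataset file names from the sketch above, and that a recent transformers version is used (which can load T5 checkpoints through AutoModelForSequenceClassification); none of these details come from the repository itself.

```python
import pandas as pd
from datasets import Dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          DataCollatorWithPadding, Trainer, TrainingArguments)

# Label 6,000 true rows as 1 and all false rows as 0, then concatenate and shuffle.
true_df = pd.read_csv("true_dataset.csv").sample(n=6000, random_state=42)
false_df = pd.read_csv("false_dataset.csv")
true_df["labels"] = 1   # Dostoevsky style
false_df["labels"] = 0  # not Dostoevsky style
df = (pd.concat([true_df, false_df], ignore_index=True)
        .sample(frac=1.0, random_state=42)
        .reset_index(drop=True))

checkpoint = "ai-forever/ruT5-base"  # assumed Russian T5 checkpoint
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSequenceClassification.from_pretrained(checkpoint, num_labels=2)

dataset = Dataset.from_pandas(df)
dataset = dataset.map(lambda b: tokenizer(b["text"], truncation=True, max_length=128),
                      batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="t5-dostoevsky-classifier", num_train_epochs=3),
    train_dataset=dataset,
    data_collator=DataCollatorWithPadding(tokenizer),
)
trainer.train()
```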

Reinforcement learning

In this step we perform a second round of training of the text-generation model using the TRL library. The reward function is simply the classifier score multiplied by 10.
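
The sketch below illustrates this step with TRL's PPOTrainer (older 0.x-style API). The local checkpoint paths gpt2-dostoevsky and t5-dostoevsky-classifier, the prompts, and the generation settings are assumptions carried over from the sketches above, not the repository's actual code.

```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer
from trl import AutoModelForCausalLMWithValueHead, PPOConfig, PPOTrainer

# Generator fine-tuned in step 1 (policy and frozen reference copy).
gen_tokenizer = AutoTokenizer.from_pretrained("gpt2-dostoevsky")
gen_tokenizer.pad_token = gen_tokenizer.eos_token
model = AutoModelForCausalLMWithValueHead.from_pretrained("gpt2-dostoevsky")
ref_model = AutoModelForCausalLMWithValueHead.from_pretrained("gpt2-dostoevsky")

# Style classifier fine-tuned in step 2, used as the reward model.
clf_tokenizer = AutoTokenizer.from_pretrained("t5-dostoevsky-classifier")
classifier = AutoModelForSequenceClassification.from_pretrained("t5-dostoevsky-classifier")

prompts = ["Я думал о том, что", "Он вошёл в комнату и", "Ночь была тёмная,", "Сердце его"]
config = PPOConfig(batch_size=len(prompts), mini_batch_size=len(prompts))
ppo_trainer = PPOTrainer(config, model, ref_model, gen_tokenizer)

for _ in range(10):  # a few PPO iterations for illustration
    query_tensors = [gen_tokenizer(p, return_tensors="pt").input_ids.squeeze(0) for p in prompts]
    response_tensors = ppo_trainer.generate(
        query_tensors, return_prompt=False, max_new_tokens=32, do_sample=True
    )
    texts = [gen_tokenizer.decode(r, skip_special_tokens=True) for r in response_tensors]

    # Reward = classifier probability of the "Dostoevsky" class, multiplied by 10.
    with torch.no_grad():
        enc = clf_tokenizer(texts, return_tensors="pt", padding=True, truncation=True)
        scores = classifier(**enc).logits.softmax(dim=-1)[:, 1]
    rewards = [10.0 * s for s in scores]

    ppo_trainer.step(query_tensors, response_tensors, rewards)
```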

Part of the CyberClassic model

Training environment for the ML model of the Telegram bot

HuggingFace Collection