- Jožef Stefan Institute
- Ljubljana, Slovenia
- @TajaKuzman
- in/taja-kuzman
Pinned
- AGILE-Automatic-Genre-Identification-Benchmark
A benchmark for evaluating the robustness of automatic genre identification models, testing their usability for automatically enriching large text collections with genre information.
Jupyter Notebook 4
- Hate-Speech-Classification
Classification of hate speech and of its implicitness, using Transformer language models (BERT). This repository can serve as an introduction to text classification with BERT-like models.
Jupyter Notebook
- NER-recognition
An evaluation of various encoder Transformer-based large language models on the named entity recognition task. The models are compared on 6 datasets manually annotated with named entities.
Jupyter Notebook
- Parlamint-translation
A pipeline for machine translation (using OPUS-MT models) of parliamentary text collections in 30+ languages (ParlaMint corpora). The pipeline includes parsing TEI XML and CoNLL-U files, linguistic…
Jupyter Notebook 2
- Topic-Classification-FastText-Transformers
Training and evaluating topic classification models (fastText and Transformer-based language models) on Slovenian news texts. The repository can be used as a tutorial to le…
Jupyter Notebook 4