Add ModernBERT config #119

jamt9000 · 2024-12-28T20:50:23Z

Adds a ModernBERT config for the original toxic comment classification challenge, using the ModernBERT-base model.

Training currently requires installing transformers from git; see huggingface/transformers#35362 (comment)

CUDA_VISIBLE_DEVICES=1 python train.py -c configs/Toxic_comment_classification_ModernBERT.json

jamt9000 · 2024-12-29T20:09:20Z

Currently the ModernBERT forward pass gives NaN outputs for most rows of the batch:

outputs
tensor([[    nan,     nan,     nan,     nan,     nan,     nan],
        [    nan,     nan,     nan,     nan,     nan,     nan],
        [    nan,     nan,     nan,     nan,     nan,     nan],
        [    nan,     nan,     nan,     nan,     nan,     nan],
        [    nan,     nan,     nan,     nan,     nan,     nan],
        [    nan,     nan,     nan,     nan,     nan,     nan],
        [-0.8802, -0.9810, -0.5694, -0.4875,  0.7263,  0.0181],
        [    nan,     nan,     nan,     nan,     nan,     nan],
        [    nan,     nan,     nan,     nan,     nan,     nan],
        [    nan,     nan,     nan,     nan,     nan,     nan]],
       device='cuda:0')

Edit: This seems to be fixed after installing flash attention: pip install flash-attn --no-build-isolation
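
For anyone trying to reproduce this, a quick way to see how many rows of a batch are affected is to scan the outputs for NaN. The following is only a minimal sketch using the standard library; the helper name `rows_with_nan` and the sample values are illustrative and not part of this PR.

```python
import math


def rows_with_nan(rows):
    """Return the indices of rows that contain at least one NaN value."""
    return [i for i, row in enumerate(rows) if any(math.isnan(v) for v in row)]


nan = float("nan")

# Illustrative batch shaped like the output above: 6 logits per row.
sample = [
    [nan] * 6,
    [-0.8802, -0.9810, -0.5694, -0.4875, 0.7263, 0.0181],
    [nan] * 6,
]

bad = rows_with_nan(sample)
print(f"{len(bad)} of {len(sample)} rows contain NaN: {bad}")
```

With a PyTorch tensor, the same check could be run as rows_with_nan(outputs.tolist()).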

jamt9000 · 2025-01-02T09:13:06Z

configs/Toxic_comment_classification_ModernBERT.json

+        }
+    },
+    "optimizer": {
+        "type": "Adam",


Might be worth using the hyperparameters given here:

https://github.com/AnswerDotAI/ModernBERT/blob/main/examples/finetune_modernbert_on_glue.ipynb

"optimizer": {
    "type": "AdamW",
    "args": {
        "lr": 8e-5,
        "weight_decay": 8e-6,
        "betas": [0.9, 0.98],
        "eps": 1e-6,
        "amsgrad": false
    }
}

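To make the suggestion concrete, here is a rough sketch of how an optimizer block like the one suggested above could be turned into a torch.optim.AdamW instance. It assumes the training script resolves "type" by name on torch.optim and passes "args" through as keyword arguments; that is an assumption about train.py, not something confirmed in this PR, and the helper names are hypothetical.

```python
import json

# The suggested optimizer block, wrapped in braces so it parses as a JSON document.
OPTIMIZER_JSON = """
{
    "optimizer": {
        "type": "AdamW",
        "args": {
            "lr": 8e-5,
            "weight_decay": 8e-6,
            "betas": [0.9, 0.98],
            "eps": 1e-6,
            "amsgrad": false
        }
    }
}
"""


def optimizer_spec(config_text):
    """Return (optimizer class name, keyword arguments) from the config text."""
    cfg = json.loads(config_text)["optimizer"]
    kwargs = dict(cfg["args"])
    # JSON has no tuple type, so convert the betas list to a tuple.
    kwargs["betas"] = tuple(kwargs["betas"])
    return cfg["type"], kwargs


def build_optimizer(params, config_text):
    """Hypothetical helper: look the optimizer up by name on torch.optim."""
    import torch  # imported lazily so the parsing above works without PyTorch

    name, kwargs = optimizer_spec(config_text)
    return getattr(torch.optim, name)(params, **kwargs)


name, kwargs = optimizer_spec(OPTIMIZER_JSON)
print(name, kwargs)
```

Calling build_optimizer(model.parameters(), OPTIMIZER_JSON) would then be equivalent to constructing torch.optim.AdamW with these keyword arguments.
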
laurahanu

Nice! It would be good to have something about the benefits of ModernBERT in the description too.

Add ModernBERT config

426781f

jamt9000 commented Jan 2, 2025


jamt9000 requested a review from laurahanu January 2, 2025 09:13

laurahanu approved these changes Jan 2, 2025

