max-andr

Follow

🚀

Maksym Andriushchenko max-andr

🚀

Follow

Faculty at ‪the ELLIS Institute Tübingen and Max Planck Institute for Intelligent Systems. Leading the AI Safety and Alignment group.

310 followers · 357 following

ELLIS Institute Tübingen
Tübingen
15:56 (UTC +01:00)
https://andriushchenko.me/
@maksym_andr
https://scholar.google.com/citations?user=ZNtuJYoAAAAJ
https://aisagroup.substack.com/

Achievements

Achievements

Organizations

Pinned Loading

aisa-group/PostTrainBench aisa-group/PostTrainBench Public

PostTrainBench measures how well CLI agents like Claude Code or Codex CLI can post-train base LLMs on a single H100 GPU in 10 hours

Python 139 18
tml-epfl/os-harm tml-epfl/os-harm Public

OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents [NeurIPS 2025 Spotlight]

Jupyter Notebook 48 3
tml-epfl/llm-past-tense tml-epfl/llm-past-tense Public

Does Refusal Training in LLMs Generalize to the Past Tense? [ICLR 2025]

Python 77 11
tml-epfl/llm-adaptive-attacks tml-epfl/llm-adaptive-attacks Public

Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025]

Shell 377 43
JailbreakBench/jailbreakbench JailbreakBench/jailbreakbench Public

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models [NeurIPS 2024 Datasets and Benchmarks Track]

Python 531 58
RobustBench/robustbench RobustBench/robustbench Public

RobustBench: a standardized adversarial robustness benchmark [NeurIPS 2021 Benchmarks and Datasets Track]

Python 769 101