Skip to content
@uw-nsl

UW-NSL

Network Security Lab at University of Washington

Pinned Loading

  1. SafeDecoding SafeDecoding Public

    Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding

    Jupyter Notebook 129 11

  2. ArtPrompt ArtPrompt Public

    [ACL24] Official Repo of Paper `ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs`

    Python 67 14

  3. ChatBug ChatBug Public

    [AAAI25] Official Repo of Paper `ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates`

    Python 7

  4. CleanGen CleanGen Public

    [EMNLP 24] Official Implementation of CLEANGEN: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models

    Python 14 2

  5. safechain safechain Public

    SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities

    Python 12 2

Repositories

Showing 8 of 8 repositories
  • magpie Public Forked from magpie-align/magpie
    uw-nsl/magpie’s past year of commit activity
    Python 0 MIT 60 0 0 Updated Apr 8, 2025
  • safechain Public

    SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities

    uw-nsl/safechain’s past year of commit activity
    Python 12 GPL-3.0 2 0 0 Updated Apr 2, 2025
  • ChatBug Public

    [AAAI25] Official Repo of Paper `ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates`

    uw-nsl/ChatBug’s past year of commit activity
    Python 7 MIT 0 0 0 Updated Mar 22, 2025
  • kodcode Public Forked from KodCode-AI/kodcode

    Generate diverse coding questions and verifiable solutions - all in one framework

    uw-nsl/kodcode’s past year of commit activity
    Python 0 Apache-2.0 10 0 0 Updated Mar 15, 2025
  • CleanGen Public

    [EMNLP 24] Official Implementation of CLEANGEN: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models

    uw-nsl/CleanGen’s past year of commit activity
    Python 14 2 1 0 Updated Mar 9, 2025
  • ArtPrompt Public

    [ACL24] Official Repo of Paper `ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs`

    uw-nsl/ArtPrompt’s past year of commit activity
    Python 67 MIT 14 0 0 Updated Mar 7, 2025
  • SafeDecoding Public

    Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding

    uw-nsl/SafeDecoding’s past year of commit activity
    Jupyter Notebook 129 MIT 11 1 1 Updated Jul 19, 2024
  • edc Public

    Source Code for "EDC: Effective and Efficient Dialog Comprehension For Dialog State Tracking" (NAACL 2024)

    uw-nsl/edc’s past year of commit activity
    Python 0 0 1 0 Updated Jun 18, 2024

Top languages

Loading…

Most used topics

Loading…