WhitzardAgent

WhitzardAgent is a research group supported by Fudan, working on the security and safety of agentic systems powered by foundation models.

Hi there 👋 This is WhitzardAgent

We are a research group focused on building safer AI systems, with an emphasis on the security and safety of agentic systems powered by foundation models.

Ongoing Projects

Agent Infrastructure

Framework

  • YOGA - Yet Another Generalist Agent (Modular and Extensible)

    GitHub

Data Synthesis

  • Mirror GUI - LLM-based GUI Simulator for Agentic Data Synthesis and Evaluation

    GitHub

    Mirror GUI is a GUI simulator driven by large language models (LLMs), designed to test and evaluate AI agents interacting with a desktop-like environment. It simulates an Ubuntu-style desktop with application windows, UI elements and a file system so agents can perform GUI actions and researchers can analyze behavior and safety.

Agentic Security Toolkits

CoT Monitoring and Correction

  • ThoughtAligner - A plug-and-play safety aligner module for your tool-use agent's chain-of-thought

    arXiv Hugging Face GitHub

Are you worried about your AI deleting important files without asking for permission, or doing something unexpected and dangerous? ThoughtAligner is here for you.

  • MirrorGuard - A plug-and-play safety aligner module for your GUI agent's chain-of-thought

    arXiv Hugging Face GitHub

Computer-use agents (CUAs) pose additional challenges because of their multimodal nature. MirrorGuard is here for you.

  • ReasoningShield - A lightweight model for monitoring the content safety of the CoT.

    arXiv GitHub Model 1B Model 3B Dataset

Agent Sandbox

  • XuanwuBox - Your AI security advisor in the Docker runtime for your agentic system (To be released)

    GitHub

Frontier AI Safety Research

  • NVWA Project - Preparing for the emergence of silicon-based life. Identifying the risks of autonomy

AI research is accelerating the transition toward silicon-based life. Our mission is to identify the risks of autonomous emergence, prevent uncontrolled proliferation, and develop essential control technologies.

Pinned

  1. XuanwuBox (Public)

    An intelligent secure layer for agentic execution environments.

  2. .github (Public)
