WhitzardAgent

Hi there 👋 This is WhitzardAgent

We are a research group focused on building safer AI systems. We are working on the security and safety of LLM-powered agentic systems powered by foundation models.

Ongoing Projects

Agent Infrastructure

Framework

YOGA — Yet Another Generalist Agent (Modular and Extensible)

Data Synthesis

Mirror GUI — LLM-based GUI Simulator for Agentic Data Synthesis and Evaluation

Mirror GUI is a GUI simulator driven by large language models (LLMs), designed to test and evaluate AI agents interacting with a desktop-like environment. It simulates an Ubuntu-style desktop with application windows, UI elements and a file system so agents can perform GUI actions and researchers can analyze behavior and safety.

Agentic Security Toolkits

CoT Monitoring and Correction

ThoughtAligner - A plug-and-play safety aligner module for your tool-use agent's chain-of-thought

Are you worried about your AI deleting your important files without asking for permission? Or it just does something unexpected yet dangerous. ThoughtAligner is here for you.

MirrorGuard - A plug-and-play safety aligner module for your GUI agent's chain-of-thought

CUA Agents provide additional challenges due to its multi-modal nature. Don't worry. MirrorGuard is here for you.

ReasoningShield - A lightweight model for monitoring the content safety of the CoT.

Agent Sandbox

XuanwuBox - Your AI security advisor in the Docker runtime for your agentic system (To be released)

Frontier AI Safety Research

NVWA Project - Preparing for the emergence of silicon-based life. Indentifying the risks of autonomy

AI research is accelerating the transition toward silicon-based life. Our mission is to identify the risks of autonomous emergence, prevent uncontrolled proliferation, and develop essential control technologies.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WhitzardAgent

Hi there 👋 This is WhitzardAgent

Ongoing Projects

Agent Infrastructure

Framework

Data Synthesis

Agentic Security Toolkits

CoT Monitoring and Correction

Agent Sandbox

Frontier AI Safety Research

Pinned Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!