v0.9.0.11
What's Changed
- Probe for repetition-based nudging into replay/spurious generation by @leondz in #404
- Probe for invisible text prompt injections by @leondz in #397
- Probe for the 'DAN in the wild' paper's library of jailbreak prompts by @leondz in #405
- Probe for NYT & The Guardian content in training data by @leondz in #402
- Add NVIDIA cloud functions generator by @leondz in #398
- Add toxicity generation deep test config by @leondz in #413
- Generator enhancements and minor improvements by @shubhobm in #391
- Update HF inf api generator to match their current expectations by @leondz in #400
- Invoke garak on the command line, with
garak
by @leondz in #410 - Mitigate continuation probe oversensitivity by @leondz in #394
- Handle nvcf container timeouts by @leondz in #399
- Fixing Exception Cause By Type Error When Scanning LLMs Via Replicate by @DavidLee528 in #401
- Make sure triggers attempt.note is saved in hitlog by @leondz in #403
- Repeat replay now optionally overrides generator max len by @leondz in #408
- Replay.Repeat now preserves attempt when restoring generator max_tokens by @leondz in #409
- Gracefully handle NVCF request timeouts & other failures by @leondz in #411
- Fix deprecated encoding by @leondz in #412
- Better coverage in mitigation bypass detector
Full Changelog: v0.9.0.10...v0.9.0.11