Helium Model Worldview includes structured cue-swap pairs (name swap, label swap, topic swap) that could seed PyRIT multi-turn adversarial scenarios.
304 prompts with per-model flip rates on Hugging Face. Safety split shows wide refusal/compliance spread across frontier models.
Dataset: https://huggingface.co/datasets/HeliumTrades/helium-model-worldview-benchmark
Scoring guide: https://heliumtrades.com/benchmarks/
Helium Model Worldview includes structured cue-swap pairs (name swap, label swap, topic swap) that could seed PyRIT multi-turn adversarial scenarios.
304 prompts with per-model flip rates on Hugging Face. Safety split shows wide refusal/compliance spread across frontier models.
Dataset: https://huggingface.co/datasets/HeliumTrades/helium-model-worldview-benchmark
Scoring guide: https://heliumtrades.com/benchmarks/