logo

An introduction to automated LLM red teaming

ID: d9a9567d-d86f-529c-8271-1fc6c2b456a9

STIX ID: report--d9a9567d-d86f-529c-8271-1fc6c2b456a9

Feed Name: NVISO Labs

Threat Score
30/100

Date Published: 2026-02-05

Date Updated: 2026-04-28

Author: Tanguy Snoeck

...
...

This article demonstrates automating LLM red teaming using promptfoo against a purposely vulnerable ChainLit chatbot: it explains the testing architecture (target/adversarial/grader models), describes plugins and strategies (security/access-control plugins such as BFLA, BOLA, RBAC and strategies like Iterative Jailbreaks), and shows lab results where the model sometimes performs unauthorized tool actions (e.g., retrieving SharePoint content). The piece emphasizes the probabilistic nature of LLM failures, the need for repeated testing and strategy tuning, and the importance of treating LLMs and agentic systems as attack surfaces requiring systematic security testing.

Your team is not currently subscribed to this feed. You must subscribe to it in order to see this post.