logo

Project Naptime: Evaluating Offensive Security Capabilities of Large Language Models

ID: fc2252a3-2e18-58b6-a5d6-b29d94009c85

STIX ID: report--fc2252a3-2e18-58b6-a5d6-b29d94009c85

Feed Name: Google Project Zero

Threat Score
30/100

Date Published: 2024-06-20

Date Updated: 2026-04-27

Author: Google Project Zero

...
...

Project Zero presents 'Naptime', an LLM-powered vulnerability research framework that equips models with specialised tools (code browser, Python interpreter, debugger, reporter) to iteratively discover and verify memory-safety bugs. Using this architecture the authors significantly improved results on the CyberSecEval2 benchmark for buffer overflow and advanced memory corruption tests and demonstrate a concrete proof-of-concept buffer overflow exploit in a sample C++ challenge, validated with AddressSanitizer.

Your team is not currently subscribed to this feed. You must subscribe to it in order to see this post.