MemSafeEval Part 1: A Bottom-Up Classification of Memory Safety Vulnerabilities in C and C++
Why frontier LLMs fail at vulnerability classification—and what it reveals about CWE labels
We help teams surface trust boundaries, invariants, and failure modes, and make them testable.