InfoTechTarget and Informa Tech's Digital Businesses Combine.

Together, we power an unparalleled network of 220+ online properties covering 10,000+ granular topics, serving an audience of 50+ million professionals with original, objective content from trusted sources. We help you gain critical insights and make more informed decisions across your business priorities.

Benchmarking AI Models: Metrics, Evaluations & Leaderboards

Presented by

Hack The Box

About this talk

As organizations experiment with AI in security, a key challenge is measuring whether AI agents are actually effective and improving over time. This webinar focuses on benchmarking and evaluation methods for AI in cyber operations, exploring which performance metrics truly matter—from threat detection accuracy and exploit success rates to response speed and reliability. We’ll also walk through Hack The Box’s AI Range methodology, showcasing board-ready scorecards and leaderboards that compare AI models on common security scenarios, such as an OWASP Top 10 web application framework. Attendees will learn how to validate AI security performance and make data-driven decisions when investing in AI-driven security tools.
Hack The Box

Hack The Box

7011 subscribers18 talks
Cyber Performance Center
Hack The Box provides a human-first platform creating and maintaining high performing cybersecurity individuals and organizations.

Related topics