Home Industry Microsoft Launches ExCyTIn-Bench: An Open-Source Benchmark for Real-World AI Cybersecurity Evaluation
Industry

Microsoft Launches ExCyTIn-Bench: An Open-Source Benchmark for Real-World AI Cybersecurity Evaluation

Share
Share

Microsoft has introduced ExCyTIn-Bench, an open-source benchmarking framework designed to evaluate how effectively AI systems perform real-world cybersecurity investigations. Unlike traditional benchmarks that focus on static knowledge or trivia, ExCyTIn-Bench simulates dynamic, multi-stage cyberattacks within a virtual Security Operations Center (SOC) in Microsoft Azure. Using 57 log tables from Microsoft Sentinel and related services, it mirrors the complexity and noise of genuine security incidents.

The tool helps CISOs and IT leaders assess how well AI models reason, adapt, and explain findings in realistic threat scenarios providing actionable insights into detection and response capabilities. Microsoft also uses ExCyTIn internally to enhance its own AI-powered security tools, including Security Copilot, Sentinel, and Defender.

ExCyTIn-Bench’s key innovations include fine-grained, transparent scoring metrics, realistic investigative workflows, and extensibility for custom benchmarks. Early results show GPT-5 achieving a 56.2% average reward, outperforming earlier models and highlighting the importance of advanced reasoning in cyber defense. Open-source models are also closing performance gaps, making sophisticated security automation more accessible.

Through ExCyTIn-Bench, Microsoft aims to accelerate global collaboration, improve trust in AI-driven cybersecurity, and foster innovation in automated threat investigation.

Share

Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Related Articles

Garmin introduces OnBoard System for wireless Man Overboard detection

Garmin announced the launch of Garmin OnBoard, a new wireless Man Overboard...

Philips Opens New Global Headquarters in Amsterdam, Marking a New Era in Health Technology Innovation

Royal Philips has officially opened its new global headquarters in Amsterdam, inaugurated...

New Bosch subsidiary develops solution for identifying fake products

Bosch Secure Authentication GmbH, a new subsidiary of Robert Bosch GmbH, has...

Dell Named a Leader in 2025 Gartner Magic Quadrant for Enterprise Storage Platforms

Dell Technologies has been recognized as a Leader in the 2025 Gartner®...