Sabermetrics for Cyber: Collecting and Analyzing User Activity Data from Ephemeral Exercises

Authors

  • Jael Rivera Carnegie Mellon University, Pittsburgh, United States
  • Jarrett Booz Carnegie Mellon University, Pittsburgh, United States
  • Josh Hammerstein Carnegie Mellon University, Pittsburgh, United States

DOI:

https://doi.org/10.34190/eccws.24.1.3354

Keywords:

cybersecurity exercises, cyber workforce development, cybersecurity training, data collection, data analysis, performance analytics

Abstract

The term sabermetrics was coined in the 1970s by members of the Society for American Baseball Research (SABR) to describe how baseball teams use advanced analytics to evaluate talent and maximize performance both offensively and defensively. Sabermetrics transformed professional baseball through its data-driven approach, enabling teams to devise new tactics and strategies for improving individual and overall team performance. The concept of sabermetrics or advanced analytics can also be applied to the cybersecurity domain to improve performance, both offensively and defensively, and to better evaluate talent. To do this, data is needed. Cybersecurity exercises are well suited for providing this data because they are designed to develop critical technical skills in controlled, simulated environments that closely mirror real-world threats. However, preserving data for ephemeral cybersecurity exercises can be challenging because these environments are temporary, and when they are torn down, log data is lost unless deliberate actions are taken to retain the data for future use. This includes all information regarding the actions participants took in the exercise. x`Recognizing that important information can be gleaned by analyzing this data, the Software Engineering Institute (SEI) at Carnegie Mellon University developed a capability to capture a high-fidelity record of user activities during cybersecurity exercises. This paper discusses the motivation behind this development, the insights that can be gained from the collected data, and how the SEI configures exercises used in cybersecurity competitions to collect and store user activity data for future detailed analysis.

Author Biographies

Jael Rivera, Carnegie Mellon University, Pittsburgh, United States

Jael Rivera is a cybersecurity engineer at Carnegie Mellon University Software Engineering Institute, where he has three years of experience in developing cybersecurity exercises and managing cloud infrastructure. Jael is currently pursuing a master’s degree in Information Security and Assurance at CMU.

Jarrett Booz, Carnegie Mellon University, Pittsburgh, United States

Jarrett Booz has been a cybersecurity engineer with the Carnegie Mellon University Software Engineering Institute since graduating from Carnegie Mellon with a master’s degree in information security. Jarrett has experience in cybersecurity exercise development and cybersecurity infrastructure.

Josh Hammerstein, Carnegie Mellon University, Pittsburgh, United States

Josh Hammerstein is a Technical Manager at the Carnegie Mellon University Software Engineering Institute. His team focuses on improving the readiness of cybersecurity practitioners. Josh holds a Master of Science in Information Security Policy and Management and a Bachelor of Science in Information and Decision Systems from Carnegie Mellon.

Downloads

Published

2025-06-25