Predicting Sabotaged Open-source Libraries

Alexander Petty; William Glisson; Ryan Benton

doi:10.34190/eccws.25.1.4601

Authors

Alexander Petty Louisiana Tech University
William Glisson Louisiana Tech University
Ryan Benton University of South Alabama

DOI:

https://doi.org/10.34190/eccws.25.1.4601

Keywords:

Vulnerabilities, Supply Chain Attack, Open Source, Machine Learning, Malicious Code

Abstract

Open-source software provides free and publicly available software maintained by the open-source community. The variety of contributors creates an environment conducive to the intentional and unintentional introduction of software bugs by participating organizations. Enemy nation-states and independent hackers can exploit these attack vectors to gain access to industry and government systems. Repositories of known vulnerabilities and tools to check vulnerable versions and analyze code exist, but realistically, reviewers can miss issues within many repositories due to constant updates and technological advances. Hence, this research investigates an alternative, non-code-based method for identifying high-risk repositories using repository metadata and commit history, which, when coupled with machine learning, enables us to identify at-risk repositories at rates above 60%. This was achieved using a dataset composed of 41,710 repositories. The contribution of this research is twofold. First, it presents an empirical evaluation of the viability of a non-code-based analysis approach to detecting high-risk, i.e., potentially compromised code repositories. Second, it provides foundational research for non-code-based filtering of open-source repositories, potentially accelerating software investigations and reducing resource requirements.

Author Biographies

Alexander Petty, Louisiana Tech University

Alexander Petty is a Cyber Engineering graduate (minoring in Mathematics) who earned a Master’s degree in Computer Science/Data Science from Louisiana Tech University.

William Glisson, Louisiana Tech University

Dr. William Bradley Glisson is a Professor of Computer Science, Entergy Corporation LP&L/NOPSI Endowed Professor, and Director of the Center for Secure Cyberspace at Louisiana Tech University. His research focuses on digital forensics, information assurance, software engineering, and applied computing science. He can be contacted at glisson@latech.edu.

Ryan Benton, University of South Alabama

Dr. Ryan Benton is a professor of computer science at the University of South Alabama. He received his PhD in computer science from the University of Louisiana at Lafayette in 2001. He conducts research in data mining, with emphasis in pattern mining and applications in cybersecurity and medicine/health.

Predicting Sabotaged Open-source Libraries

Authors

DOI:

Keywords:

Abstract

Author Biographies

Alexander Petty, Louisiana Tech University

William Glisson, Louisiana Tech University

Ryan Benton, University of South Alabama

Downloads

Published

Issue

Section

License

Current Issue

Information