A Machine Learning-Based Intrusion Detection Algorithm for Securing Bioinformatics Pipelines
DOI:
https://doi.org/10.34190/iccws.20.1.3363Keywords:
Machine learning, Intrusion detection, Algorithms, Cyber-BiosecurityAbstract
Bioinformatics pipelines, which process vast amounts of sensitive biological data, are increasingly targeted by cyberattacks. Traditional security measures often fail to provide adequate protection due to the unique computational and network characteristics of these pipelines. This study proposes a machine learning-based Intrusion Detection System (IDS) tailored specifically for bioinformatics workflows. While the CICIDS2017 dataset serves as the primary benchmark, we augment the study with bioinformatics-specific network traffic to ensure relevance. We compare the performance of four machine learning algorithms Random Forest (RF), Support Vector Machine (SVM), Convolutional Neural Network (CNN), and Gradient Boosting Machine (GBM) and explore hybrid models for enhanced detection. Our findings highlight GBM's superior accuracy (98.3%) while also addressing its computational overhead and susceptibility to adversarial attacks. The study contributes novel insights by integrating real-world bioinformatics traffic data and proposing adaptive security strategies for genomic research environments.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Jude Osamor, Aliyu Yisa, Febisola Olanipekun, Omotolani Olowosule, Samuel Akerele, Onyekachi Anyalechi, Simbiat Sadiq, Iretioluwa Akerele, Xavier Palmer, Michaela Barnett

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.