A Comparative Analysis of Undersampling Techniques for Network Intrusion Detection Systems Design

Bruno Riccelli Silva; Ricardo Jardel Silveira; Manuel Gonçalves da Silva Neto; Paulo Cesar Cortez; Danielo Gonçalves Gomes

doi:10.14209/jcis.2021.3

A Comparative Analysis of Undersampling Techniques for Network Intrusion Detection Systems Design

Authors

Bruno Riccelli Silva Federal University of Ceará
Ricardo Jardel Silveira Federal University of Ceara (UFC)
Manuel Gonçalves da Silva Neto Federal University of Ceara (UFC)
Paulo Cesar Cortez Federal University of Ceara (UFC)
Danielo Gonçalves Gomes Federal University of Ceara (UFC)

DOI:

https://doi.org/10.14209/jcis.2021.3

Abstract

Intrusion Detection Systems (IDS) figure as one of the leading solutions adopted in the network security area to prevent intrusions and ensure data and services security. However, this issue requires IDS to be assertive and efficient processing time. Undersampling techniques allow classifiers to be evaluated from smaller subsets in a representative manner, aiming high assertive metrics in less processing time. There are several solutions in literature for IDS projects, but some criteria are not respected, such as the adoption of a replicable methodology. In this work, we selected three undersampling methodologies: random, Cluster centroids, and NearMiss in two novel unbalanced datasets (CIC2017 and CIC2018) for comparison between five classifiers using cross-validation and Wilcoxon statistical test. Our main contribution is a systematic and replicable methodology for using subsampling techniques to balance the data sets adopted in the IDS project. We choose three metrics for classifier's choice in an IDS design: accuracy, f1-measure, and processing time. The results indicate that the under-sampling by Cluster centroids presents the best performance when applied to distance-based classifiers. Moreover, under-sampling techniques influence the process of choosing the best classifier in the design of an IDS.

Downloads

Download data is not yet available.

Downloads

Published

2021-02-18

How to Cite

Silva, B. R., Silveira, R. J., Silva Neto, M. G. da, Cortez, P. C., & Gomes, D. G. (2021). A Comparative Analysis of Undersampling Techniques for Network Intrusion Detection Systems Design. Journal of Communication and Information Systems, 36(1), 31–43. https://doi.org/10.14209/jcis.2021.3

Download Citation

Issue

Vol. 36 No. 1 (2021)

Section

Regular Papers

License

Authors who publish in this journal agree to the following terms:

Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a CC BY-NC 4.0 (Attribution-NonCommercial 4.0 International) that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
Authors can enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) before and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).

___________

Received 2020-07-22
Accepted 2021-01-15
Published 2021-02-18

A Comparative Analysis of Undersampling Techniques for Network Intrusion Detection Systems Design

Authors

DOI:

Abstract

Downloads

Downloads

Published

How to Cite

Issue

Section

License

Make a Submission

Keywords

Information