Impact of Feature Selection Methods on the Classification of DDoS Attacks using XGBoost

Pedro Henrique Hauy Netto de Araujo; Anderson Silva; Norisvaldo Ferraz Junior; Fabio Cabrini; Alessandro Santiago; Adilson Guelfi; Sergio Kofuji

doi:10.14209/jcis.2021.22

Impact of Feature Selection Methods on the Classification of DDoS Attacks using XGBoost

Authors

Pedro Henrique Hauy Netto de Araujo Universidade de São Paulo
Anderson Silva
Norisvaldo Ferraz Junior
Fabio Cabrini
Alessandro Santiago
Adilson Guelfi
Sergio Kofuji

DOI:

https://doi.org/10.14209/jcis.2021.22

Abstract

Distributed Denial of Service (DDoS) attacks impose a major challenge for today's security systems, given the variety of implementations and the scale they can achieve. One approach for their early detection is the use of Machine Learning (ML) techniques, which create rules for classifying traffic from historical data. However, different types of data contribute unequally to the assertiveness of the trained model. The use of Feature Selection (FS) techniques as a pre-processing step allows identification of the most relevant features for the problem in question. This action reduces training time and can even improve performance when noisy variables are eliminated. The current work is based on a public dataset and the XGBoost algorithm to measure the impact of FS techniques on the DDoS attack classification problem. We consider both techniques independent of the sample labels, as well as methods that use this information to rank the variables in order of importance. We analyzed the problem from the point of view of Binary and Multiclass classification. We also created a benchmark of classification metrics and execution times. Our comparisons involved the Accuracy, Precision, Recall, and F1 Score metrics for different FS methods, in addition to training and execution time. In the results it is possible to verify both for the Binary (30% reduction of the features) and Multiclass classifiers (40% reduction of the features), that the ANOVA method showed as the most advantageous.

Downloads

Download data is not yet available.

Downloads

Published

2021-12-28

How to Cite

Hauy Netto de Araujo, P. H., Silva, A., Ferraz Junior, N., Cabrini, F., Santiago, A., Guelfi, A., & Kofuji, S. (2021). Impact of Feature Selection Methods on the Classification of DDoS Attacks using XGBoost. Journal of Communication and Information Systems, 36(1), 200–214. https://doi.org/10.14209/jcis.2021.22

Download Citation

Issue

Vol. 36 No. 1 (2021)

Section

Regular Papers

License

Authors who publish in this journal agree to the following terms:

Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a CC BY-NC 4.0 (Attribution-NonCommercial 4.0 International) that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
Authors can enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) before and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).

___________

Received 2021-06-14
Accepted 2021-12-26
Published 2021-12-28

Impact of Feature Selection Methods on the Classification of DDoS Attacks using XGBoost

Authors

DOI:

Abstract

Downloads

Downloads

Published

How to Cite

Issue

Section

License

Make a Submission

Keywords

Information