Impact of Feature Selection Methods on the Classification of DDoS Attacks using XGBoost

  • Pedro Henrique Hauy Netto de Araujo Universidade de São Paulo
  • Anderson Silva
  • Norisvaldo Ferraz Junior
  • Fabio Cabrini
  • Alessandro Santiago
  • Adilson Guelfi
  • Sergio Kofuji


Distributed Denial of Service (DDoS) attacks impose a major challenge for today's security systems, given the variety of implementations and the scale they can achieve. One approach for their early detection is the use of Machine Learning (ML) techniques, which create rules for classifying traffic from historical data. However, different types of data contribute unequally to the assertiveness of the trained model. The use of Feature Selection (FS) techniques as a pre-processing step allows identification of the most relevant features for the problem in question. This action reduces training time and can even improve performance when noisy variables are eliminated. The current work is based on a public dataset and the XGBoost algorithm to measure the impact of FS techniques on the DDoS attack classification problem. We consider both techniques independent of the sample labels, as well as methods that use this information to rank the variables in order of importance. We analyzed the problem from the point of view of Binary and Multiclass classification. We also created a benchmark of classification metrics and execution times. Our comparisons involved the Accuracy, Precision, Recall, and F1 Score metrics for different FS methods, in addition to training and execution time. In the results it is possible to verify both for the Binary (30% reduction of the features) and Multiclass classifiers (40% reduction of the features), that the ANOVA method showed as the most advantageous.

How to Cite
Hauy Netto de Araujo, P. H., Silva, A., Ferraz Junior, N., Cabrini, F., Santiago, A., Guelfi, A., & Kofuji, S. (2021). Impact of Feature Selection Methods on the Classification of DDoS Attacks using XGBoost. Journal of Communication and Information Systems, 36(1), 200-214.
Regular Papers