Making Use of NXt to Nothing: The Effect of Class Imbalances on DGA Detection Classifiers

Drichel, Arthur; Meyer, Ulrike; Schüppen, Samuel; Teubert, Dominik

doi:10.1145/3407023.3409190

arXiv:2007.00300 (cs)

[Submitted on 1 Jul 2020]

Abstract: Numerous machine learning classifiers have been proposed for the binary classification of domain names as either benign or malicious, and even for multiclass classification to identify the domain generation algorithm (DGA) that generated a specific domain name. Both classification tasks have to deal with a class imbalance problem: the number of training samples varies strongly per DGA. Currently, it is unclear whether including DGAs for which only a few samples are known in the training sets is beneficial or harmful to the overall performance of the classifiers. In this paper, we perform a comprehensive analysis of various contextless DGA classifiers, which reveals the high value of a few training samples per class for both classification tasks. We demonstrate that, by including the underrepresented classes, the classifiers are able to detect with high probability various DGAs that were previously hardly recognizable. Simultaneously, we show that the classifiers' detection capabilities for well-represented classes do not decrease.

Comments:	Accepted at The 15th International Conference on Availability, Reliability and Security (ARES 2020)
Subjects:	Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:2007.00300 [cs.CR]
	(or arXiv:2007.00300v1 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2007.00300
Journal reference:	In The 15th International Conference on Availability, Reliability and Security (ARES 2020), ACM, 9 pages
Related DOI:	https://doi.org/10.1145/3407023.3409190

Submission history

From: Arthur Drichel
[v1] Wed, 1 Jul 2020 07:51:12 UTC (828 KB)
