Malicious Web Sites Detection using C4.5 Decision Tree
Abstract
The technology advancement poses the challenge to the cybercriminals for doing various online criminal acts, such as identity theft, extortion of money or simply, viruses and worms spreading. The common aim of the online criminals is to attract visitors to the Web site, which can be easily accessed by clicking on the URL. Blacklisting seems not to be the successful way of marking Web sites with the “bad” content, considering that many malicious Web sites are not blacklisted. The aim of this paper is to evaluate the ability of C4.5 decision tree classifier in detecting malicious Web sites, based on the features that characterize URLs. The classifier is evaluated through several performance evaluation criteria, namely accuracy, sensitivity, specificity and area under the ROC curve. C4.5 decision tree classifier achieved significant success in malicious Web sites detection, considering all four criteria (accuracy 96.5, sensitivity 96.4, specificity 96.5 and area under the curve 0.958).
Keywords
Malicious Web Sites;Blacklisting;URL;C4.5 Decision Tree
Full Text:
PDFDOI: http://dx.doi.org/10.21533/scjournal.v5i1.109
Refbacks
- There are currently no refbacks.
Copyright (c) 2016 Zerina Mašetic, Abdulhamit Subasi, Jasmin Azemovic
ISSN 2233 -1859
Digital Object Identifier DOI: 10.21533/scjournal
This work is licensed under a Creative Commons Attribution 4.0 International License