Phishing Website Detection Using Several Machine Learning Algorithms: A Review Paper
Abstract
Phishing is one of the major web-based social engineering attacks, which has led to demand for better ways to predict and stop such attacks in commercial environments. This paper surveys the research done in the field and analyses the next steps forward. It first examines how suitable features are selected, from manual selection to Genetic Algorithms and boosting methods such as AdaBoost and MultiBoost. It then looks at the classifiers in use, where Neural Networks and Ensemble algorithms were prominent alongside some novel approaches. These findings are then combined into a framework for cloud-based and client-based phishing website detection, alongside suggestions for future research and experiments that could help progress the field.