An Efficient Fuzzy Clustering Algorithm for Mining User Session Clusters on Web Log Data

  • Moksud Alam Mallik International Islamic University Malaysia, Kuala Lumpur
  • Nurul Fariza Zulkurnain VNR Vignana Jyothi Institute of Engineering & Technology, Hyderabad
Keywords: Data Mining, Web usage mining (WUM), Data Preprocessing, Fuzzy Clustering

Abstract

Data mining is extremely vital to get important
information from the web. Additionally, web usage
mining (WUM) is essential for companies. WUM
permits organizations to create rich information related
to the eventual fate of their commercial capacity. The
utilization of data that is assembled by Web Usage
Mining gives the organizations the capacity to deliver
results more compelling to their organizations and
expanding of sales. Client access patterns can be mined
from web access log information using Web Usage
Mining (WUM) techniques. Because there are so many
end-user sessions and URL resources, the size of web
user session data is enormous. Human communications
and non-deterministic browsing patterns increment
equivocalness and dubiousness of client session
information. The fuzzy set-based approach can solve
most of the challenges listed above. This paper proposes
an efficient Fuzzy Clustering algorithm for mining
client session clusters from web access log information
to find the groups of client profiles. In addition, the
methodologies to preprocess the net log data as well as
data cleanup client identification and session
identification are going to be mentioned. This
incorporates the strategy to do include choice (or
dimensionality decrease) and meeting weight task
assignments

References

Ansari, Z., Azeem, M. F., Babu, A. V., & Ahmed, W. (2015). A fuzzy approach for
feature evaluation and dimensionality reduction to improve the quality of web
usage mining results. arXiv preprint arXiv:1509.00690.
Ansari, Z., Azeem, M. F., Babu, A. V., & Ahmed, W. (2015). A fuzzy clustering based
approach for mining usage profiles from web log data. arXiv preprint
arXiv:1509.00693.
Ansari, Z., Babuy, A. V., Ahmed, W., & Azeemz, M. F. (2011, September). A fuzzy set
theoretic approach to discover user sessions from web navigational data.
In 2011 IEEE Recent Advances in Intelligent Computational Systems (pp. 879-884).
ieee.
Bezdek, J. C., Ehrlich, R., & Full, W. (1984). FCM: The fuzzy c-means clustering
algorithm. Computers & geosciences, 10(2-3), 191-203.
Castellano, G., Fanelli, A. M., & Torsello, M. A. (2006, September). Mining usage
profiles from access data using fuzzy clustering. In The 6th WSEAS international
conference on simulation, modelling and optimization, Portugal.
Castellano, G., Fanelli, A. M., Mencar, C., & Torsello, M. A. (2007, November).
Similarity-based fuzzy clustering for user profiling. In 2007 IEEE/WIC/ACM
International Conferences on Web Intelligence and Intelligent Agent TechnologyWorkshops (pp. 75-78). IEEE.
Castellano, G., Mesto, F., Minunno, M., & Torsello, M. A. (2007, July). Web user
profiling using fuzzy clustering. In International Workshop on Fuzzy Logic and
Applications (pp. 94-101). Springer, Berlin, Heidelberg.
Cooley, R., Mobasher, B., & Srivastava, J. (1997, November). Web mining: Information
and pattern discovery on the world wide web. In Proceedings ninth IEEE
international conference on tools with artificial intelligence (pp. 558-567). IEEE.
Han, J., Kamber, M., & Pei, J. (2012). Data Mining Concepts and Techniques 3rd
Edition Vol. 3rd.
Mallik, M. A., Zulkurnain, N. F., Nizamuddin, M. K., & Aboosalih, K. C. (2021,
February). An Efficient Fuzzy C-Least Median Clustering Algorithm. In IOP
Conference Series: Materials Science and Engineering (Vol. 1070, No. 1, p. 012050).
IOP Publishing.
Nasraoui, O., Frigui, H., Krishnapuram, R., & Joshi, A. (2000). Extracting web user
profiles using relational competitive fuzzy clustering. International journal on
artificial intelligence tools, 9(04), 509-526
Published
2022-06-20
How to Cite
[1]
M. Mallik and N. Zulkurnain, “An Efficient Fuzzy Clustering Algorithm for Mining User Session Clusters on Web Log Data”, INJIISCOM, vol. 2, no. 2, pp. 80-93, Jun. 2022.