Elpub : Digital Library : Works

Paper 331elpub2004:
A web-based user-profile generator: foundation for a recommender and expert finding system

id 331elpub2004
authors Petr Grolmus, Jiri Hynek and Karel Jezek
year 2004
title A web-based user-profile generator: foundation for a recommender and expert finding system
source ELPUB2004. Building Digital Bridges: Linking Cultures, Commerce and Science: Proceedings of the 8th ICCC/IFIP International Conference on Electronic Publishing held in Brasília - DF, Brazil 23-26 June 2004 / Edited by: Jan Engelen, Sely M. S. Costa, Ana Cristina S. Moreira. Universidade de Brasília, 2004
summary The objective of our research is to create a universal tool for recommending non-visited interesting web pages as well as experts working in the same field of specialty. We accentuate practical adaptability of user profiles. User profiles are generated on the basis of Suffix Tree Clustering (STC) algorithm, which is similar to creating an inverted list of phrases occurring in a document collection. We are computing similarity of characteristic phrases identified by STC in order to find clusters of phrases. Phrases linked by similarity relationships form a phrase association graph. Clusters of phrases generated by our tool define interests of each user. We have tested the system by means of various document collections, such as Reuters Corpus Volume One – RCV1, 20Newsgroups, CTK – Czech Press Agency and Reuters-21578. Experimental results based on our extensive simulations as well as real-life environment are presented in the paper. Precision of our recommender system is 85 to 95 %.
keywords Text mining; user profile; recommender system; expert search; clustering; suffix tree; phrase search; characteristic phrase; similarity; packet filter
series ELPUB:2004
type full paper
email indy@civ.zcu.cz
content file.pdf (232,185 bytes)
discussion No discussions. Post discussion ...
urn:nbn urn:nbn:se:elpub-331elpub2004
last changed 2004/06/20 17:41
HOMELOGIN (you are user _anon_185898 from group guest) Powered by SciX Open Publishing Services 1.002