Documents Categorization in Multilingual Environment
||Karel Jezek, Michal Toman
||Documents Categorization in Multilingual Environment
||ELPUB2005. From Author to Reader: Challenges for the Digital Content Chain: Proceedings of the 9th ICCC International Conference on ElectronicPublishing held at Katholieke Universiteit Leuven in Leuven-Heverlee(Belgium), 8-10 June 2005 / Edited by: Milena Dobreva & Jan Engelen, ed. byPeeters Publishing Leuven, ISBN 90-429-1645-1, 2005
||This paper deals with various methods for multilingual document categorization and informs about the results of experiments in which EuroWordNet (EWN) plays the central role and serves as a fundamental problem solving tool. We describe both the algorithmic principles and the methodologies used in our classification system and consequently prove their functionality by experimental results. The aim of experiments was to verify the impact of multilingual collection on the quality of categorization and also find how thesaurus can be used to improve the classification and how the use of multilingual thesaurus can generalize monolingual version of categorization.
||file.pdf (633,331 bytes)
Post discussion ...
These pages are best viewed with any standards compliant browser (e.g. Mozilla).