Keyword and metadata extraction from pre-prints
Authors: Tonkin, Emma; Muller, Henk L
Source: ELPUB2008. Open Scholarship: Authority, Community, and Sustainability in the Age of Web 2.0. Proceedings of the 12th International Conference on Electronic Publishing, held in Toronto, Canada, 25-27 June 2008. Edited by Leslie Chan and Susanna Mornati. ISBN 978-0-7727-6315-0, 2008, pp. 30-44
Abstract: In this paper we study how to provide metadata for a pre-print archive. Metadata includes, but is not limited to, title, authors, citations, and keywords; it is used both to present data to the user in a meaningful way and to index and cross-reference the pre-prints. We are particularly interested in studying different methods of obtaining metadata for a pre-print. We have developed a system that automatically extracts metadata and allows the user to verify and correct it before it is accepted by the system.
Keywords: metadata extraction; Dublin Core; user evaluation; Bayesian classification