Elpub : Digital Library : Works

Paper 030_elpub2008:
Keyword and metadata extraction from pre-prints

id 030_elpub2008
authors Tonkin, Emma; Muller, Henk L
year 2008
title Keyword and metadata extraction from pre-prints
source ELPUB2008. Open Scholarship: Authority, Community, and Sustainability in the Age of Web 2.0 - Proceedings of the 12th International Conference on Electronic Publishing held in Toronto, Canada 25-27 June 2008 / Edited by: Leslie Chan and Susanna Mornati. ISBN 978-0-7727-6315-0, 2008, pp. 30-44
summary In this paper we study how to provide metadata for a pre-print archive. Metadata includes, but is not limited to, title, authors, citations, and keywords, and is used to both present data to the user in a meaningful way, and to index and cross-reference the pre-prints. We are particularly interested in studying different methods to obtain metadata for a pre-print. We have developed a system that automatically extracts metadata, and that allows the user to verify and correct metadata before it is accepted by the system.
keywords metadata extraction; Dublin Core; user evaluation; Bayesian classification
series ELPUB:2008
type normal paper
email henkm@cs.bris.ac.uk
content file.pdf (293,263 bytes)
discussion 1 discussions
ratings Ratings: 4 5
urn:nbn urn:nbn:se:elpub-030_elpub2008
last changed 2008/08/03 05:39
HOMELOGIN (you are user _anon_993100 from group guest) Powered by SciX Open Publishing Services 1.002