| id |
030_elpub2008 |
| authors |
Tonkin, Emma; Muller, Henk L |
| year |
2008 |
| title |
Keyword and metadata extraction from pre-prints |
| source |
ELPUB2008. Open Scholarship: Authority, Community, and Sustainability in the Age of Web 2.0 - Proceedings of the 12th International Conference on Electronic Publishing held in Toronto, Canada 25-27 June 2008 / Edited by: Leslie Chan and Susanna Mornati. ISBN 978-0-7727-6315-0, 2008, pp. 30-44 |
| summary |
In this paper we study how to provide metadata for a pre-print archive. Metadata includes, but is not limited to, title, authors, citations, and keywords, and is used to both present data to the user in a meaningful way, and to index and cross-reference the pre-prints. We are particularly interested in studying different methods to obtain metadata for a pre-print. We have developed a system that automatically extracts metadata, and that allows the user to verify and correct metadata before it is accepted by the system. |
| keywords |
metadata extraction; Dublin Core; user evaluation; Bayesian classification |
| series |
ELPUB:2008 |
| type |
normal paper |
| email |
henkm@cs.bris.ac.uk |
| content |
file.pdf (293,263 bytes) |
| discussion |
1 discussions
|
| ratings |
Ratings: 4
5
|
| urn:nbn |
urn:nbn:se:elpub-030_elpub2008 |
| last changed |
2008/08/03 05:39 |