Elpub : Digital Library : Works

Paper 153_elpub2007:
Towards an Ontology of ElPub/SciX: A Proposal

id 153_elpub2007
authors Costa, Sely M.S.; Gottschalg-Duque, Claudio
year 2007
title Towards an Ontology of ElPub/SciX: A Proposal
source ELPUB2007. Openness in Digital Publishing: Awareness, Discovery and Access - Proceedings of the 11th International Conference on Electronic Publishing held in Vienna, Austria 13-15 June 2007 / Edited by: Leslie Chan and Bob Martens. ISBN 978-3-85437-292-9, 2007, pp. 249-256
summary A proposal is presented for a standard ontology language defined as ElPub/SciX Ontology, based on the content of a web digital library of conference proceedings. This content, i.e., ElPub/SciX documents, aims to provide access to papers presented at the total editions of the International Conference in Electronic Publishing (ElPub). After completing its 10th years in 2006, ElPub/SciX is now a comprehensive repository with over 400 papers. Previous work has been used as a basis to build up the ontology described here. It has been presented at Elpub2004 and it dealt with an Information Retrieval System using Computational Linguistics (SiRILiCo). ElPub/SciX ontology constitutes a lightweight ontology (classes and just some instances) and is the result of two basic procedures. The first one is a syntactic analysis carried out through the Syntactic Parser-VISL. This free tool, based on lingsoft's ENGCG parser, is made available through the Visual Interactive Syntactic Learning, a research and development project at the University of Southern Denmark, Institute of Language and Communication (ISK). The second one, carried out after that, is a semantic analysis (concept extraction) conducted through GeraOnto, an acronym that stands for “generating an ontology”, which extracts the concepts needed in order to build up the ontology. The program has been developed by Gottschalg-Duque, in 2005, in Brazil. The ensuing ontology is then edited via Protégé, a free, open source ontology editor. The motivation to carry out the work reported here came from problems faced during the preparation of a paper to Elpub2006, which aimed to present data about a number of aspects regarding the ElPub/SciX collection. While searching the collection, problems with the lack of standardization of authors and institutions names and the non-existence of any control of keywords had been identified. Such problems seem to be related to an apparent absence of “paper preparation” before entering into the SciX database. Lack of preparation, in turn, has brought about the desire of finding a solution, which is expected to support the work of those interested in searching the collection to retrieve information. ElPub/SciX ontology, therefore, is seen as that helping solution to support ElPub information retrieval.
keywords ontology; Elpub conferences; information retrieval
series ELPUB:2007
type normal paper
email selmar@unb.br
more http://info.tuwien.ac.at/elpub2007/presentations/153.ppt
content file.pdf (1,363,924 bytes)
discussion No discussions. Post discussion ...
ratings
urn:nbn urn:nbn:se:elpub-153_elpub2007
last changed 2007/06/14 16:16
HOMELOGIN (you are user _anon_864559 from group guest) Powered by SciX Open Publishing Services 1.002