XML Based Technologies for Advanced Publishing Services: the DiVA Publishing System Under the Hood
||Uwe Klosa, Erik Siira, Stefan Andersson
||XML Based Technologies for Advanced Publishing Services: the DiVA Publishing System Under the Hood
||ELPUB2005. From Author to Reader: Challenges for the Digital Content Chain: Proceedings of the 9th ICCC International Conference on ElectronicPublishing held at Katholieke Universiteit Leuven in Leuven-Heverlee(Belgium), 8-10 June 2005 / Edited by: Milena Dobreva & Jan Engelen, ed. byPeeters Publishing Leuven, ISBN 90-429-1645-1, 2005
||Within DiVA (Digital Scientific Archive) XML is used for storage and archiving, for inter-application-communication, for creation of web pages, and for the export of metadata. Many parts of the publication work flow (print and electronic), most parts of the archiving work flow, and the metadata work flow are supported by the use of XML. All metadata are stored in XML, more specifically in XML files with UTF-8 encoding. The format used in the XML files is defined by an XML schema that was developed by the Electronic Publishing Centre (EPC) at Uppsala University Library. This format, the DiVA Document Format (DDF), was created to achieve the storage of the full-text content – not including multimedia like images, films and sounds – and its metadata in a single XML file. Another purpose was to obtain a rich format, which would meet our requirements, such as holding sufficient metadata and the content of a document, and which would be compatible with most standard formats or de-facto standard formats. Furthermore, this format had to be extensible, and usable within an archiving work flow.This poster will show how XML is produced from the full-text content and with the services that are built and are planned to be built for both document authors and users of the DiVA web pages.In order to create an XML format from full-text files, the authors have to use templates in the word processor of their choice. It is important that these templates can easily be mapped to the required XML format. This can involve the necessity of a transformation of a proprietary XML format, exported from word processors, to the required format. In the DiVA system this is done from the Open Office XML to DDF.An XML format supporting both metadata and full-text content has several benefits for the printing work flow, the archiving work flow and the dissemination of publications and its metadata. The printing work flow can be supported by the just in time creation of title or abstract pages. The end user, who wants to get information about a publication, can be provided with all metadata. In addition, the user can be offered important information like table of contents or reference lists directly on the web page, without any need to open the often large full-text files.
||file.pdf (38,664 bytes)
Post discussion ...
These pages are best viewed with any standards compliant browser (e.g. Mozilla).