e-Science logo Nesc logo
 
 
About NeSC
e-Science Institute
e-Science Hub
TOE
Contacts
e-Science Events
Resources
Newsroom
Presentations & Lectures
Technical Papers
Global Grid Links
Projects
UK e-Science Centres
UK e-Science Teams
Career Opportunities
Bibliographic Database
 

 

Paper ID: 1721

Retrieving hierarchical text structure from typeset scientific articles
Bill,Hollingsworth Ian,Lewin Dan,Tidhar

Appeared in: Proceedings of the UK e-Science All Hands Conference 2005 website: http://www.allhands.org.uk/2005/
Page Numbers:
Publisher: Engineering and Physical Sciences Research Council
Year: 2005
ISBN/ISSN: 1-904425-53-4
Contributing Organisation(s):
Field of Science: e-Science

URL: http://www.allhands.org.uk/2005/proceedings/papers/515.pdf

Abstract: We discuss a processing framework (PTX) for scientific documents guided by two main principles. First, most published material is available in PDF, a layout or document appearance format. For text processing, the hierarchical structure of the text is required. Secondly, most likely users of scientific text processing will be scientists exploring literature within a particular specialism. Consequently, the framework should exploit specialism-specific knowledge. We explain and justify our framework, and demonstrate high accuracy using standard evaluation measures (precision, recall, f-measure).

Keywords: e-Science, AHM 2005


BIB DOC HTM HTML PDF PPT PS RTF TEX TXT ZIP




 

Last Updated: 22 Jun 12 11:02
This is an archived website, preserved and hosted by the School of Physics and Astronomy at the University of Edinburgh. The School of Physics and Astronomy takes no responsibility for the content, accuracy or freshness of this website. Please email webmaster [at] ph [dot] ed [dot] ac [dot] uk for enquiries about this archive.