Paper ID: 1539

Flexible Interfaces in the Application of Language Technology to an eScience Corpus
C J Rupp, Ann Copestake, Simone Teufel, Benjamin Waldron

Appeared in: Proceedings of the UK e-Science All Hands Conference 2006 (website: http://www.allhands.org.uk/2006/)
Page Numbers: 622-630
Publisher: National e-Science Centre
Year: 2006
ISBN/ISSN: 0-9553988-0-0
Contributing Organisation(s):
Field of Science: e-Science

URL: http://www.allhands.org.uk/2006/proceedings/papers/678.pdf

Abstract: We describe two key interfaces used in an architecture for applying a range of Language Technology tools to a corpus of Chemistry research papers, in order to provide a basis of robust linguistic analyses for Information Extraction tasks. This architecture is employed in the context of the eScience project 'Extracting the Science from Scientific Publications' (a.k.a. SciBorg). The interfaces in question are the common representation for the papers, which are delivered in a range of formats, and the coding of various types of linguistic information as standoff annotation. While both of these interfaces are coded in XML, their structure and usage are quite distinct. However, they are employed at the main convergence points in the system architecture. What they share is the ability to represent information from diverse origins in a uniform manner. We emphasise this degree of flexibility in our description of the interface structures and the design decisions that led to these definitions.

Keywords: e-Science, AHM 2006


Last Updated: 22 Jun 12 11:02