e-Science logo Nesc logo
 
 
About NeSC
e-Science Institute
e-Science Hub
TOE
Contacts
e-Science Events
Resources
Newsroom
Presentations & Lectures
Technical Papers
Global Grid Links
Projects
UK e-Science Centres
UK e-Science Teams
Career Opportunities
Bibliographic Database
 

 

Paper ID: 1548

An Architecture for Language Processing for Scientific Texts
Ann,Copestake Peter,Corbett Peter,Murray-Rust CJ,Rupp Advaith,Siddharthan Simone,Teufel Ben,Waldron

Appeared in: Proceedings of the UK e-Science All Hands Conference 2006 website: http://www.allhands.org.uk/2006/
Page Numbers:614 - 622
Publisher: National e-Science Centre
Year: 2006
ISBN/ISSN: 0-9553988-0-0
Contributing Organisation(s):
Field of Science: e-Science

URL: http://www.allhands.org.uk/2006/proceedings/papers/689.pdf

Abstract: We describe the architecture for language processing adopted on the eScience project `Extracting the Science from Scientific Publications' (nicknamed SciBorg). In this approach, papers from different sources are first processed to give a common XML format (SciXML). Language processing modules operate on the SciXML in an architecture that allows for (partially) parallel deep and shallow processing and for a flexible combination of domain-independent and domain-dependent techniques. Robust Minimal Recursion Semantics (RMRS) acts both as a language for representing the output of processing and as an integration language for combining different modules. Language processing produces RMRS markup represented as standoff annotation on the original SciXML. Information extraction (IE) of various types is defined as operating on RMRSs. Rhetorical analysis of the texts also partially depends on IE-like patterns and supports novel methods of information access.

Keywords: e-Science, AHM 2006


BIB DOC HTM HTML PDF PPT PS RTF TEX TXT ZIP




 

Last Updated: 22 Jun 12 11:02
This is an archived website, preserved and hosted by the School of Physics and Astronomy at the University of Edinburgh. The School of Physics and Astronomy takes no responsibility for the content, accuracy or freshness of this website. Please email webmaster [at] ph [dot] ed [dot] ac [dot] uk for enquiries about this archive.