NeSC Bibliographic Database
Authors: Khurshid Ahmad, Lee Gillam, David Cheng
Appeared in: Proceedings of the UK e-Science All Hands Conference 2005
Website: http://www.allhands.org.uk/2005/
Publisher: Engineering and Physical Sciences Research Council
Field of Science: e-Science
Abstract: A grid implementation is described that can deal with large volumes of streaming free natural language text in conjunction with large sets of time series data. Processing speed-ups on a cluster of 24 machines (81 CPUs) are reported for texts in excess of 100 million words. The application area is econometrics, specifically the behaviour of financial markets, and the methodology reported can extend the scope of Surrey's Society Grid to the strategically important areas of crime science and social anthropology. The data and compute requirements identified in the three areas compare well with the traditional concerns in grid computing. Our studies indicate problems of scalability, especially when dealing with multi-modal data (texts and numbers).
Keywords: e-Science, AHM 2005
Last Updated: 22 Jun 12 11:02