Speaker
Affilliation
Linguamatics Ltd., Cambridge
Abstract
Interactive Information Extraction brings together search and information extraction to provide fast, interactive text mining over large volumes of text such as Medline abstracts, full text scientific articles, patents etc. As well as covering the two ends of the spectrum: keyword search over documents, and detailed linguistic patterns within sentences, Linguamatics' I2E also covers the points in between such as keywords within the same sentence, or co-occurrence of biological entities within sentences or documents. In this talk I will describe how I2E is being used in the life sciences, the use of ontologies within the system, and how statistical and linguistic processing can be combined to provide high quality results. I will also show how information discovered in different documents can be combined to discover new, long-distance relationships.