Speaker
Affilliation
Lexicography MasterClass Ltd
Abstract
This is an exciting time for our understanding of language. Linguists are becoming familiar with corpora, and so the possibilities they offer are now beginning to open up. Language-processing tools like part-of-speech taggers are also now reaching a level of maturity, so we can work with corpora that handle lemmas and grammar, and potentially more, as well as simple word forms. In this talk I will sketch out the empiricist programme, illustrating it with 'word sketches', one-page summaries of a word's grammatical and collocational behaviour, and distributional thesauruses.