Three Views on Corpora: Corpus Linguistics, Literary Computing, and Computational Linguistics
Anke Lüdeling, Amir Zeldes
Abstract
Digital corpora are used as a data source in corpus linguistics, literary computing and computational linguistics. Although differences in these disciplines dictate different kinds of work with corpora, many of their respective methods either are applied or could be applicable in the other disciplines. With the recent emergence of richly annotated multi-level and multipurpose corpora in mind, we review differences and similarities in research questions, corpus resources and their qualitative and quantitative exploitation in the three disciplines, along with suggestions for further development and mutual enrichment.
Full paper
http://computerphilologie.tu-darmstadt.de/jg07/luedzeldes.html
Anke Lüdeling, Amir Zeldes
Abstract
Digital corpora are used as a data source in corpus linguistics, literary computing and computational linguistics. Although differences in these disciplines dictate different kinds of work with corpora, many of their respective methods either are applied or could be applicable in the other disciplines. With the recent emergence of richly annotated multi-level and multipurpose corpora in mind, we review differences and similarities in research questions, corpus resources and their qualitative and quantitative exploitation in the three disciplines, along with suggestions for further development and mutual enrichment.
Full paper
http://computerphilologie.tu-darmstadt.de/jg07/luedzeldes.html