František Čermák
Today's Corpus Linguistics: Some Open Questions
International Journal of Corpus Linguistics 7,2, 2003, 265-282.
Abstract
In general, the paper is concerned with problems of methodology. Against this background, situation of today´s corpora is being discussed and some fields are being pointed at as being in a far from satisfactory shape. Place of corpora in linguistics is briefly looked at, suggesting that structuralist tradition might be using them most. Problems of annotation and ways, less (statistical) or more succesful (rule-based), are noted. Here, some of the most serious shortcomings, such as multi-word units or status of language units in general that computational linguists should deal with, is listed. In a more general direction, implications and status of paradigmatics and syntagmatics is discussed, too, with a considerable and critical attention paid to ontologies.
Today's Corpus Linguistics: Some Open Questions
International Journal of Corpus Linguistics 7,2, 2003, 265-282.
Abstract
In general, the paper is concerned with problems of methodology. Against this background, situation of today´s corpora is being discussed and some fields are being pointed at as being in a far from satisfactory shape. Place of corpora in linguistics is briefly looked at, suggesting that structuralist tradition might be using them most. Problems of annotation and ways, less (statistical) or more succesful (rule-based), are noted. Here, some of the most serious shortcomings, such as multi-word units or status of language units in general that computational linguists should deal with, is listed. In a more general direction, implications and status of paradigmatics and syntagmatics is discussed, too, with a considerable and critical attention paid to ontologies.