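I am a beginner and would like to build a small corpus of my own.
Could you explain the detailed steps for building a small corpus?
Thanks!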
Thanks for the info.
This book is not organized along the lines of corpus tools.
As a corpus is a neat tool for both linguists and computer scientists, I think I should reserve my opinion on this issue, although I am myself a linguist by trade (and training).
Corpus analyses should be led by a linguist, not a computer scientist.
That is true enough if we do not want to implement our ideas directly, but some of us do. So I see nothing wrong in knowing more about the technical side, although it means we have to split our time and energy between theory and practice, at the risk that we (or at least I) harvest nothing in either world.
Nonetheless, it is certainly important for the main investigator to know what is technologically possible. Linguistics is at the heart of language corpus studies; corpus technology is only the implementation or solution, and most often a partial one.
By "design decisions", I mean that I would like to know more about the thinking behind your design of the BFSU series of corpus tools: for example, why you chose the sentence as the basic unit of presentation in some of your programs (e.g. the FLERIC Learner Corpus Portal, Sentence Collector) rather than paragraphs or other linguistic units.
BTW: What are "design decisions"? Did you mean the organization of the book?
I started my CL journey with this book. It is a really good book. Thanks for sharing.