Open American National Corpus (OANC) is now available

xujiajin

管理员
Staff member
http://www.anc.org/OANC/

The 15 million word Open American National Corpus (OANC) is now available in the finalized version of the ISO Linguistic Annotation Framework (LAF) Graph Annotation Format (GrAF). Formerly, the OANC was available in an early version of the LAF specification for a pivot format for standoff linguistic annotations.

The ANC provides tools that enable loading GrAF annotations into widely-used annotation platforms, including GATE and UIMA. We also provide a beta version of a web application (ANC2Go) that enables generating all or parts of the OANC in a variety of formats of the user's choice.

The OANC and the Manually Annotated Sub-Corpus (MASC) provide exemplars of GrAF use to annotate linguistic corpora.

Please consult the ANC web pages (http://www.anc.org) for information about the OANC, MASC, tools for processing, and the wide variety of annotations of these data that have been produced or contributed so far.

American National Corpus Project
Department of Computer Science
Vassar College, USA
anc@anc.org
 
回复: Open American National Corpus (OANC) is now available

收藏了,谢谢分享。
 
Back
顶部