Tagset for Biber tagger

xujiajin

http://americannationalcorpus.org/FirstRelease/Biber-tags.txt

Tag descriptions
Doug Biber
15 June, 1993

There are five tag fields, separated by a plus sign (+). For most words,
only one or two of the fields are used. The primary grammatical category of
a word is usually marked in the first tag field; many of these first-field
tags are identical to tags used in the LOB tag set.

In the case of adjectives (TAG = JJ), nouns (TAG = NN), and verbs
(TAG = VB), the tag ?? can appear in Tag Field 4 to mark words that were not in
the dictionary; in these cases, the grammatical category is assigned
based on morphology and the surrounding context.

The tags xvbn and xvbnx in Field 4 mark a word as being a past participle
form, regardless of function. Thus, some adjectives, nouns, and base verb forms
are marked as xvbn. All past tense verbs, perfect aspect verbs, and
passive verbs have this tag. The tag xvbnx is used to mark cases where
the grammatical function (e.g., perfect or passive) has been identified with
a very high degree of accuracy from the context; the tag xvbn is used for cases
where the assigned grammatical function is less certain.

The tags xvbg and xvbgx in Field 4 mark a word as being a present
participle form, regardless of function. Thus, some adjectives and nouns
are marked as xvbg. All present progressive verbs have this tag. The tag xvbgx
is used to mark cases where the grammatical function has been identified with a
very high degree of accuracy; the tag xvbg is used for
cases where the assigned grammatical function is less certain.
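
The field layout described above can be sketched in a few lines of Python.
This is only an illustration, not part of the tagger: the names BiberTag,
parse_tag, participle_info and is_unknown_word are hypothetical, and the
sketch assumes every tag contains exactly four plus signs, as in the listing
that follows.

```python
from typing import NamedTuple, Optional, Tuple


class BiberTag(NamedTuple):
    """The five '+'-separated fields of one tag (hypothetical container)."""
    field1: str
    field2: str
    field3: str
    field4: str
    field5: str


def parse_tag(tag: str) -> BiberTag:
    """Split a tag such as 'jj+atrb++xvbn+' into its five fields."""
    parts = tag.split("+")
    if len(parts) != 5:
        raise ValueError(f"expected 5 fields, got {len(parts)}: {tag!r}")
    return BiberTag(*parts)


# Field 4 participle marks and how certain the assigned function is.
PARTICIPLE_MARKS = {
    "xvbn": ("past participle", "less certain"),
    "xvbnx": ("past participle", "high"),
    "xvbg": ("present participle", "less certain"),
    "xvbgx": ("present participle", "high"),
}


def participle_info(tag: BiberTag) -> Optional[Tuple[str, str]]:
    """Return (form, certainty) from Field 4, or None if there is no mark."""
    return PARTICIPLE_MARKS.get(tag.field4)


def is_unknown_word(tag: BiberTag) -> bool:
    """True when Field 4 holds '??' (word was not in the dictionary)."""
    return tag.field4 == "??"


if __name__ == "__main__":
    tag = parse_tag("jj+atrb++xvbn+")
    print(tag.field1, tag.field2, participle_info(tag))
```

Empty fields come back as empty strings, so a tag such as $++++ parses to a
first field of "$" and four empty fields.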



--------------------------------------------------------------------------
Tag sequence: Field 1 + Field 2 + Field 3 + Field 4 + Field 5
:+clp+++ colon + clause punctuation
;+clp+++ semi-colon + clause punctuation
?+clp+++ question mark + clause punctuation
!+clp+++ exclamation mark + clause punctuation
,++++ comma
-++++ dash
"++++ double quote mark
'++++ single quote mark
(++++ left parenthesis
)++++ right parenthesis
$++++ dollar sign
%++++ percent sign
&fo++++ formula symbols

&fw++++ foreign word


abl++++ pre-qualifier (rather, such)
abn++++ pre-quantifier (all, half)
abx++++ pre-quantifier/double conjunction (both)
ap++++ post-determiner (many, more, most, only, other, own, same, ...)
aps++++ plural post-determiner (others)
at++++ singular indefinite article (a, an)
ati++++ singular definite article (the, no)

cc++++ coordinating conjunction (and, but, or)
cc+cls+++ coordinating conjunction + clausal connector
cc+phrs+++ coordinating conjunction + phrasal connector
cc"++++ multi-word coordinating conjunction (as well as)
cc++neg++ coordinating conjunction + + negation (nor)

cd++++ cardinal number (2, 3, 4, two, three, four, hundred, ...)
cd+date+++ cardinal number + date (year only)
cd1++++ cardinal number: 1, one
cd1s++++ cardinal number: ones
cds++++ cardinal plural (tens, hundreds, thousands)
od++++ ordinal number (1st, 2nd, first, second, ...)

cs+cnd+++ subordinating conjunction + conditional (if, unless)
cs+con+++ subordinating conjunction + concessive (although, though)
cs+cos+++ subordinating conjunction + causative (because)
cs+who+++ subordinating conjunction + WH word (whether)
cs+sub+++ subordinating conjunction + other (as, except, until, ...)
cs"++++ multi-word subordinating conjunction (in that, so that, ...)

dt+dem+++ determiner + demonstrative (this, that, these, those modifying N)
dt+pdem+++ determiner + demonstrative pronoun (this, that, these, those)
dti++++ singular or plural determiner (any, enough, some)
dt++++ other singular determiner (another, each)
dtx++++ determiner/double conjunction (either)

ex+pex+++ existential there

in++++ preposition
in+ppvb+++ preposition + prepositional verb (account for, join in, ...)
in+pl+++ preposition + place marker (above, behind, beside, ...)
in"++++ multi-word preposition (as to, away from, instead of, ...)
in+strn+++ preposition + stranded

jj+atrb+++ adjective + attributive function
jj+atrb++xvbg+ adjective + attributive function + + -ing form
jj+atrb++xvbn+ adjective + attributive function + + past participle form
jj+pred+++ adjective + predicative function
jj++++ adjective + indeterminate function
jjb+atrb+++ attributive-only adjective + attributive (chief, entire)
jjr+atrb+++ comparative adjective + attributive function
jjr+pred+++ comparative adjective + predicative function
jjt+atrb+++ superlative adjective + attributive function


-----------------------------------------------------------------------
All modal forms can be marked with 0 in Field 5 (e.g., md+prd+++0) to show that
they are contracted forms (e.g., 'll, 've).

md+nec+++ modal + necessity (ought, should, must)
md+pos+++ modal + possibility (can, may, might, could)
md+prd+++ modal + prediction (will, would, shall)
md"++pmd"++ modal + + multi-word periphrastic modal (e.g., be going to)
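
As a rough sketch of how the modal tags above might be read programmatically,
the following Python function reports the modal class and whether Field 5
carries the contraction mark 0. The function name describe_modal and the
returned dictionary layout are assumptions made for illustration only.

```python
# Field 2 codes for modals, taken from the listing above.
MODAL_CLASSES = {"nec": "necessity", "pos": "possibility", "prd": "prediction"}


def describe_modal(tag):
    """Return the class and contraction status of a modal tag, else None."""
    fields = tag.split("+")
    if len(fields) != 5 or fields[0] not in ("md", 'md"'):
        return None  # not a modal tag
    if fields[2] == 'pmd"':
        # Multi-word periphrastic modal, marked in Field 3.
        modal_class = "periphrastic"
    else:
        modal_class = MODAL_CLASSES.get(fields[1], "unspecified")
    # A 0 in Field 5 marks a contracted form.
    return {"class": modal_class, "contracted": fields[4] == "0"}


if __name__ == "__main__":
    print(describe_modal("md+prd+++0"))
```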


nn++++ singular common noun
nn+nom+++ singular noun + nominalization
nvbg+++xvbg+ singular noun + + + -ing form
nn+++xvbn+ singular noun + + + past participle form
nns++++ plural common noun
nns+nom+++ plural noun + nominalization
nnu++++ unit of measurement (lb, kg, ...)
np++++ singular proper noun
nps++++ plural proper noun
npl++++ locative noun
npt++++ singular titular noun
npts++++ plural titular noun
nr++++ singular adverbial noun (east, west, today, home, ...)
nrs++++ plural adverbial noun


----------------------------------------------------------------------
NB: In the following pronoun tags, be careful of the difference between the
number 1, used to mark first person, and the letter l (i.e. lower case L), used
to mark reflexives.

pp1a+pp1+++ first person subject pronoun + first person pronoun
pp1a+pp1+++0 first person subject pronoun + 1st person pro. + contracted
pp1o+pp1+++ first person object pronoun + first person pronoun
pp$+pp1+++ possessive determiner + first person pronoun (my, our)
ppl+pp1+++ singular reflexive pronoun + first person pronoun (myself)
ppls+pp1+++ plural reflexive pronoun + first person pronoun (ourselves)
pp2+pp2+++ second person pronoun + second person pronoun
pp$+pp2+++ possessive determiner + second person pronoun (your)
ppl+pp2+++ singular reflexive pronoun + second person pronoun (yourself)
pp3a+pp3+++ third person subject pronoun + third person personal pronoun
pp3o+pp3+++ third person object pronoun + third person personal pronoun
pp3+pp3+++0 third person pronoun + 3rd person personal pro. + contracted
pp$+pp3+++ possessive + 3rd pers. personal pro. (his, her, their)
ppl+pp3+++ sg. reflexive pronoun + 3rd pers. personal pro. (her/himself)
ppls+pp3+++ pl. reflexive pronoun + 3rd pers. personal pro. (themselves)
pp3+it+++ third person pronoun + third person impersonal pronoun (it)
pp$+it+++ possessive determiner + third person impersonal pronoun (its)
pp$$++++ possessive pronoun (mine, yours, ...)
pn"++++ multi-word nominal pronoun (no one, ...)
pn++++ nominal pronoun (someone, everything, ...)
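
Because the digit 1 and the letter l are easy to confuse in these tags, it can
help to test for them explicitly rather than by eye. The sketch below is a
hypothetical helper (describe_pronoun is not part of the tagger) that reads
person from Field 2 and treats the Field 1 values ppl and ppls, spelled with
the letter l, as reflexive.

```python
# Field 2 codes for personal pronouns, taken from the listing above.
PERSON_CODES = {
    "pp1": "first",
    "pp2": "second",
    "pp3": "third",
    "it": "third (impersonal)",
}

# Reflexive markers in Field 1 end in the letter l (not the digit 1).
REFLEXIVE_FIELD1 = ("ppl", "ppls")


def describe_pronoun(tag):
    """Return person, reflexive and contracted flags, or None if not personal."""
    fields = tag.split("+")
    if len(fields) != 5:
        return None
    person = PERSON_CODES.get(fields[1])
    if person is None:
        return None  # e.g. pn++++ carries no person code in Field 2
    return {
        "person": person,
        "reflexive": fields[0] in REFLEXIVE_FIELD1,
        "contracted": fields[4] == "0",
    }


if __name__ == "__main__":
    print(describe_pronoun("ppl+pp1+++"))    # reflexive, first person
    print(describe_pronoun("pp1a+pp1+++0"))  # subject form, contracted
```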

ql++++ qualifier (as, less, more, too)
ql+amp+++ qualifier + amplifier (very)
ql+emph+++ qualifier + emphatic (most)
qlp++++ post-qualifier (enough, indeed)


All adverb forms can be marked as splt in Field 3 (e.g., rb+amp+splt++)
to indicate that the adverb occurs within the auxiliary
(e.g., they've probably been looking...).
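
A minimal sketch of checking for the splt mark, assuming (this is not stated
in the text above) that adverb tags begin with rb in Field 1, as in the
example rb+amp+splt++. The function name is_split_adverb is hypothetical.

```python
def is_split_adverb(tag):
    """True if the tag is an adverb marked splt in Field 3."""
    fields = tag.split("+")
    return (
        len(fields) == 5
        and fields[0].startswith("rb")  # assumption: adverb tags start with rb
        and fields[2] == "splt"
    )


if __name__ == "__main__":
    print(is_split_adverb("rb+amp+splt++"))
```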
.............................
 