seanxpq
corpus explorer
http://www.textanalysis.com/Apps/POS_Tagger/pos_tagger.html
We are proud to announce the release of a standalone freeware executable of TAIParse featuring part-of-speech tagging. A tagger is a necessary component of most text analysis systems, as it assigns a syntax class (e.g., noun, verb, adjective, adverb) to every word in a sentence.
The tagger produces an output format almost identical to that of the Penn Treebank Project, including bracketing of noun phrases. The current version achieves 93% accuracy in a blind test that we use to assess progress.
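For readers who want to post-process output of this kind, here is a minimal Python sketch; it is not part of TAIParse. It assumes a Penn Treebank-style line in which each token is written as word/TAG and noun phrases are enclosed in space-separated square brackets. The real TAIParse output is described only as "almost identical" to that format, so details may differ. The names parse_tagged and accuracy are hypothetical, and scoring by per-token agreement with a hand-tagged reference is an assumption about how a blind test could be evaluated, not a description of the authors' procedure.

```python
def parse_tagged(line):
    """Split a bracketed, slash-tagged line into (word, tag, in_np) triples.

    Assumed input format (Penn Treebank style): tokens look like word/TAG,
    and noun phrases are wrapped in standalone "[" and "]" tokens.
    """
    tokens = []
    in_np = False
    for item in line.split():
        if item == "[":
            in_np = True
        elif item == "]":
            in_np = False
        elif "/" in item:
            # Split on the LAST slash so words that contain "/" survive.
            word, _, tag = item.rpartition("/")
            tokens.append((word, tag, in_np))
        else:
            raise ValueError(f"token without a tag: {item!r}")
    return tokens


def accuracy(predicted_tags, gold_tags):
    """Fraction of positions where the predicted tag equals the reference tag."""
    if len(predicted_tags) != len(gold_tags):
        raise ValueError("tag sequences must have the same length")
    if not gold_tags:
        raise ValueError("cannot score an empty sequence")
    correct = sum(p == g for p, g in zip(predicted_tags, gold_tags))
    return correct / len(gold_tags)


if __name__ == "__main__":
    # Illustrative sentence, hand-written for this sketch (not tagger output).
    line = "[ The/DT tagger/NN ] assigns/VBZ [ a/DT class/NN ]"
    parsed = parse_tagged(line)
    predicted = [tag for _, tag, _ in parsed]
    gold = ["DT", "NN", "VBZ", "DT", "JJ"]  # made-up reference with one mismatch
    print(parsed[0])                   # ('The', 'DT', True)
    print(accuracy(predicted, gold))   # 0.8
```

Splitting on the last slash keeps tokens such as 1/2/CD intact, and tracking the bracket state per token makes it easy to recover the noun-phrase spans afterwards.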
The tagger has been built manually with general rules and methods. The entire analyzer definition, in our NLP++ language, is supplied with the download. In contrast to other taggers, which are overtrained for particular document sets and use overly specific rules, this tagger can readily be applied to unseen text types.
Installation of the tagger is simple: download and unzip the archive, view the README file, and edit the run.bat script for your environment. Then invoke run.bat from a DOS window. Note: at this time, an executable is offered for Windows only.
Editing, enhancing, and compiling the tagger requires VisualText. A 90-day free trial version is available on this website by simply filling out and submitting a form.
We welcome your feedback, questions, and suggestions. Note that this is a beta version and has yet to be subjected to a massive number of texts.
Download TAIParse 0.7 beta, focusing on POS tagging and shallow parsing. (7/10/05)
Reference: Tagset used in Penn Treebank.
CC    Coordinating conjunction
CD    Cardinal number
DT    Determiner
EX    Existential there
FW    Foreign word
IN    Preposition or subordinating conjunction
JJ    Adjective
JJR   Adjective, comparative
JJS   Adjective, superlative
LS    List item marker
MD    Modal
NN    Noun, singular or mass
NP    Proper noun, singular
NPS   Proper noun, plural
PDT   Predeterminer
POS   Possessive ending
PP    Personal pronoun
PP$   Possessive pronoun
RB    Adverb
RBR   Adverb, comparative
RBS   Adverb, superlative
RP    Particle
SYM   Symbol
TO    To
UH    Interjection
VB    Verb, base form
VBD   Verb, past tense
VBG   Verb, gerund or present participle
VBN   Verb, past participle
VBP   Verb, non-3rd person singular present
VBZ   Verb, 3rd person singular present
WDT   Wh-determiner
WP    Wh-pronoun
WP$   Possessive wh-pronoun
WRB   Wh-adverb
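The tags above can be collapsed into the coarse syntax classes mentioned in the announcement (noun, verb, adjective, adverb). The following Python sketch shows one way to do that. The grouping and the function name coarse_class are assumptions made for illustration; they are not defined by TAIParse or by the Penn Treebank documentation.

```python
from collections import Counter


def coarse_class(tag):
    """Map a tag from the list above to a coarse class (illustrative grouping)."""
    if tag.startswith("NN") or tag in ("NP", "NPS"):
        return "noun"
    if tag.startswith("VB"):
        return "verb"
    if tag.startswith("JJ"):
        return "adjective"
    if tag.startswith("RB"):
        return "adverb"
    # Everything else (determiners, pronouns, modals, particles, ...) is lumped together.
    return "other"


if __name__ == "__main__":
    tags = ["DT", "NN", "VBZ", "DT", "JJ", "NN", "RB"]
    print(Counter(coarse_class(t) for t in tags))
```

Note that RP (particle) and PP (personal pronoun) deliberately fall into "other": the prefix checks use RB and NN, so tags that only share a letter with those classes are not miscounted.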