基于赋码语料的as不同词性检索

i assume it shouldn't be so difficult and i shouldn't be that stupid but duiring the whole week i tortued myself by finding a basic function :sorting a pos-tagged word
Ityped as_in to sort the word as (prep) but every time the wst would cut me dead say "NO COllocate entries found !" I don't know what my problemis , anybody , please help!!
please , someone's dying here :mad::mad::mad::mad::confused::confused:
 
回复: Killing me ,help!!!

1. Please mind you language and font settings when posting a message here;
2. You need specify which corpus you are using and what tools you used for the query, otherwise nobody can help you.
 
回复: Killing me ,help!!!

iam trying to do an essy comparing the multi-part-of speech word "as "between two corpra , BNC and CLEC , frequency rate of the word is a MUST, I used tree-tagger , to be precise wintreetagger coped with the tagging job , i turn out very effecient but when sorting word with pos tag
i could not carry on , I typed as_IN concord sorting box but no entry is found I don't know why , Need a hand here!:)
 
回复: Killing me ,help!!!

你把你的文本样本上传上来看看。
 
回复: Killing me ,help!!!

我在clec里复制了一部分, 不知道这个是不是您想要的







<ST 4> <SEX ?> <Y ?> <AGE ?> <DIC ?> <TYP 1> <WAY 1> <SCH 2745> <TITLE My View on Job-Hopping> <SCORE 9> <ID 440921>
After graduated, everyone is faced on such problem as hunting a job. Then, what's your view on Job-Hopping? Some people like to engaging [vp5,1-] in a job consistently. They think if you do a job for a long time, little by little, you can be good at it. Then, it is more possible for you to do some achievements. Especially, if you have studies [vp9,1-2] some certain subjects, you must want to do a relative job to testify how do you study. [sn8,s-] While [cj1,-s], [sn9,s-] other people prefer to change [vp1,2-2] their jobs frequently. Because they think if they do many kinds of jobs, they can obtain many kinds of knowledge and experience from it. I agree with [wd3,1-3] the first opinion, ie. doing a job forever. Because I want to succeed in one field.


赋码之后的格式如下
<ST -> JJ <unknown>
<ST 5> NN <unknown>
, , ,
<SEX ?> JJ <unknown>
, , ,
<Y ?> JJ <unknown>
, , ,
<SCH GDUFS> JJ <unknown>
, , ,
<AGE ?> JJ <unknown>
, , ,
<WAY ?> JJ <unknown>
, , ,
<DIC ?> JJ <unknown>
, , ,
<TYP 2> JJ <unknown>
My NP My
Education NP Education
Although IN although
I PP I
was VBD be
very RB very
young JJ young
, , ,
I PP I
showed VVD show
great JJ great
interests NNS interest
in IN in
studies NNS study
and CC and
used VVN use
to TO to
urge VV urge
my PP$ my
father NN father
to TO to
teach VV teach
me PP me
read VV read
and CC and
write VV write
before IN before
I PP I
receiving VVG receive
the DT the
school NN school
education.[sn8,s NNS <unknown>
] SYM ]
At IN at
that DT that
time NN time
, , ,
I PP I
enjoyed VVD enjoy
leaning VVG lean
[ SYM [
wd3,s NNS <unknown>
] SYM ]
so RB so
much JJ much
that IN/that that
I PP I
had VHD have
[ SYM [
vp6,s- NN <unknown>
] SYM ]
formed VVD form
a DT a
good JJ good
habit NN habit
of IN of
hard-working JJ hard-working
since IN since
then RB then
. SENT .
I PP I
went VVD go
to TO to
primary JJ primary
school NN school
at IN at
the DT the
age NN age
of IN of
7 CD 7
, , ,
and CC and
left VVD leave
it PP it
at IN at
13 CD @card@
after IN after
successfully RB successfully
passing VVG pass
the DT the
entrance NN entrance
examination NN examination
to TO to
junior JJ junior
Middle NP Middle
School NP School
. SENT .
It PP it
took VVD take
me PP me
3 CD 3
years NNS year
 
回复: Killing me ,help!!!

你的赋码有问题。

你的赋码语料里没有你搜索的as_IN,因为你的赋码语料是vertical形式,即一行一个单词。
你是用的新版的TreeTagger标注的吗?新版的标注结果应该是horizontal的,即as_IN for_IN,而不是一个单词在一行。
 
回复: Killing me ,help!!!

果然你没有找对版本。

你用这个试试:
http://www.corpus4u.org/attachment.php?attachmentid=549&d=1239153276

你应该得到类似下面的结果:
There_EX are_VBP no_DT orthographic_JJ boundaries_NNS between_IN words_NNS in_IN Chinese_NP ._SENT
This_DT is_VBZ the_DT main_JJ difficulty_NN of_IN working_VVG with_IN Chinese_JJ computationally_NNS (_( in_IN addition_NN to_TO the_DT bewildering_VVG array_NN of_IN encodings_NNS used_VVN for_IN Chinese_NP and_CC the_DT simplified/traditional_JJ script_NN controversy_NN )_) ._SENT
A_DT Chinese_JJ word_NN frequently_RB consists_VVZ of_IN two_CD ,_, three_CD or_CC more_JJR characters_NNS ,_, while_IN the_DT definition_NN of_IN what_WP counts_VVZ as_IN a_DT word_NN in_IN Chinese_NP is_VBZ the_DT subject_NN of_IN intense_JJ debates_NNS (_( though_IN the_DT same_JJ is_VBZ true_JJ for_IN other_JJ languages_NNS ,_, constructions_NNS like_VVP as_RB well_RB as_RB or_CC give_VVP up_RB have_VH all_PDT the_DT properties_NNS of_IN a_DT single_JJ word_NN ,_, and_CC names_NNS ,_, like_IN White_NP House_NP ,_, also_RB mean_VV what_WP they_PP are_VBP supposed_VVN to_TO mean_VV only_RB taken_VVN as_IN a_DT whole_NN )_) ._SENT
 
回复: 基于赋码语料的as不同词性检索

First of all
SO many times when i was on the edge of giving up you and other senior researchers changed the whole situation with a few line of caring words, i‘m so grateful !thank u
and
Thank U ALL!!!!!!!!!!!!!!
Here is the “But”(sorry always come with the buts)
It seems that the old edtion worked more quickly than this one , cause i have been waiting for the software finish its tagging job since adam and eve time , heehee:p (30ms actually)
it doesn’t show anything except that the interface turned to a blank as soon as i press down either button on it
 
回复: 基于赋码语料的as不同词性检索

Good to know that you succeeded.

The "new" one is supposed to be very fast, and as fast as the one on Stuttgart site.
 
回复: 基于赋码语料的as不同词性检索

i know , i will try it on another pc tomorrow
thank u xu!!
 
Back
顶部