Re: How to search for it + be + adj/n + that, etc.
Thanks for laohong's brilliant introduction to NooJ. At the same time, I think this type of search pattern can be handled with some simple regular expressions and software that supports regular expressions, such as PhraseContext, which was introduced in this forum.
You are surely right, but I wonder whether you have tried NooJ. As for querying with regular expressions, NooJ provides wildcards (Boolean), Perl regular expressions, NooJ regular expressions, and NooJ grammars for querying any pattern in texts. Can you name any other corpus tools that are as competitive in this respect?
As we know, searching with ordinary regular expressions can be quite troublesome. Take "it + be + adj/n + that" as an example: to search for it with a Perl regular expression, most people here would find it hard to write an expression that covers all of the following:
it => IT, It, it
be => am is are was were ... (and their variations with upper or lower case)
adj => in an untagged text, how can you search for something like this?
n => same as above.
adj/n => how?
that => simple string match
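To make the difficulty concrete, here is a rough sketch in Python of how far a plain regular expression can get on untagged text. The list of "be" forms and the sample sentence are my own illustrations, not taken from any tool. Case and the forms of "be" can be handled, but the adj/n slot can only be "any single word", so wrong hits slip through.

```python
import re

# Hand-listed forms of "be"; illustrative only, not exhaustive.
BE_FORMS = ["am", "is", "are", "was", "were", "be", "been", "being"]

# IGNORECASE covers IT / It / it and upper-case variants of "be".
# (\w+) stands in for adj/n: without POS tags, a regex cannot tell
# an adjective or noun from any other word.
PATTERN = re.compile(
    r"\bit\s+(?:" + "|".join(BE_FORMS) + r")\s+(\w+)\s+that\b",
    re.IGNORECASE,
)

def find_candidates(text):
    """Return (matched string, word in the adj/n slot) pairs."""
    return [(m.group(0), m.group(1)) for m in PATTERN.finditer(text)]

# Made-up sample: the second hit has an adverb ("then") in the slot.
sample = "It is true that she left. It was then that he spoke."
hits = find_candidates(sample)
for matched, slot in hits:
    print(matched, "| slot:", slot)
```

The second hit shows the problem: "then" is neither an adjective nor a noun, yet the expression cannot exclude it.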
However, NooJ can do a much better job in this case. If you query with a NooJ regular expression, simply enter "it <be> (<A> + <N>) that" or "it <be> (<ADJ> + <N>) that" (without the quotation marks), and you will get all the concordances, such as:
ith the unfamiliar. It is true that among her contempor
ad he wrote to you; it was right that he should, and he w
ll as Mr. Luce, and it was probable that as his experience a
s eyes as well; but it was probable that the movement had in
," laughed Isabel, " it was better that you should do that
ly got over it, and it was natural that , as that affair had
ventured to ask. " It was natural that as an old friend of
ad been vanquished. It was well that Mr. Edward Rosier h