Urgent thesis question: searching in EditPad

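My search results are all the words that contain a hyphen. I want to use EditPad to count how many are made of two words, how many of three, and so on, and I have run into two problems:

1. For some reason, when I delete the codes, Replace All stops responding halfway through.
2. Most importantly, I searched for the two-word compounds first with \w+-\w+, but Find selects the first two components of every compound. How should I search?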
 
Re: Urgent thesis question: searching in EditPad


Could you post a sample of the search results so I can take a look? :p
 
Re: Urgent thesis question: searching in EditPad

(\w+-)+\w+ finds every compound that contains a hyphen.
To find only those with a single hyphen, just require that no other hyphen appears before or after the match ([^-]).
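
The difference between the two patterns can be checked outside EditPad. The following is a minimal sketch in Python's re module, which is an assumption here (EditPad's own regex flavor may differ in details); the sample sentence is made up for illustration.

```python
import re

# Made-up sample text, not taken from the original word list.
text = "a two-word label, a word-for-word copy, state-of-the-art kit"

# \w+-\w+ stops after two components, so longer compounds are split.
first_two = re.findall(r"\w+-\w+", text)

# (?:\w+-)+\w+ repeats the "word plus hyphen" unit, so each compound
# is matched in full. The group is non-capturing so that findall
# returns whole matches rather than the last repeated group.
whole = re.findall(r"(?:\w+-)+\w+", text)

print(first_two)  # ['two-word', 'word-for', 'state-of', 'the-art']
print(whole)      # ['two-word', 'word-for-word', 'state-of-the-art']
```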
 
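Re: Urgent thesis question: searching in EditPad

ArthurW may already have answered your question, but I would still like you to upload the text as a .txt file or in some other workable format, because I cannot read the image file clearly.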
 
Re: Urgent thesis question: searching in EditPad

\w+(-\w+){m,m}[^-]
With m = 1, this matches words such as "two-word".
With m = 2, it matches words such as "word-for-word".
And so on.
Alternatively: \w+(-\w+){1,}[^-]
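
A possible caveat with the quantifier form, sketched in Python's re module rather than EditPad (an assumption; the flavors may behave differently): the trailing [^-] has to consume one character, so the engine can backtrack into the last \w+ and still report part of a longer compound. The lookaround variant below is not from this thread, and compounds_with_hyphens is a hypothetical helper name.

```python
import re

def compounds_with_hyphens(text, m):
    """Return the compounds in text that contain exactly m hyphens."""
    # Lookarounds check the neighbouring characters without consuming
    # them, so a longer compound cannot be matched in part.
    pattern = rf"(?<![\w-])\w+(?:-\w+){{{m}}}(?![\w-])"
    return re.findall(pattern, text)

# Made-up sample text for illustration.
sample = "two-word word-for-word state-of-the-art"

# With [^-], the engine gives back the final "r" of "for" so that
# [^-] has something to match, and a partial match is reported.
leaky = re.search(r"\w+(-\w+){1,1}[^-]", "word-for-word").group(0)

print(leaky)                              # word-for
print(compounds_with_hyphens(sample, 1))  # ['two-word']
print(compounds_with_hyphens(sample, 2))  # ['word-for-word']
```

If EditPad's flavor accepts lookbehind and lookahead, the same pattern text could be tried there directly; that is untested here.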
 
Re: Urgent thesis question: searching in EditPad

Tried it, but it failed...
I don't know whether the problem is me, the file itself, or the method...
 
Re: Urgent thesis question: searching in EditPad

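I converted the list to Word, tagged the entries with find-and-replace, and then worked out the counts with Excel. It turns out that many of the entries are not English words. Even so, the "-" still serves as the marker. See the attachments for the results.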
 

Attachments

  • 统计.doc (the counts), 35.5 KB
  • 识别发现有些并非英语.jpg (screenshot showing that some entries are not English), 88 KB

Re: Urgent thesis question: searching in EditPad

It must be you, not the file (if it IS raw text), nor the method (if you DID use the expression).
------------------
\w+(-\w+){1,}[^-]
------------------
With the expression above, this is what I got:
==============
end-of-season
end-of-term
end-of-the-century
End-of-Year
end-product
end-product
end-to-end
end-user
end-year
en-echelon
energy-efficient
energy-inefficient
energy-intensive
energy-saving
aid-for-economic-reform
bit-on-the-side
blue-roan-and-white
con-on-the-run
end-of-the-century
every-this-and-thatspins
feet-on-the-ground
first-in-first-out
first-past-the-post
four-square-to-the-wind
grip-it-and-rip-it
her-today-gone-tomorrow
horror-of-it-all
Hutton-in-the-Forest
jack-in-the-box
lily-of-the-valley
live-and-let-live
model-T-Ford-type
never-slow-on-the-uptake
not-in-my-backyard
once-in-a-lifetime
one-and-a-half
one-hundred-year-old
one-member-one-vote
out-of-the-way
pee-on-the-audience
run-of-the-mill
School-of-Sadlers-Wells
seventy-eight-year-old
shot-on-the-turn
s-in-the-fields
sixteen-and-a-half
sock-it-to-em
soon-to-be-famous
spur-of-the-moment
state-of-the-art
state-of-the-art
St-Germain-des-Pres
stuck-in-the-mud
third-of-an-acre
three-and-a-half
three-and-a-half-hour
three-to-seven-year
thrill-of-a-lifetime
time-for-a-change
to-be-looked-at-ness
top-of-the-range
twenty-five-year-old
twenty-one-year-old
twenty-three-year-old
two-and-a-half
two-hour-a-week
two-up-two-down
 
Re: Urgent thesis question: searching in EditPad

It's great that your regex works: it highlights every (multi-)hyphenated string in the list in full, rather than only the first pair of components that liviawu was getting. But it doesn't seem to have solved liviawu's actual problem, which is to use EditPad to count the compounds made of two words, of three words, and so on. I'm looking forward to a better solution from you or someone else. ;)
 
Re: Urgent thesis question: searching in EditPad

Oh, so it looks like this can be solved with Excel.
Thank you so much!
 
Re: Urgent thesis question: searching in EditPad

exactly
 
Re: Urgent thesis question: searching in EditPad

Actually, you don't need Excel; sorting in Word alone is enough. First use Word's find-and-replace to mark the number of hyphens at the start of each string, then sort the list.
[2]100-a-term
[2]100-a-ticket
[2]100m-a-year
[2]100-metre-high
[2]100-per-decoder
[2]100-year-old
...
...
...
[10]aw-shucks-gee-look-what-happened-to-me-when-i-wasn
[10]how-i-conquered-my-bad-habits-and-became-a-normal-person
[10]post-summer-oh-my-god-how-can-i-afford-Christmas
[10]run-down-at-the-end-of-a-long-hard-term
[10]sail-it-on-its-ear-and-hard-luck-if-you
[10]sponsored-eating-jelly-with-chopsticks-in-half-an-hour-athon
[10]the-house-in-the-yalley-where-the-yew-trees-grow
[10]ve-had-a-number-one-the-only-way-is-down
[11]i-am-about-to-explode-in-front-of-your-very-eyes
[11]noise-grunge-new-wave-of-alternative-corporate-underground-anarcho-core-punk
[11]we-come-from-place-bilong-far-on-big-bird-that-doesn
[11]who-is-this-jerk-and-why-am-i-married-to-him
[12]im-in-the-belly-ya-big-ugly-bag-of-chisels-yer-tattoo
[12]kowabunga-mutant-ninja-turtle-most-unfortunately-named-clothing-company-of-the-century
[12]my-man-once-made-this-pass-at-her-at-a-party-but
[12]praise-my-soul-the-king-of-heaven-to-his-feet-thy-tribute
[12]ve-got-a-spoon-and-seal-and-my-family-lies-at-kingsbere
[13]if-you-get-there-before-me-go-in-and-get-on-with-something
[13]steam-radio-stanley-holloway-meets-flanders-and-swann-meets-Joyce-grenfell-old-fashioned
[14]come-on-fred-we-giye-you-all-this-advertising-how-about-an-in-depth-
[14]go-into-groups-discuss-an-important-issue-and-find-a-dramatstatement-for-us-to-discuss
[14]i-am-not-really-dead-but-just-popped-out-for-a-packet-of-fags
[15]i-can-do-anything-i-want-any-time-i-want-and-make-you-like-it
[16]if-we-sold-our-suburban-villa-we-could-buy-a-georgian-manor-house-the-country
[20]christ-this-is-so-delicious-it-seems-a-shame-to-go-on-about-it-but-i-feel-it-is-my-duty
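
The same tag-then-sort idea can be sketched in a few lines of Python, assuming the hyphenated strings are available one per entry as plain text. This is only an illustration of the approach, not the Word procedure used above; tag_by_hyphen_count is a hypothetical name.

```python
def tag_by_hyphen_count(words):
    """Prefix each string with its hyphen count, sorted by that count."""
    tagged = [(w.count("-"), w) for w in words]
    # sort() is stable, so strings with equal counts keep their order.
    tagged.sort(key=lambda pair: pair[0])
    return [f"[{n}]{w}" for n, w in tagged]

# Three strings taken from the lists in this thread.
for line in tag_by_hyphen_count(["state-of-the-art", "100-a-term", "end-user"]):
    print(line)
# [1]end-user
# [2]100-a-term
# [3]state-of-the-art
```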
 
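Re: Urgent thesis question: searching in EditPad

"I want to use EditPad to count the compounds made of two words, of three words, and so on": how can this be done in EditPad? Thanks!
For two components, search for [^-]\w+-\w+[^-]
For three components, search for [^-](\w+-){2}\w+[^-]
For four components, search for [^-](\w+-){3}\w+[^-]
For five components, search for [^-](\w+-){4}\w+[^-]
For six components, search for [^-](\w+-){5}\w+[^-]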
:p
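
For the counts themselves, one pass can tally every length at once instead of running one search per length. A minimal sketch in Python (an assumption, since the thread is about EditPad), using lookarounds instead of [^-] so that neighbouring characters are not consumed; component_tally is a hypothetical name.

```python
import re
from collections import Counter

def component_tally(text):
    """Map number of components -> how many compounds have that many."""
    # Match each hyphenated compound in full; the lookarounds keep a
    # match from starting or ending in the middle of a compound.
    compounds = re.findall(r"(?<![\w-])\w+(?:-\w+)+(?![\w-])", text)
    # A compound with n hyphens has n + 1 components.
    return Counter(c.count("-") + 1 for c in compounds)

# Strings taken from the lists in this thread.
tally = component_tally("end-user end-to-end two-word state-of-the-art")
for size in sorted(tally):
    print(size, tally[size])
# 2 2
# 3 1
# 4 1
```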
 