回复:研究习作: 离合词的语料库研究
The best way to do this kind of study, in my view, is to prepare a list of, say, 40 most commonly used words of this category in MS-Word (40 is chosen because the search/context string must be less than 80 characters including /). Then do the following:
1) Add a space or tab between separable element (suppose there are only two elements);
2) Select All and convert the selection into a table;
3) Select the first column and cut and paste into Notepad and save as a Unicode file (part1.txt);
4) Conver the second column back into text (now one iitem per line);
5) Find and Replace all ^p (new line character in Word) with / (now item1/item2/item3/);
6) Remove the final /;
7) Start WST4 and load the Chinese corpus;
8) Start concord and use file-based concordances;
9) Type in Path:\file1.txt (Path=the folder you save file1.txt) e.g. C:\file1.txt and Press Load;
10) In advanced search copy the string item1/item2/item3...etc into the box for context;
11) Set L=0 and R=n (n>0, the greater n is, the noisy the result contains);
12) search as usual.
So you get something like below, if tis is what you want: