急求解:对BNC的标注语料如何纯净化?

回复: 急求解:对BNC的标注语料如何纯净化?

孙教授总是那么的幽默。
我们都希望您的detagger出家的好。

呵呵,看来真的该“出家”了。希望大家多提要求,多给样本,一定给弄出一个没技术但有模样的简单易用的山寨版全能或多能的detagger.
 
回复: 急求解:对BNC的标注语料如何纯净化?

呵呵,看来真的该“出家”了。希望大家多提要求,多给样本,一定给弄出一个没技术但有模样的简单易用的山寨版全能或多能的detagger.
here is a sample of ICE-HK, pls try and tell me how.thanks
 

附件

  • sample ICE-HK.rar
    261.2 KB · 浏览: 9
回复: 急求解:对BNC的标注语料如何纯净化?

here is a sample of ICE-HK, pls try and tell me how.thanks

Ok, I got the sample. It seems to be a tough job,but we still could use the same and easy way to detag it. The only efforts i should make now is to go over the Markup Manual of ICE for Spoken Texts first.:)
 
回复: 急求解:对BNC的标注语料如何纯净化?

Ok, I got the sample. It seems to be a tough job,but we still could use the same and easy way to detag it. The only efforts i should make now is to go over the Markup Manual of ICE for Spoken Texts first.:)
Thank you.
I tried to use 文本整理器, but it's very slow and not satisfactory.Waiting for your news,dear.
 
回复: 急求解:对BNC的标注语料如何纯净化?

还有一个懒的方法,广州中医药大学的薛学彦老师做了一个slimBNC,把所有附码都去掉了,只剩下文本了。可以给他联系。
 
回复: 急求解:对BNC的标注语料如何纯净化?

here is a sample of ICE-HK, pls try and tell me how.thanks[/QUOTE
pls refer to the attachments for the detagged samples
 

附件

  • ICE_HK(1-21)detagged.doc
    1.4 MB · 浏览: 41
  • ICE_HK(1-21)detagged.txt
    479.1 KB · 浏览: 13
回复: 急求解:对BNC的标注语料如何纯净化?

How to detag the whole 500, dear?Thank you very much.You are so helpful
 
回复: 急求解:对BNC的标注语料如何纯净化?

How to detag the whole 500, dear?Thank you very much.You are so helpful

the whole 500 or more can be conveniently detagged by using the same way as i did for your 21 files.
 
Back
顶部