口语转写的软件有不少,标准也因人而已。LDC 的 Steven Bird 在其 Linguistic Annotation 网页上对一些相关软件有很好的介绍和对比。见 http://www.ldc.upenn.edu/annotation/
我们目前的一个大型课堂话语语料库的项目采用就是Transcriber, 总体来说效果很好。关于Transcriber具体可参阅:
Transcriber --a tool for segmenting, labeling and transcribing speech http://trans.sourceforge.net/en/presentation.php
Transcriber is free software for transcribing and annotating digital audio, aimed initially at transcription of broadcast news data. Its user interface is written in Tcl/Tk. It uses the same transcription formats as the LDC's Broadcast News data, and has also been adapted for XML I/O. It was developed by Claude Barras and Edouard Geoffrois, at DGA in Paris, in collaboration with LDC.