L2 Syntacic Complexity Analyzer

xflu76 · 2009-12-29

http://www.personal.psu.edu/xxl13/downloads/l2sca.html

About

L2 Syntactic Complexity Analyzer is designed to automate syntactic complexity analysis of written English language samples produced by advanced learners of English using fourteen different measures proposed in the second language development literature. The analyzer takes a written English language sample in plain text format as input and generates 14 indices of syntactic complexity of the sample. This software is an implementation of the system described in:

Lu, Xiaofei (forthcoming). Automatic analysis of syntactic complexity in second language writing. International Journal of Corpus Linguistics.

The analyzer is implemented in python and runs on UNIX-like (LINUX, MAC OS, or UNIX) systems with Java 1.5 and python 2.5 or higher installed. The analyzer takes as input a plain text file, counts the frequency of the following 9 structures in the text: words (W), sentences (S), verb phrases (VP), clauses (C), T-units (T), dependent clauses (DC), complex T-units (CT), coordinate phrases (CP), and complex nominals (CN), and computes the following 14 syntactic complexity indices of the text: mean length of sentence (MLS), mean length of T-unit (MLT), mean length of clause (MLC), clauses per sentence (C/S), verb phrases per T-unit (VP/T),, clauses per T-unit (C/T), dependent clauses per clause (DC/C), dependent clauses per T-unit (DC/T), T-units per sentence (T/S), complex T-unit ratio (CT/T), coordinate phrases per T-unit (CP/T), coordinate phrases per clause (CP/C), complex nominals per T-unit (CN/T), and complex nominals per clause (CP/C). The analyzer calls the Stanford praser (Klein & Manning, 2003) to parse the input file and Tregex (Levy & Andrew, 2006) to query the parse trees. Both the Stanford parser and Tregex are bundled in this download and installation along with the appropriate licenses.

westboy · 2009-12-30

回复: L2 Syntacic Complexity Analyzer

Thanks for information.
FREE???

xflu76 · 2009-12-30

回复: L2 Syntacic Complexity Analyzer

Yes, free for research purposes.

xiaoz · 2009-12-30

回复: L2 Syntacic Complexity Analyzer

Many thanks, Xiaofei. Is there a way to use the software package in Windows directly - without support of Cygwin etc I mean?

作者 xflu76:
Yes, free for research purposes.

xflu76 · 2009-12-30

回复: L2 Syntacic Complexity Analyzer

Not yet. Unfortunately I probably won't have time to make that happen in the near future. At the moment it should run fairly easily (both on individual text files and on multiple text files in a folder) on a UNIX-like system.

作者 xiaoz:
Many thanks, Xiaofei. Is there a way to use the software package in Windows directly - without support of Cygwin etc I mean?

清风出袖 · 2009-12-30

回复: L2 Syntacic Complexity Analyzer

thanks Dr Lu Xiaofei a lot to share with us your latest research!

seinewang · 2009-12-30

回复: L2 Syntacic Complexity Analyzer

作者 xflu76:
Not yet. Unfortunately I probably won't have time to make that happen in the near future. At the moment it should run fairly easily (both on individual text files and on multiple text files in a folder) on a UNIX-like system.

Pity that it won't run on the Windows system. Anyway, thanks for your generous sharing.

seanxpq · 2009-12-30

回复: L2 Syntacic Complexity Analyzer

It runs on JAVA?

xflu76 · 2010-01-01

回复: L2 Syntacic Complexity Analyzer

I'm not sure about that. At the moment I really only have confidence that it runs well from a command line in a UNIX-like system.

作者 seanxpq:
It runs on JAVA?

fountainli · 2010-01-02

回复: L2 Syntacic Complexity Analyzer

thanks, happy new year!

L2 Syntacic Complexity Analyzer

xflu76

westboy

xflu76

xiaoz

永远的超级管理员

xflu76

清风出袖

高级会员

seinewang

seanxpq

corpus explorer

xflu76

fountainli