I've finally given up trying to make out the differentiation between Z-scores calculated by SARA and the method proposed in Yang's book, 'An Introduction to Corpus Linguistics'. I took 动态语法's suggestion "最好是找到不同语料库的相匹配的原始数据,然后用同一个统计软件计算", and left this issue unsolved.
However, by making a few tests, I don't tend to believe that the differentiation is caused by different delimitations of span. I use the clearest way to define S, left 5 and right 5, or left 4 and right 4, etc. Listed below are my tests:
C': co-occurrence with the node
C: occurrence of the collocate
W: total number of words in BNC
N: occurrence of the node word