Images help a lot.
Before concordancing, computing collocations, etc., the corpus has to be preprocessed:
Mark up the corpus in XML
1. Markup can be very complex or very simple
2. If your corpus has no XML markup, use the Index tool (Tools → Preprocess in the Index Toolkit) to add simple XML markup
3. For a corpus in a language with a non-alphabetic script, convert it to Unicode (e.g. UTF-8, UTF-16)
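For readers who want to prepare files outside the toolkit, the conversion and simple markup steps above can be sketched in Python. This is only an illustration under stated assumptions: the function name `to_simple_xml`, the `<text>` and `<p>` element names, and the blank-line paragraph rule are hypothetical choices, not what the Index Toolkit's Preprocess command produces.

```python
from xml.sax.saxutils import escape


def to_simple_xml(raw: bytes, source_encoding: str) -> bytes:
    """Decode legacy-encoded text and wrap it in minimal XML, encoded as UTF-8.

    Hypothetical helper: the <text>/<p> element names are assumptions.
    """
    # Decode from the legacy encoding (e.g. "gb18030", "shift_jis", "big5").
    text = raw.decode(source_encoding)
    # Normalise Windows line endings so blank lines are detected reliably.
    text = text.replace("\r\n", "\n")
    # Treat blank-line-separated blocks as paragraphs (an assumption).
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    # Escape &, < and > so the result stays well-formed XML.
    body = "\n".join(f"  <p>{escape(p)}</p>" for p in paragraphs)
    xml = f'<?xml version="1.0" encoding="UTF-8"?>\n<text>\n{body}\n</text>\n'
    return xml.encode("utf-8")
```

A call such as `to_simple_xml(open("corpus.txt", "rb").read(), "gb18030")` returns UTF-8 bytes that can be written to a new file. The source encoding has to be known in advance; the sketch does not try to detect it.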
Use the Index tool (Tools → Index Wizard in the Index Toolkit) to index your corpus.