This is a Windows program for generating and searching a KWIC concordance of a document ("KWIC" = "Keywords in Context"). A KWIC concordance is a list of the different words occurring in the document, with each instance of each word shown in context (that is, within a phrase). Word frequency is shown. Context size is user-definable, anything from 3 to 19 words long. The software acts on text files and on MS Word docx files, skipping over "stop" words. The concordance can be displayed alphabetically or by frequency, and can be written to a file.
WordSmith allows users to develop concordances, find keywords, and develop word lists from plain text files.
"TextSTAT is a simple programme for the analysis of texts. It reads plain text files (in different encodings) and HTML files (directly from the internet) and it produces word frequency lists and concordances from these files. This version includes a web-spider which reads as many pages as you want from a particular website and puts them in a TextSTAT-corpus. The new news-reader, too, puts news messages in a TextSTAT-readable corpus file.
TextSTAT reads MS Word and OpenOffice files. No conversion needed, just add the files to your corpus...
A software tool for performing concordance – the analysis of a set of words within its immediate context - on a body of text. The tool performs full concordance, reading and analysing each and every word in a text. It was initially written for the analysis of English texts, but has since been extended to cater for other Western languages. Limited support is also provided for text in East Asian scripts, such as Chinese and Korean.
AntConc is free concordance software. It is multi-platform and easy to deploy and use.
AntConc is part of a suite of related tools for text processing and analysis, including applications for parallel corpus analysis, word profiling, PDF to text conversion, text structure analysis, detecting and converting character encodings, Japanese and Chinese segmenter and tokenizer, wordclass tagger, and spelling variant anaysis. The developer is currently drafting a more explicit licence for the use of the software.
Wmatrix is web-based software for corpus analysis and comparison. It provides a web interface to the USAS and CLAWS corpus annotation tools, and standard corpus linguistic methodologies such as frequency lists and concordances. It also extends the keywords method to key grammatical categories and key semantic domains.
The Field Linguist's Toolbox is Windows software for maintaining lexical data, and for parsing and interlinearizing text.