web spider

"TextSTAT is a simple programme for the analysis of texts. It reads plain text files (in different encodings) and HTML files (directly from the internet) and it produces word frequency lists and concordances from these files. This version includes a web-spider which reads as many pages as you want from a particular website and puts them in a TextSTAT-corpus. The new news-reader, too, puts news messages in a TextSTAT-readable corpus file.
TextSTAT reads MS Word and OpenOffice files. No conversion needed, just add the files to your corpus..."

Last updated: 15 Jan 2013

ScrapBook is a Firefox extension for saving web pages and managing the resulting collections. Saved pages can be searched, filtered, and edited.

Last updated: 15 Jan 2013