What kind of data should the tool work with?

Textable is an open source program for text analysis. It offers a set of basic text-analytic components (e.g. import text from files, segment into words, measure segment diversity, etc.), which the user combines using a visual interface to build custom analytic workflows.

Code license: GNU GPL v3
Last updated: 20 Aug 2017

Audacity is a free, easy-to-use and multilingual audio editor and recorder. Basic features, as listed on their website, include:

  • Record live audio.
  • Record computer playback on any Windows Vista or later machine.
  • Convert tapes and records into digital recordings or CDs.
  • Edit WAV, AIFF, FLAC, MP2, MP3 or Ogg Vorbis sound files.
  • Cut, copy, splice or mix sounds together.
  • Change the speed or pitch of a recording.
Code license: Open source, GNU GPL
Last updated: 24 Feb 2016

SentimentBuilder is an online tool that performs text analytics on emails, reviews, feedback, chat data or any unstructured texts via Natural Language Processing. It's the only tool where you can upload a file for processing and then visually view the results in a Sankey Flow Report to quickly identify trends, issues and strengths and then customize each view, save and share! Export any result for your own offline analysis! Try the Always Free version today and upload your own data or try one of our sample files.

Code license: Closed source
Last updated: 31 Mar 2018

DEVONthink is a database that helps users organize, manage and collaborate on digital files, including Office files, links, e-mails, research data and PDFs.

Code license: Closed source
Last updated: 10 Aug 2015

iPhoto is a digital photograph manipulation software application developed by Apple Inc. It has been included with every Macintosh personal computer since 2002, originally as part of the iLife suite of digital media management applications. iPhoto can import, organize, edit, print and share digital photos.

Code license: Closed source
Last updated: 19 May 2015

Web-based image editing with special effects and image manipulation features that can import content from PhotoBucket, Facebook, MySpace, Picasa, Flickr, Phanfare, or Smugmug. A free account is needed to save images.

FotoFlexer also offers an Open API to provide a simple interface for external websites to use the editing features - no API keys, REST or SOAP.

Code license: Closed source
Last updated: 18 May 2015

Picnik closed on April 19, 2013.

Picnik was a simple web-based image editor that could import photos from Facebook, Myspace, Picasa Web Albums, Flickr, Yahoo Image search, Google Plus, and also has upload capabilities. It was acquired by Google in 2010.

Code license: Closed source
Last updated: 18 May 2015

Corel Painter is a raster-based digital art application designed to simulate the appearance and behaviour of traditional media associated with drawing, painting, and printmaking. It is meant to be used with a graphics tablet or mouse.

The software offers a range of traditional artists' tools and materials (watercolour, oil paint, pastels, air brush, felt pens, chalk, charcoal & coloured pencil) and non-traditional effects (Image Hose, pattern pens, F/X, distortion, etc). Corel offers an online webinar library, tutorials, and other learning tools through their website.

Code license: Closed source
Last updated: 18 May 2015

Clipping Magic lets you easily edit out the background of photos. Mark up the foreground and background in the editor, use the scalpel tool for precision selections, then download the new image. You can upload and edit as many images as you like but you must have a paid subscription to download ($4 - $15 per month depending on the number of download credits needed).

Code license: Closed source
Last updated: 17 May 2015

ANTHROPAC is a menu-driven DOS program for collecting and analyzing data on cultural domains. The program assists with the collection and analysis of structured qualitative and quantitative data, and provides analytical and multivariate tools.

Last updated: 2 May 2015

OpenRefine (formerly Google Refine) is a tool for cleaning messy data (e.g. fixing inconsistencies), transforming it between different formats, and exploring data.

Code license: Open source, BSD
Last updated: 22 Mar 2015

Praat is software for the phonetic analysis of speech, including support for articulatory and speech synthesis.

Code license: GNU GPL v2
Last updated: 19 Feb 2015

VARD 2 is an interactive piece of software produced in Java designed to assist users of historical corpora in dealing with spelling variation, particularly in Early Modern English texts. The tool is intended to be a pre-processor to other corpus linguistic methods such as keyword analysis, collocations and annotation (e.g. POS and semantic tagging), the aim being to improve the accuracy of these tools

Last updated: 19 Feb 2015

Snipshot has been acquired by Ansa instant messaging service.

A browser bookmarklet that-- when used-- allows a user to select any of the images on the page to edit (basic color enhancement, cropping, etc.). Allows users to save the resulting image in a variety of formats, or email the image.

Last updated: 29 Dec 2014

Photoshop Express allows simple web-based image editing and cloud storage (2 GB free via Adobe Revel), as well as video storage and streaming, slideshow templates, and a photo gallery. Features include online galleries and slideshows, exporting and searching images, and privacy settings. Android, Windows and iOS (including iPad) apps are available.

Code license: Closed source
Last updated: 29 Dec 2014

Navicat for MySQL is an interface for working with MySQL databases, including importing data from CSV or Excel, exporting, reporting, querying, and for developing scripts etc, and general database exploration

Navicat for MySQL can also be used with MariaDB databases

Last updated: 29 Dec 2014

A software suite for displaying, converting and editing still images stored in a raster image format. Image manipulation may be performed via the command line, API libraries, or through a simply graphical user interface.


  • Software control through API and command line
  • Format conversion
  • Image transformation
  • Transparency support
  • Format identification
  • Text to image conversion
  • Read, process, or write mega-, giga-, or tera-pixel image sizes
  • Distributed pixel cache
  • Perceptual hash
Last updated: 29 Dec 2014

Insync extends Google Drive's web functionality to your desktop by integrating with Windows, Mac and Linux platforms. Insync allows for built-in sharing without a browser, multiple account support, on-demand shared file syncing, desktop notifications and more.

Code license: Closed source
Last updated: 29 Dec 2014

Nomenklatura is a reference data recon server. It is a service that allows users to define and manage manage lists of canonical entities (e.g. person or organization names) and aliases that connect to one of the canonical entities. This helps to clean up messy data in which a single entity may be referred to by many names.It includes a user interface, an API, and a reconciliation endpoint for OpenRefine for matching data from data sets with the canonical entries.

Code license: Open source
Last updated: 29 Dec 2014
Subscribe to Cleanup