What kind of data should the tool work with?

Recogito is an online platform for collaborative document annotation.

Recogito provides a personal workspace where you can upload, collect and organize your source materials - texts and images - and collaborate on their annotation and interpretation. Recogito makes it easier to make your work visible on the Web and to expose the results of your research as Open Data.

Code license: Open source, Apache License
Last updated: 21 Dec 2016

WARCreate is a Google Chrome extension that allows a user to create a Web ARChive (WARC) file from any browseable webpage. The resulting files can then be used with other tools like the Internet Archive's open source Wayback Machine. The tool is an evolving product that aims to become a personal web archiving solution for those who wish to securely archive their metadata in a standardized way.

Last updated: 30 Aug 2016

This is an open source platform for georectifying scanned images of maps, so that they can be displayed in web maps or used in GIS applications. It could be particularly useful for creating map overlays for Google Earth or similar applications, without using specialized desktop software. Map images must first be uploaded and made publicly available; they are then aligned with OpenStreetMap reference data and warped into georeferenced images. Georectified images are downloadable as GeoTIFF, PNG, or KML files, along with map tiles or WMS capabilities.

Code license: MIT License
Last updated: 7 Jun 2016

GapVis is an interface for exploring and reading texts that reference ancient places. It includes maps and data visualizations that show which locations are referred to at different points in the narrative, and it allows you to dig into the details to learn more.

Code license: Open source
Last updated: 7 Jun 2016

TAToo is an embeddable Flash widget that displays TAPOR analytics for the page on which it resides.

Code license: Apache License
Last updated: 23 Feb 2016

RefSeek is still in public beta, but I’ve had a lot of luck with this search engine. RefSeek gives students access to more than 1 billion documents, web pages, books, journals, and newspapers. What I really appreciate about the site is how clean the interface is—no ads, sponsored links or commercial results appear when you use RefSeek.

Code license: Creative Commons
Last updated: 9 Feb 2016

DH Press (originally called diPH) is a toolkit conceived as an easy-to-use WordPress plugin that allows potentially every kind of user to visualise and mash up historic and geographic information, documents and various types of multimedia content to develop digital humanities projects.

Code license: Open source
Last updated: 10 Aug 2015

Kartograph is a pair of free and open source libraries for representing data with a spatial component on webpages or in print. The first library is a Python library that builds lightweight vector graphic maps from either shapefiles or PostGIS tables. These graphics files can be styled either with a cascading style sheet during creation or later with the second library, Kartograph.js. The vector files can also be edited and enhanced in a vector graphics program like Adobe Illustrator.

Code license: GNU Affero GPL v.3, GNU LGPL
Last updated: 2 Aug 2015

Zoom feature for high resolution images to be embedded in websites.
Hosted website for online exhibitions of uploaded images, text, video and sound.
Metadata management for digitised items.
Clear interface allows for searches and comparisons.
Free to use but one must apply for an invitation.

Code license: Closed source
Last updated: 16 Jul 2015

Editors' Notes is an open-source, web-based tool for recording, organizing, preserving, and opening access to research notes, built with the needs of documentary editing projects, archives, and library special collections in mind.

Code license: Open source
Last updated: 8 Jul 2015

Snapzen is a browser tool that is used to collaborate with others about the information on any web page - right from your browser.

Discuss information on web pages with your colleagues, friends or family. It is easy to collaborate with others because they see exactly what you see on the web pages.

If you still use copy and paste, screenshot tools, email or chat to discuss web pages, Snapzen will show you a better way.

Code license: Closed source
Last updated: 15 Jun 2015

Tesla is a virtual research environment for text engineering - a framework you can use to create experiments in corpus linguistics, and to develop new algorithms for natural language processing. Tesla is a client-server application, which can be used by individual researchers as well as by workgroups. The screenshot below shows the experiment editor of Tesla's Client application.

Last updated: 24 May 2015

FromThePage is free software that allows volunteers to transcribe handwritten documents on-line. It's easy to index and annotate subjects within a text using a simple, wiki-like mark-up. Users can discuss difficult writing or obscure words within a page to refine their transcription. The resulting text is hosted on the web, making documents easy to read and search.

Code license: Open source, GNU Affero GPL
Last updated: 2 May 2015

Exhibit 3.0 is a publishing framework for large scale data-rich interactive Web pages. The beta version is scalable up to 100k items.

Last updated: 29 Dec 2014

Korbo is a powerful aggregation platform for gathering Linked Data objects relevant to your area of research into single workspaces or “baskets”.

Korbo is targeted primarily at developers who want to build applications on top of its API and make full use of the linked cultural data from sources such as Europeana, Freebase and DBpedia.

Korbo is currently in the early stages of development, but you can already try out a demo version of the platform.

Code license: Open source, GNU GPL
Last updated: 29 Dec 2014

Kaleidoscope is one of the world's best tools for spotting differences in images and text, and now it supports merging of files and folders, too. Kaleidoscope integrates directly with Git, Subversion, Mercurial, and Bazaar to fit perfectly in your workflow.

Last updated: 29 Dec 2014

Textexture is a tool for visualizing any text as a network. The resulting graph can be used to get a quick visual summary of the text, read the most relevant excerpts (by clicking on the nodes), and find similar texts.
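
To illustrate the general idea of treating a text as a network, here is a minimal Python sketch. It is not Textexture's own algorithm, which the description above does not specify. It assumes a simple scheme in which words are nodes and two words are linked when they occur within a small sliding window; the function name `cooccurrence_edges` and the `window` parameter are hypothetical.

```python
import re
from collections import Counter


def cooccurrence_edges(text, window=2):
    """Count undirected word pairs that occur within `window` tokens of each other.

    Assumption: tokens are runs of ASCII letters, lowercased. A real tool
    would likely also remove stop words and normalise word forms.
    """
    words = re.findall(r"[a-z]+", text.lower())
    edges = Counter()
    for i, word in enumerate(words):
        # Look ahead at the next (window - 1) tokens.
        for other in words[i + 1:i + window]:
            if other != word:
                # Sort the pair so (a, b) and (b, a) count as the same edge.
                edges[tuple(sorted((word, other)))] += 1
    return edges


if __name__ == "__main__":
    for pair, weight in cooccurrence_edges("the cat saw the cat").items():
        print(pair, weight)
```

The edge weights could then be passed to any graph layout or analysis tool; heavily weighted edges mark word pairs that recur together in the text.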

Last updated: 29 Dec 2014

TVE is an interactive Java tool for exploring the effect of window size on three common linguistic measures: type-token ratio, proportion of hapax legomena, and average word length. In addition, TVE can cluster the text fragments according to a user-given set of words by applying principal component analysis (PCA).
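
The three measures named above can be sketched in a few lines of Python. This is an illustrative sketch, not TVE's implementation (TVE is a Java tool). It assumes non-overlapping windows of a fixed number of tokens, a simple letters-and-apostrophes tokenizer, and a hapax proportion defined as hapax legomena divided by tokens in the window; the function names are hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep runs of letters and apostrophes (an assumption)."""
    return re.findall(r"[a-z']+", text.lower())


def window_measures(tokens):
    """Compute the three measures for one window of tokens."""
    counts = Counter(tokens)
    n = len(tokens)
    hapaxes = sum(1 for c in counts.values() if c == 1)
    return {
        "type_token_ratio": len(counts) / n,   # distinct words / all words
        "hapax_proportion": hapaxes / n,       # words occurring once / all words
        "avg_word_length": sum(len(t) for t in tokens) / n,
    }


def measures_by_window(text, window_size):
    """Split the text into non-overlapping windows and measure each one.

    A trailing fragment shorter than window_size is dropped.
    """
    tokens = tokenize(text)
    return [
        window_measures(tokens[i:i + window_size])
        for i in range(0, len(tokens) - window_size + 1, window_size)
    ]


if __name__ == "__main__":
    sample = "the cat sat on the mat the end"
    for size in (2, 4, 8):
        print(size, measures_by_window(sample, size))
```

Running the same text with several window sizes shows why window size matters: the type-token ratio tends to fall as the window grows, because repeated words become more likely.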

Last updated: 29 Dec 2014

From the website: NodeXL is a free, open-source template for Microsoft® Excel® 2007 and 2010 that makes it easy to explore network graphs. With NodeXL, you can enter a network edge list in a worksheet, click a button and see your graph, all in the familiar environment of the Excel window.
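
For readers unfamiliar with the term, an edge list is simply a table with one row per connection. The Python sketch below is not NodeXL code and does not use its API; it only illustrates, under the assumption of an undirected network and with hypothetical names, how an edge list determines a graph and a basic metric such as degree.

```python
from collections import defaultdict


def build_adjacency(edges):
    """Turn (source, target) rows into an undirected adjacency mapping."""
    adjacency = defaultdict(set)
    for source, target in edges:
        adjacency[source].add(target)
        adjacency[target].add(source)
    return adjacency


def degrees(adjacency):
    """Number of distinct neighbours for each vertex."""
    return {vertex: len(neighbours) for vertex, neighbours in adjacency.items()}


if __name__ == "__main__":
    # Each tuple corresponds to one row of a two-column edge list worksheet.
    edge_list = [("Ann", "Bob"), ("Bob", "Cy"), ("Ann", "Cy"), ("Cy", "Dee")]
    print(degrees(build_adjacency(edge_list)))
```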

Last updated: 29 Dec 2014

Ptolemaic is a computer application for music visualization and analysis written in the Java programming language. The software is designed to aid in the analysis of all types of Western music using established analytical techniques, including tonal functional analysis (Harrison 1994), pitch-class set analysis (Forte 1973), hierarchical linear analysis (Schenker 1935, Jones 2002), tonal pitch-space analysis on the Tonnetz (Riemann 1915), and transformation analysis (Lewin 1987).
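
As a small illustration of one of the techniques listed, the sketch below computes the interval-class vector used in pitch-class set analysis. It is written in Python for brevity and is not taken from Ptolemaic, which is a Java application; the function name is hypothetical. It assumes pitch classes are given as integers with C = 0.

```python
from itertools import combinations


def interval_class_vector(pitch_classes):
    """Count interval classes 1 to 6 between every pair of distinct pitch classes."""
    pcs = sorted({p % 12 for p in pitch_classes})  # reduce to unique pitch classes
    vector = [0] * 6
    for a, b in combinations(pcs, 2):
        distance = (b - a) % 12
        interval_class = min(distance, 12 - distance)  # fold 7..11 onto 5..1
        vector[interval_class - 1] += 1
    return vector


if __name__ == "__main__":
    print(interval_class_vector([0, 4, 7]))     # C major triad
    print(interval_class_vector([0, 3, 6, 9]))  # diminished seventh chord
```

For the C major triad this yields one minor third, one major third and one perfect fourth/fifth, i.e. the vector 0 0 1 1 1 0.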

Code license: Open source, GNU GPL
Last updated: 29 Dec 2014

Channelkit lets you save, organize and share links to websites, articles, places, videos and anything else you might need on the web and discover content curated by others.

Last updated: 29 Dec 2014