data collection

What kind of data should the tool work with?

HEURIST ( is an extremely flexible, end-user oriented, web-based data management system designed specifically for Humanities data. Developed since 2005, it has been in active use across many projects since 2009. It is available both as a free web service for researchers (hosted at the University of Sydney Data Centre) or for installation on a physical or virtual server (Open Source on gitHub).

Researchers can design, create, manage, analyse, visualise and publish their own richly-structured database(s) through a simple web interface, without the need for a programmer(s). Quite complex databases can be built in a few hours by borrowing structures and vocabularies published by other users. Databases can be designed and built incrementally, as existing data are not affected by changes in structure. Databases created by Heurist are stored in MySQL with a repeatable structure facilitating independant access by other software.

Advanced features include record linking, graph structure, drill-down facet searches, rule-based queries, custom reports, linked map-timelines, network visualisation, normalised spreadsheet import, crosstabulation, XML feeds, XSLT transforms. The team provides initial email and skype assistance for project setup at no cost, and special customisations at modest cost.

Code license: Open source, GNU GPL, GNU GPL v3
Last updated: 16 May 2018

CartoDB is a cloud based mapping, analysis and visualization engine that lets users build spatial applications for both mobile and the web. Users input tabular data and then construct an interactive visualisation through the web interface. It provides automatic georeferencing functionality and provides APIs for mobile data collection and dissemination.

Use is free for up to five tables; after that, there are monthly pricing plans.
Development was funded through EU and Spanish research programmes.

Code license: Open source
Last updated: 7 Jun 2016

Online web survey tool written in PHP using MySQL, MSSQL or Postgres database. Multilingual site with demo, feature list, documentation [Open Source, GPL3]

Code license: Open source, GNU GPL, GNU GPL v3
Last updated: 7 Aug 2015

STACK is an extensible social media research toolkit designed to collect, process, and store data from online social networks. The toolkit is an ongoing project via the Syracuse University iSchool, and currently supports the Twitter Streaming API. Collecting from Facebook public pages and Twitter search API are under development. The toolkit architecture is modular and supports extending. Basic Linux / Mac command line skills needed.

To learn more:

Code license: Open source
Last updated: 21 Apr 2015

MLA, APA, Chicago / Turabian and most-common Bluebook forms as an integrated citing and note-taking platform for individual or group projects. Prompts for analysis of source types and is unique in offering teaching support and personal help on any citation. Instructor / librarian view allows teacher to comment on work-in-progress providing just-in-time feedback in-context. Archives copies of web pages and pdfs which can be annotated. Dashboard provides long-term access to a portfolio of work.

Code license: Closed source
Last updated: 29 Dec 2014

LimeService is basically the hosted version of the GNU licensed LimeSurvey. It is a survey service-platform to prepare, run and evaluate on-line surveys. Besides basic free usage you are always getting the full feature set with no monthly fees or subscription plans.

I've used it before and found it to pretty robust.

Last updated: 29 Dec 2014
Subscribe to data collection