Discovering

There is an unlimited number of videos, PDFs, etc. that can be used for education, training, instruction, or professional development.

Finding them, curating them into playlists, integrating them into existing workflows, and sharing them with others is time-consuming, inefficient, and often limited by ‘vendor lock-in.’

Media Share is a productivity tool that saves time, requires no training to use, and does not limit how or where content can be used.

Code license: Closed source
Last updated: 25 Oct 2017

HEURIST (http://HeuristNetwork.org) is an extremely flexible, end-user oriented, web-based data management system designed specifically for Humanities data. Developed since 2005, it has been in active use across many projects since 2009. It is available both as a free web service for researchers (hosted at the University of Sydney Data Centre) and for installation on a physical or virtual server (open source on GitHub).

Researchers can design, create, manage, analyse, visualise and publish their own richly-structured databases through a simple web interface, without the need for a programmer. Quite complex databases can be built in a few hours by borrowing structures and vocabularies published by other users. Databases can be designed and built incrementally, as existing data are not affected by changes in structure. Databases created by Heurist are stored in MySQL with a repeatable structure, facilitating independent access by other software.

Advanced features include record linking, graph structure, drill-down facet searches, rule-based queries, custom reports, linked map-timelines, network visualisation, normalised spreadsheet import, crosstabulation, XML feeds, and XSLT transforms. The team provides initial email and Skype assistance for project setup at no cost, and special customisations at modest cost.

Code license: Open source, GNU GPL, GNU GPL v3
Last updated: 13 Oct 2017

Pastpin is a cultural search engine that aims to showcase the 7M historical images available from Flickr Commons.

Users are prompted to add new metadata (geocodes, temporal data) to existing images that they may recognise based on their current location. The goal of the project is to crowdsource spatial and temporal metadata for these objects.

Visitors to the site can search or browse by using free text, or by going directly to the map and clicking on a thumbnail to view images directly.

Last updated: 28 Jul 2017

Gephi is graphing software that provides a way to explore data through visualization and network analysis.

Code license: Open source, GNU GPL v3
Last updated: 15 Feb 2017

Yahoo Pipes allows users to combine, filter, translate, and geocode data from RSS feeds, JSON, KML, or other similar formats, and power widgets/badges using that data.

Last updated: 18 Jan 2017

RedZ is a 3D search engine that displays its results in the form of web page snapshots. The user may navigate through these relatively large snapshots, which are displayed in a Cover Flow layout.

Last updated: 16 Jun 2016

A global geographical database that may be used to identify and tag all references to location. The database contains over 8 million entries, each of which possesses a geographic name (in various languages), latitude, longitude, elevation, population, administrative subdivision and postal codes and information on unique features.

Last updated: 7 Jun 2016

Visualizes a series of events across both time and space. Allows researchers to create an interactive timeline and map that are linked together. Users of the timeline can press "play" to watch the timeline scroll forward and the map zoom from place to place as they highlight each event (and the researcher's attached images and text) in turn. Users can also pause the progress of history, move forward or back at their own pace, and zoom in or out of either the map or timeline to examine areas of interest.

Compare to: StoryMap JS, MapStory, Odyssey.js

Code license: Closed source
Last updated: 7 Sep 2016

This software is for indexing and searching Word docx files, HTML files, XML files, text files, or files of a similar kind in any folder on a hard disk, CD-ROM, etc.

This program does not simply create a list of files or simply search for character strings in a file. It indexes every word in every file, and any file containing a particular set of words (or any of those words, or an exact phrase) can be found by searching for those words. Indexed files can be edited lightly without needing to re-index.
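The difference between scanning files for strings and indexing every word can be illustrated with a small inverted index. The following Python sketch is not the product's implementation; the function names and the sample documents are hypothetical, and it assumes plain text has already been extracted from each file. It records the position of every word so that "all words", "any word", and "exact phrase" searches can be answered from the index alone.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Map each word to {document name: [word positions]}."""
    index = defaultdict(dict)
    for name, text in docs.items():
        for pos, word in enumerate(tokenize(text)):
            index[word].setdefault(name, []).append(pos)
    return index


def search_all(index, query):
    """Documents containing every word of the query."""
    words = tokenize(query)
    if not words:
        return set()
    return set.intersection(*(set(index.get(w, {})) for w in words))


def search_any(index, query):
    """Documents containing at least one word of the query."""
    found = set()
    for word in tokenize(query):
        found |= set(index.get(word, {}))
    return found


def search_phrase(index, query):
    """Documents containing the query words consecutively, in order."""
    words = tokenize(query)
    hits = set()
    for name in search_all(index, query):
        for start in index[words[0]][name]:
            if all(start + i in index[w][name] for i, w in enumerate(words)):
                hits.add(name)
                break
    return hits


# Hypothetical sample documents (text already extracted from the files).
docs = {
    "notes.txt": "The quick brown fox jumps over the lazy dog",
    "report.html": "A lazy afternoon: the brown dog sleeps",
    "draft.docx": "Quick notes on fox behaviour",
}
index = build_index(docs)
```

With this index, `search_all(index, "brown dog")` matches both `notes.txt` and `report.html`, while `search_phrase(index, "brown dog")` matches only `report.html`, where the two words are adjacent.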

Code license: Closed source
Last updated: 19 Mar 2016

Overview is a tool for analyzing large sets of documents. It includes a sophisticated search engine, word clouds, entity detection, and topic-based document clustering. If that’s not good enough, you can write your own plugins using the API. It is open source and you can run it on your own computer.

It was originally designed for investigative journalists, but it’s now also used for qualitative research, social media conversation analysis, legal document review, digital humanities, and more.

Overview is built to do several types of tasks:

Code license: Open source
Last updated: 9 Mar 2016

Sigma is a JavaScript library that allows for the deployment of a graph file. It makes it easy to publish networks on Web pages, and allows developers to integrate network exploration in rich Web applications.
It is highly interactive and allows a researcher to extend their work from a dedicated graph analysis package such as Gephi and share it via the web to allow for communication of research outputs, while permitting viewers to explore and discover their own findings from the raw graph network.

Code license: MIT License
Last updated: 14 Nov 2015

CulturalAnalytics is an R package containing functions for statistical analysis and plotting of image properties, including statistics such as the standard deviation and mean in the RGB and HSV color spaces, image entropy and histograms in greyscale (intensity) and color, and for plotting color clouds and image scatter charts.
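The statistics named above can be sketched in a few lines. The example below is written in Python rather than R and does not use or reproduce the CulturalAnalytics API; the function names are hypothetical. It assumes an image is given as a flat list of `(r, g, b)` tuples with values from 0 to 255, and it uses one common luminance weighting (0.299, 0.587, 0.114) for the greyscale conversion, which may differ from the package's own choice.

```python
import colorsys
import math
from collections import Counter
from statistics import mean, pstdev


def channel_stats(pixels):
    """Return {channel: (mean, standard deviation)} for r, g, b, h, s, v.

    All channels are scaled to the range 0..1.
    """
    rgb = [(r / 255, g / 255, b / 255) for r, g, b in pixels]
    hsv = [colorsys.rgb_to_hsv(*p) for p in rgb]
    stats = {}
    for names, values in (("rgb", rgb), ("hsv", hsv)):
        for i, channel in enumerate(names):
            column = [v[i] for v in values]
            stats[channel] = (mean(column), pstdev(column))
    return stats


def grey_entropy(pixels):
    """Shannon entropy (in bits) of the greyscale intensity histogram."""
    # Assumed luminance weights; other conversions are possible.
    greys = [round(0.299 * r + 0.587 * g + 0.114 * b) for r, g, b in pixels]
    counts = Counter(greys)
    total = len(greys)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# A two-pixel "image": one black pixel and one white pixel.
pixels = [(0, 0, 0), (255, 255, 255)]
stats = channel_stats(pixels)
entropy = grey_entropy(pixels)
```

For the two-pixel example, each RGB channel has a mean of 0.5 and the greyscale histogram has two equally likely values, so the entropy is one bit.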

Code license: Open source, GNU GPL
Last updated: 12 Nov 2015

SentimentBuilder is an online tool that performs text analytics on emails, reviews, feedback, chat data or any unstructured texts via Natural Language Processing. It's the only tool where you can upload a file for processing and then visually view the results in a Sankey Flow Report to quickly identify trends, issues and strengths and then customize each view, save and share! Export any result for your own offline analysis! Try the Always Free version today and upload your own data or try one of our sample files.

Code license: Closed source
Last updated: 4 Sep 2015

nodegoat is a web-based data management, analysis & visualisation environment.

Using nodegoat, you can define, create, update, query, and manage any number of datasets through a graphical user interface. Your custom data model autoconfigures the backbone of nodegoat's core functionalities.

Code license: Closed source
Last updated: 17 Aug 2015

PolyMeta is a metasearch engine that displays the search results in the form of clusters and images. The results are sorted according to source and category.

Last updated: 6 Aug 2015

iBoogie is a clustering search engine that puts documents with similar content or with related topics into the same group. Each group is assigned a label based on the content of the documents. The results are presented to the user in a hierarchy of topics (clusters) for browsing.

Last updated: 3 Aug 2015

This product can filter or format text-based content. It also includes a document or link organiser and search capabilities, and might more correctly be termed a text management system. Given the large number of documents stored on your computer and the online links that you might use, this is a helpful application that allows you to navigate that environment more easily. Although the feature set is now well developed, an inexperienced user should still be able to use it relatively easily; it is not intended only for expert users.

Code license: GNU GPL v3
Last updated: 15 Jun 2015

The Open Science Framework (OSF) is a free, open source tool designed to help researchers manage the entire research workflow: planning, execution, reporting, archiving and discovery. It is part collaboration software and part version control system. The OSF can be used to manage individual projects or large collaborative ones. Privacy and sharing settings allow for fine-grained control over access to files and materials stored on the platform - share privately with collaborators or publicly with the community at large.

Code license: Apache License
Last updated: 14 Jun 2015

A text-mining system for scientific literature. Textpresso's two major elements are (1) access to full text, so that entire articles can be searched, and (2) the introduction of categories of biological concepts and classes that relate to objects (e.g., association, regulation) or describe one (e.g., methods).

Code license: Open source
Last updated: 28 May 2015

Users can upload photos and organize them into albums, and they can search photos that have been posted in public albums and filter the results by license (any Creative Commons license, licenses that allow commercial use, licenses that allow remixing).

Last updated: 18 May 2015

Lynks provides an easy-to-use, in-browser tool that helps you create your own networks. Lynks is an initiative of the Centre for Innovation, part of Leiden University (Campus The Hague). The software was developed in 2014 in co-creation, with expertise from Dr. Eelke Heemskerk of the University of Amsterdam. Development was supported by financial contributions from the European Union Fund for Regional Development (EFRO) and the Municipality of The Hague.

Code license: Closed source
Last updated: 12 May 2015

MDID is software for teaching and learning with digital images, with tools for discovering, aggregating, and presenting digital media in a variety of learning spaces.

Code license: Open source, GNU GPL
Last updated: 8 May 2015

Text analysis software aimed at beginners to qualitative research, and using live visualizations as the interface. Quirkos supports standard code-and-retrieve operations, searches and queries on the data, and can visualize connections between topics and themes.

Find more information at http://www.quirkos.com/qualitative-data-analysis-software.html

Code license: Closed source
Last updated: 3 May 2015

Academia.edu is a social platform that allows academics to share research papers, gray literature, reviews and other scholarly materials. The site provides user statistics on the number and geographic origin of profile and document views. Academic affiliation is displayed in a tree-like format, grouped by universities and departments.

Code license: Closed source
Last updated: 21 Apr 2015

Content curation and topic discovery website based primarily on publishers the user follows through social media.

Code license: Open source
Last updated: 30 Jan 2015

A collection of free-to-download 3D models that have been designed using Google SketchUp.

Last updated: 29 Dec 2014

LibLime Koha is a web-based, open source integrated library system (ILS) that has also been used for virtual library systems (e.g. recreating historic libraries). LibLime Koha offers libraries circulation policies, patron management modules, parent-child relationship for patron records, club and service management features, in-depth "holds" support, single click batch import "undo" option, EzProxy compatibility, self-checkout interface and more.

Code license: Open source, GNU GPL
Last updated: 29 Dec 2014

MediaWiki is a free, open source wiki package written in PHP, originally for use on Wikipedia and other Wikimedia Foundation projects. It is designed to be run on a large server farm for a website that gets millions of hits per day.

Code license: Open source, GNU GPL, GNU GPL v2
Last updated: 29 Dec 2014

Casual allows you to search for a concept or common word and produces a unique cloud of concepts. Results are randomly displayed as definitions, related tags, and pictures.

Code license: Open source
Last updated: 29 Dec 2014

Cluuz is a search engine that shows not only links to related pages, but also entities (people, companies, organizations) and images that are extracted from within the search results. In addition to the results, Cluuz displays a tag cloud of the most relevant entities extracted from returned results, as well as a semantic graph view of a cluster of terms.

Code license: Closed source
Last updated: 29 Dec 2014

Archive-It is a subscription web archiving service from the Internet Archive that helps organizations harvest, build, and preserve collections of digital content. Through our user-friendly web application, Archive-It partners can collect, catalog, and manage their collections of archived content, with 24/7 access and full-text search available for their own use as well as that of their patrons. Content is hosted and stored at the Internet Archive data centers.

Last updated: 29 Dec 2014

Searches Flickr for photos and sorts the results by license type. Contains commercially licensed as well as Creative Commons licensed photos.

Code license: Open source
Last updated: 29 Dec 2014