Data

What kind of data should the tool work with?

HEURIST (http://HeuristNetwork.org) is an extremely flexible, end-user oriented, web-based data management system designed specifically for Humanities data. Developed since 2005, it has been in active use across many projects since 2009. It is available both as a free web service for researchers (hosted at the University of Sydney Data Centre) and for installation on a physical or virtual server (Open Source on GitHub).

Researchers can design, create, manage, analyse, visualise and publish their own richly-structured databases through a simple web interface, without the need for a programmer. Quite complex databases can be built in a few hours by borrowing structures and vocabularies published by other users. Databases can be designed and built incrementally, as existing data are not affected by changes in structure. Databases created by Heurist are stored in MySQL with a repeatable structure facilitating independent access by other software.

Advanced features include record linking, graph structure, drill-down facet searches, rule-based queries, custom reports, linked map-timelines, network visualisation, normalised spreadsheet import, crosstabulation, XML feeds and XSLT transforms. The team provides initial email and Skype assistance for project setup at no cost, and special customisations at modest cost.

Code license: Open source, GNU GPL, GNU GPL v3
Last updated: 13 Oct 2017

TravelTime Maps make it possible to create isochrones (travel time areas) on a map by selecting a start point, the maximum travel time and preferred transport mode. For example, users can visualise all locations within 15 minutes' drive time from their current location. It also supports walking, cycling and public transport times. It is also possible to download a CSV of all postcodes that fall within this area on request, or to export to KML.

Code license: Closed source
Last updated: 17 May 2017

Topincs is online database software. Given a data model, it allows

Last updated: 28 Mar 2017

Yahoo Pipes allows users to combine, filter, translate, and geocode data from RSS feeds, JSON, KML, or other similar formats, and power widgets/badges using that data.

Last updated: 18 Jan 2017

Analyse-it Method Validation Edition provides the statistical analysis you need to validate and verify analytical and diagnostic methods to meet the demands of regulatory compliance. With support for 11 CLSI protocols, it lets you:

• Validate or verify analytical performance characteristics (precision, trueness, linearity, interferences, detection capability) of a measurement procedure to ensure they meet requirements for intended use or manufacturer's claims.

Code license: Closed source
Last updated: 13 Dec 2016

Analyse-it Standard Edition provides you with the powerful statistical analysis and regression you’d expect from an expensive statistics package, but without the complexity or the cost.

• Describe your data with a wide range of statistics.

• Visualize distributions, see trends and patterns, and spot outliers with powerful charts from histograms to scatter plots.

Code license: Closed source
Last updated: 13 Dec 2016

Analyse-it Quality Control & Improvement Edition provides the statistical analysis you need to understand processes, bring them under control, and find improvements that better your product.

• Bring processes under statistical control so they are stable and predictable with powerful Shewhart, cumulative sum, and moving average control charts. Includes Xbar-R, Xbar-S, R, S, I-MR, time weighted CUSUM, UWMA, EMA plots, and WECO, Nelson, Montgomery or custom rulesets.
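
As a rough illustration of what an individuals (I-MR style) Shewhart chart computes, the sketch below derives control limits from the average moving range and flags points outside them. It is not Analyse-it's implementation; the function names and the sample measurements are hypothetical, and the constant 1.128 is the standard d2 factor for moving ranges of size 2.

```python
from statistics import mean

# Standard d2 constant for subgroups of size 2 (used with moving ranges).
D2_FOR_N2 = 1.128


def individuals_chart_limits(values):
    """Return (lower limit, center line, upper limit) for an individuals chart."""
    if len(values) < 2:
        raise ValueError("need at least two observations")
    # Moving range: absolute difference between consecutive observations.
    moving_ranges = [abs(b - a) for a, b in zip(values, values[1:])]
    center = mean(values)
    # Estimate the process standard deviation from the average moving range.
    sigma_hat = mean(moving_ranges) / D2_FOR_N2
    return center - 3 * sigma_hat, center, center + 3 * sigma_hat


def out_of_control(values):
    """Return the indexes of observations outside the 3-sigma limits."""
    lower, _, upper = individuals_chart_limits(values)
    return [i for i, v in enumerate(values) if v < lower or v > upper]


# Hypothetical measurements: a stable process with one unusual final reading.
measurements = [10, 11, 10, 11, 10, 11, 10, 20]
flagged = out_of_control(measurements)
print(flagged)
```

Rulesets such as WECO or Nelson add further tests (for example, runs on one side of the center line) on top of this basic limit check.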

Code license: Closed source
Last updated: 13 Dec 2016

A server-side software application that enables databases conformant to a relational data model to be created, managed and queried. Information, most commonly text, may be stored as one or more records contained within a table. The table may exist in isolation or have some relationship to other tables. Information may be manipulated using a set of T-SQL or ANSI SQL commands. Several editions of SQL Server 2014 are available, including SQL Server Business Intelligence, Developer, Enterprise, Express, Standard and Web.
Features:

  • Service Broker
Code license: Closed source
Last updated: 24 Jul 2016

A set of dataset management and statistical plugins for Microsoft Excel 2007, 2010 & 2013 including single and multiple linear regression, polynomial regression, and scatter plot with fit, fit confidence and prediction bands. The program can be used to generate visualizations and data reports. Requires Microsoft Excel.

Code license: Closed source
Last updated: 15 Jul 2016

Recollection is a platform developed by Zepheira for the Library of Congress National Digital Information Infrastructure and Preservation Program (NDIIPP), allowing users to create and share embeddable interfaces to digital cultural heritage collections. The Library of Congress released its latest version of Recollection as Viewshare, built to increase the ease of finding, using, and sharing the project's software.

Code license: Open source, MIT License
Last updated: 6 Jul 2016

Modest Maps is a small JavaScript library for interactive tile-based maps that can display maps from external sources (e.g. OpenStreetMap, but not commercial layers like Google Maps), enable panning and zooming, and track the position of points based on XML data. It is designed to be lightweight and offer a minimal set of features, and therefore has less functionality than Leaflet, a similar tool.
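
To give a sense of what "tile-based" means, the sketch below converts a latitude and longitude into tile column and row numbers using the widely used OpenStreetMap "slippy map" numbering scheme. This is an illustration of the general technique, not Modest Maps' own API; the function name is hypothetical.

```python
import math


def lonlat_to_tile(lon_deg, lat_deg, zoom):
    """Return the (x, y) tile indexes containing a point at a given zoom level.

    Uses the OpenStreetMap slippy-map convention: 2**zoom tiles per axis,
    x counted eastward from longitude -180, y counted southward from the top
    of the Web Mercator projection.
    """
    tiles_per_axis = 2 ** zoom
    lat_rad = math.radians(lat_deg)
    x = int((lon_deg + 180.0) / 360.0 * tiles_per_axis)
    y = int((1.0 - math.asinh(math.tan(lat_rad)) / math.pi) / 2.0 * tiles_per_axis)
    return x, y


# At zoom 0 the whole world is a single tile.
whole_world = lonlat_to_tile(0.0, 0.0, 0)
# At zoom 1 the point (0, 0) falls in the south-east quadrant tile.
quadrant = lonlat_to_tile(0.0, 0.0, 1)
print(whole_world, quadrant)
```

A map library then requests only the tiles that cover the current viewport, which is what keeps panning and zooming lightweight.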

Code license: Open source, BSD
Last updated: 7 Jun 2016

ERDAS IMAGINE is a suite of geospatial data authoring software. The suite contains a raster graphics editor and remote sensing application that performs advanced remote sensing analysis and spatial modelling to create new information. ERDAS IMAGINE can also visualize results in 2D, 3D, video, and on cartographic quality map compositions. It is primarily designed for geospatial raster data processing and the creation of digital images for mapping use in GIS or CAD software.

Features:

  • Image Analysis, Remote Sensing
Code license: Closed source
Last updated: 7 Jun 2016

ZeeMaps quickly maps point data on Google base maps in two ways:
1) The user uploads a .csv file of data points and their locations.
2) A group of users all add their own data location points to the map, on their own time from their own devices.

Each point can include text, video, image, or audio annotations.

Basic functionality is free; larger uploads and large numbers of maps require a paid subscription.

Code license: Closed source
Last updated: 7 Jun 2016

BatchGeo is an online service that maps address data as points. The cut-and-paste interface makes it easy to convert a spreadsheet of street addresses into a map that can be embedded or downloaded as a KML file. A limited number of addresses can be mapped for free; large files require a subscription.

Code license: Closed source
Last updated: 7 Jun 2016

Crowdmap allows the investigator to set up a Web map around a particular topic and invite multiple users (participants, research subjects, collaborators, multiple assistants) to contribute information to the map on their own time and from their own device.

For $10/month, users can buy fee-based services including private maps and custom branding.

Code license: GNU LGPL
Last updated: 7 Jun 2016

Weave (Web-based Analysis and Visualization Environment) is a visualization platform designed to enable visualization of any available data by anyone for any purpose. Weave is an application development platform supporting multiple levels of user proficiency — novice to advanced — as well as the ability to integrate, disseminate and visualize data at “nested” levels of geography.

Last updated: 7 Jun 2016

Viewshare is a free web application for creating interfaces and visualizations of cultural heritage collections. It can create interactive maps, timelines, facets, tag clouds, histograms, and image galleries. The intended users of Viewshare are individuals managing and creating access to digital collections of cultural heritage materials. Viewshare is offered as software as a service (SaaS); email ndiippaccess@loc.gov to request a free account.

Code license: Open source, MIT License
Last updated: 7 Jun 2016

The Science of Science (Sci2) Tool is a modular toolset supporting temporal, geospatial, topical, and network analysis and visualization of datasets at the micro (individual), meso (local), and macro (global) levels. Users of the tool can:

  • Access science datasets online or load their own
  • Perform different types of analysis with the most effective algorithms available
  • Use different visualizations to interactively explore and understand specific datasets
  • Share datasets and algorithms across scientific boundaries
Code license: Open source
Last updated: 1 Jun 2017

Visualizes a series of events across both time and space. Allows the researcher to create an interactive timeline and map that are linked together. Users of the timeline can press "play" to watch the timeline scroll forward and the map zoom from place to place as they highlight each event (and the researcher's attached images and text) in turn. Users can also pause the progress of history, move forward or back at their own pace, and zoom in or out of either the map or timeline to examine areas of interest.

Compare to: StoryMap JS, MapStory, Odyssey.js

Code license: Closed source
Last updated: 7 Sep 2016

The DataTank is an open source tool that publishes data stored in text-based files (e.g., CSV, XML, JSON) or in binary structures (e.g., SHP files, relational databases). The DataTank reads data from these structures and publishes them to the web using a URI as an identifier, providing these data in any format a user wants regardless of the original data structure. The DataTank requires a server with Apache2 or Nginx, mod_rewrite enabled, PHP 5.4 or higher, Git, and any database supported by Laravel 4.

Last updated: 7 Jun 2016

Geographically Encoded Objects for RSS feeds. GeoRSS was designed as a lightweight, community driven way to extend existing feeds with geographic information.

As RSS and Atom become more prevalent as a way to publish and share information, it becomes increasingly important that location is described in an interoperable manner so that applications can request, aggregate, share and map geographically tagged feeds.
RSS Map of Digital Humanities centers
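
As a small illustration of how an application might read geographically tagged feeds, the sketch below extracts GeoRSS-Simple `georss:point` elements (a latitude and longitude separated by a space) from an RSS document using Python's standard library. The sample feed and the function name are hypothetical and exist only for this example.

```python
import xml.etree.ElementTree as ET

# Namespace used by GeoRSS-Simple elements such as georss:point.
GEORSS_NS = "http://www.georss.org/georss"

# Hypothetical feed content, invented for this example.
SAMPLE_FEED = """<rss version="2.0" xmlns:georss="http://www.georss.org/georss">
  <channel>
    <title>Example centers (hypothetical data)</title>
    <item>
      <title>Example Center A</title>
      <georss:point>45.256 -71.92</georss:point>
    </item>
    <item>
      <title>Item without a location</title>
    </item>
  </channel>
</rss>"""


def extract_points(feed_xml):
    """Return (title, latitude, longitude) for each item with a georss:point."""
    root = ET.fromstring(feed_xml)
    results = []
    for item in root.iter("item"):
        point = item.find(f"{{{GEORSS_NS}}}point")
        if point is None or not point.text:
            continue  # items without location data are skipped
        lat_text, lon_text = point.text.split()
        results.append((item.findtext("title"), float(lat_text), float(lon_text)))
    return results


points = extract_points(SAMPLE_FEED)
print(points)
```

Because the location lives in a namespaced extension element, feed readers that do not understand GeoRSS can simply ignore it.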

Last updated: 7 Jun 2016

Freebase "is an open, Creative Commons Attribution (aka CC-BY) licensed collection of structured data," and a "platform for accessing and manipulating that data" via API. Almost 40 million entities and assertions about those entities are stored within a graph database. The database was built by pulling in open data and relies on community contribution to stay updated. Freebase is part of the semantic web and emits Linked Open Data (via RDF) for all its entities.

Last updated: 29 May 2016

D3.js is a data visualization library by Mike Bostock, who is also the primary creator of Protovis, which D3 is designed to replace.

There is a great introductory tutorial available from Luke Franci. It is one of many other tutorials linked to from Bostock's D3 wiki.

Code license: Open source
Last updated: 25 May 2016

Quadrigram describes itself as a "visual programming environment" for living data. It is a web-based tool for data visualization that allows the user to customize and publish interactive visualizations with a range of data types. Visualization possibilities range from basic charts and graphs (e.g., pie chart, bar graph), to more sophisticated visualizations for exploring complex datasets (e.g., networks, geo-data, zoomable tree map, quadrification, stacked flow).

Code license: Closed source
Last updated: 22 May 2016

Text 2 Mind Map is a web-based tool for mind mapping. Very basic interface and functionality. Requires users to structure information in a linear text outline, which it returns as a diagram.

Code license: Closed source
Last updated: 22 Mar 2016

AroniSmartIntelligence™ is an application that performs text analytics on RSS articles, reviews, feedback, chat data or other unstructured texts organized into sub-folders. The output may be further input into other advanced statistical analytics or data mining modules available in AroniSmartIntelligence™, including regression analysis, econometrics, segmentation and Bayesian models.

Code license: Closed source
Last updated: 18 Mar 2016

Google Docs is an online environment for editing and sharing documents, spreadsheets, presentations, forms, drawings, and tables. Google Docs documents can be public or private, or shared with anyone with a Google account, e-mailed, or downloaded in various formats, including conversions to PDF and other formats not identical to the original or to the proprietary format used at creation. Designated people with whom items are shared can be given permission to comment or edit the files, thus providing a quick way to collaborate on creating and editing documents and presentations.

Code license: Closed source
Last updated: 26 Jan 2016

Specify is a database platform for museum and herbarium research data. It manages species and specimen information for computerizing biological collections, tracking museum specimen transactions, linking images to specimen records and publishing catalog data to the Internet. Specify is written in Java for Windows, Mac OS X, and Linux computers and uses the relational data manager, MySQL, as its data engine. Specify, Java, and MySQL are free and open-source.

Code license: Open source, GNU GPL, GNU GPL v2
Last updated: 10 Jan 2016

Figshare is a repository where users can make all of their research outputs available in a citable, shareable and discoverable manner. All file formats can be published, including videos and datasets that are often demoted to the supplemental materials section in current publishing models. Users of the site maintain full control over the management of their research whilst benefiting from global access, version control and secure backups in the cloud.

Code license: Closed source
Last updated: 29 Dec 2015

Exploratree is a web-based library and editing application for "interactive thinking guides," which are templates useful for mind mapping, brainstorming, planning, and visualization. Originally developed for use in the classroom, to help students refine and focus their ideas, as well as manage plans to further their investigation. Thinking guides can be edited, printed, and downloaded directly from the browser.

Last updated: 29 Dec 2015

yWorks is a powerful set of tools for creating diagrams using any number of frameworks. There are tools for working with HTML, FLEX, AJAX, Silverlight, Java and .NET.

yEd is also available from the yWorks site. This free graph editor can be used to create diagrams manually, or to import data for analysis.

Code license: Closed source
Last updated: 1 Dec 2015

Unlock Text is a powerful geoparser that can search text hosted on the web in txt or html format for references to locations. These locations are then returned ready for use in your results page, web map or any other application.

The Unlock Text API provides access to two parsers, the Edinburgh Geoparser from the Edinburgh Language Technology Group and the CLAVIN parser.

Code license: Open source
Last updated: 19 Nov 2015

Sigma is a JavaScript library that allows for the deployment of a graph file. It makes it easy to publish networks on Web pages, and allows developers to integrate network exploration in rich Web applications.
It is highly interactive and allows a researcher to extend their work from a dedicated graph analysis package such as Gephi and share it via the web to allow for communication of research outputs, while permitting viewers to explore and discover their own findings from the raw graph network.

Code license: MIT License
Last updated: 14 Nov 2015

NVivo is commercial software for qualitative analysis of unstructured data, in a range of formats and from diverse sources. Enables users to collect, organize, and analyze content from interviews, focus group discussions, surveys, audio, social media, videos, and webpages.

Code license: Closed source
Last updated: 30 Oct 2015

A cross-platform XML editor that may be used to create and validate XML documents and associated schema. It fully supports XSL (both XSLT and FO), DTD, Schema (RELAX NG and W3C), Database, XQuery and CSS. oXygen XML Editor works with all XML-based technologies, including XML databases, XProc pipelines, and web services, and comes with ready-to-use DITA, DocBook, TEI, and XHTML support.

Frequently updated and supported, and with a very large set of features, this software tool has proved popular with digital humanists.

Code license: Closed source
Last updated: 10 Sep 2015

Dataplot is free, public-domain software for statistical analysis and non-linear modeling. It was developed by the National Institute of Standards and Technology in the United States. It performs "scientific, engineering, statistical, mathematical, and graphical analysis" through the use of "an interactive, command-driven language/system with English-like syntax." It will function on Unix, Linux, Mac OS X, and Windows XP/VISTA/7 systems.

Code license: Open source
Last updated: 13 Aug 2015

DEVONthink is a database that helps users organize, manage and collaborate on digital files, including Office files, links, e-mails, research data and PDFs.

Code license: Closed source
Last updated: 10 Aug 2015

DH Press (originally called diPH) is a toolkit conceived as an easy-to-use WordPress plugin which allows potentially every kind of user to visualise and mash up historic and geographic information, documents and various types of multimedia content to develop digital humanities projects.

Code license: Open source
Last updated: 10 Aug 2015

Plone is a powerful, flexible, open source Content Management System (CMS) built on top of Zope application server and CMF.
Features:

  • Flexible and adaptable workflow
  • Customisable
  • Free add-ons
  • Versioning, history and reverting content
  • Support for multiple mark up formats
  • Multilingual content management
  • RSS feed support
  • WebDAV and FTP support
  • WYSIWYG
  • Integrates with Active Directory, Salesforce, LDAP, SQL, Web Services and Oracle
Code license: Open source, GNU GPL, GNU GPL v2
Last updated: 7 Aug 2015

Bitly is a link shortening service that also helps you share, track, and analyze your links. Now includes "bundles" which can be public or private, to gather your links by categories you choose. A social network style feed of other people's bitly links can be added to your dashboard.

An iOS app is available for iPhone. A WordPress plugin and an API are available. The site includes a sub-site about the API for developers: dev.bitly.com

Last updated: 6 Aug 2015

KORA is a digital repository that allows institutions to ingest, manage, and deliver digital objects and metadata.

Code license: Open source
Last updated: 5 Aug 2015

CiteULike is a free service to help you to store, organise and share the scholarly papers you are reading. When you see a paper on the web that interests you, you can click one button and have it added to your personal library. CiteULike automatically extracts the citation details, so there's no need to type them in yourself. It all works from within your web browser so you can access it from any computer with an Internet connection. CiteULike supports annotation and rating of items, and upload of attachments (e.g. PDF file). (Attachments are only accessible privately by individual users).

Code license: GNU GPL
Last updated: 5 Aug 2015

ORBIS is an "interactive scholarly work" that allows a user to determine the cost, time, and distance of various land, sea, and river routes among hundreds of sites in the ancient world, at various times of day and in various seasons. The work can be (and has been) used as a tool to study questions in various fields of study about antiquity, including trade and social interaction.

Code license: Closed source
Last updated: 5 Aug 2015

VisualEyes is a web-based authoring tool developed at the University of Virginia to weave images, maps, charts, video, and data into highly interactive and compelling dynamic visualizations.

Code license: Open source
Last updated: 3 Aug 2015

iBoogie is a clustering search engine that puts documents with similar content or with related topics into the same group. Each group is assigned a label based on the content of the documents. The results are presented to the user in a hierarchy of topics (clusters) for browsing.

Last updated: 3 Aug 2015

Omeka is a content management system designed for the display of library, museum, archives, and scholarly collections and exhibitions.

Code license: Open source, GNU GPL
Last updated: 2 Aug 2015

Zenodo builds and operates a simple and innovative service that enables researchers, scientists, EU projects and institutions to share, preserve and showcase multidisciplinary research results (data and publications) that are not part of the existing institutional or subject-based repositories of the research communities.

Code license: GNU GPL
Last updated: 2 Aug 2015

Microsoft Excel is spreadsheet software with calculation, graphing tools, and pivot table options for analyzing data. A cloud-hosted version is available as part of Office 365.

Code license: Closed source
Last updated: 13 Jul 2015

TimeRime is a web-based tool allowing people to create, view, and compare interactive timelines.

Code license: Closed source
Last updated: 8 Jul 2015

Microsoft OneNote is a digital notebook that allows you to gather notes and information in a central environment, and search across your shared notebooks to better manage information and work with others. OneNote used to be available as paid software, but is now free across platforms.

Code license: Closed source
Last updated: 5 Jul 2015

This product can filter or format text-based content. It also includes a document or link organiser and search capabilities and might more correctly be termed a text management system. With the large number of documents stored on your computer and online links that you might use, this is a helpful application that allows you to navigate the environment more easily. Although the feature set is now well developed, an inexperienced user should still be able to use it relatively easily. It is not intended only for the expert managers.

Code license: GNU GPL v3
Last updated: 15 Jun 2015

The Open Science Framework (OSF) is a free, open source tool designed to help researchers manage the entire research workflow: planning, execution, reporting, archiving and discovery. It is part collaboration software and part version control system. The OSF can be used to manage individual projects or large collaborative ones. Privacy and sharing settings allow for fine-grained control over access to files and materials stored on the platform - share privately with collaborators or publicly with the community at large.

Code license: Apache License
Last updated: 14 Jun 2015

Coggle is a web-based tool for non-linear structuring and visualization of information. Easy to create visually appealing diagrams with little to no technical expertise. Supports Markdown and LaTeX formatting (use LaTeX via the \( \) or \[ \] escape sequences). Users can add images by dragging and dropping them in the browser, view change history for each diagram and revert to previous states, and download their work as PDFs or images. Also enables real-time collaboration with others.

Possible use cases for Coggle may include:

Code license: Closed source
Last updated: 9 Jun 2015

140kit provides a management layer for tweet collection and analysis.

Raw data cannot be passed through to the users, but any analytical process can be run across your dataset, and the data is held for as long as the user wants. When new analytical processes are created, they can be run on existing sets of data. 140kit does not claim any control of the analysis; however, it retains ownership of the data collected.

Last updated: 24 May 2015

AnSWR supports qualitative analysis of word-based data. This entails a set of methods for organizing, displaying, processing, summarizing, and interpreting information.

Last updated 9/23/2005.

Only available for Windows 2000 and Windows XP.

Last updated: 24 May 2015

GRETL is a cross-platform software package for econometric analysis, written in C. It features:

Code license: Open source, GNU Affero GPL
Last updated: 23 May 2015

Dropbox is a file hosting service that includes cloud storage, personal cloud, file synchronization, and client software across multiple platforms. Dropbox allows users to create a folder on each of their computers where any type of file can be saved, synchronized, and made available across all computers. Contents of the Dropbox folder are also accessible via dropbox.com and mobile applications. Individual files and folders can be shared with other Dropbox users or made publicly accessible.

Code license: Closed source, GNU GPL v2
Last updated: 22 May 2015

Twitter allows users to send 140-character messages. There is a thriving digital humanities community of Twitter users. This tool is great for communicating and sharing ideas, micro-blogging, and real-time communication. You can follow tweets about digital humanities at https://twitter.com/hashtag/digitalhumanities.

Code license: Closed source
Last updated: 21 May 2015

ImageJ is a Java open source image processing program designed for scientific multidimensional images. It is highly extensible, with thousands of plugins and macros for performing a wide variety of tasks, and a strong, established user base.

There are three major versions of ImageJ:

  • ImageJ1 - The stable version, developed by Wayne Rasband at NIH since 1997
  • ImageJ2 - Focuses on analysis of scientific multidimensional image data. Includes ImageJ1 with a compatibility layer
Code license: Open source
Last updated: 19 May 2015

Minitab provides tools for statistical analysis and visualization. It includes tools for creating graphics, and working with variance, regression, reliability, sample size, time series, forecasting, equivalence tests, tables, simulations, and distributions.

Code license: Closed source
Last updated: 18 May 2015

MicrOsiris is a statistical and data management package for Windows. It is derived from OSIRIS IV, a statistics and data management package developed at the University of Michigan. It can import up to 10,000 variables from SPSS, SAS, STATA, UNESCO IDAMS, and Excel. It is distributed as freeware.

Last updated: 18 May 2015

Statistical Lab is a graphical user interface designed to make statistical analysis easier to understand. This interactive tool will connect and display data frames, frequency tables, random numbers or matrices. Statistical Lab uses R to run calculations, conduct analyses and perform multiple simulations and manipulations.

Code license: GPL
Last updated: 17 May 2015

PhiloGL is a framework for data visualization using WebGL, a library that extends JavaScript to allow it to generate interactive 3D graphics.

Code license: Open source, MIT License
Last updated: 17 May 2015

JavaScript InfoVis Toolkit enables users to create interactive web-based visualizations; the demos page has some good examples along with the underlying code.

Code license: Open source
Last updated: 17 May 2015

Project Quincy allows users to trace the development of social networks and institutions over time and space using information about people, places and organizations. It is a Django application with a MySQL database that can be installed on a web server.

Code license: Open source, GNU GPL
Last updated: 17 May 2015

A website that explains statistical concepts and provides a web-based environment for performing those calculations. Tools include a graph maker, distribution generators, t-tests and procedures, and correlation and regression tests. All tools have been written in JavaScript and run within the browser.

Code license: Closed source
Last updated: 14 May 2015

HUBzero is a web publication platform and content management system designed to facilitate collaboration on research and learning. In addition to standard blog and discussion features, HUBzero's most distinctive traits are a built-in environment that can run interactive software that scholars have developed within the browser, a tool development area, and the ability to share data and documents privately between members of the hub.

Code license: Open source
Last updated: 9 May 2015

bubbl.us is a web-based mind mapping tool, useful for organizing ideas, brainstorming, analyzing relationships, and visualizing data. Simple interface, with basic and easily understood functionality. Free to try without creating an account. Also available as an iOS application for iPad.

Code license: Closed source
Last updated: 9 May 2015

Greenstone is a suite of software for building and distributing digital library collections. It also allows users to publish to the internet or CD-ROM. Software interface and documentation available in English, French, Spanish, Russian, and Kazakh.

Code license: Open source, GNU GPL
Last updated: 8 May 2015

Cross-platform app for analyzing text, video, and spreadsheet data (analyzing qualitative, quantitative, and mixed methods research)

Last updated: 2 May 2015

ANTHROPAC is a menu-driven DOS program for collecting and analyzing data on cultural domains. The program assists with the collection and analysis of structured qualitative and quantitative data, and provides analytical and multivariate tools.

Last updated: 2 May 2015

SearchTeam is a collaborative search engine that allows individuals and groups to curate search results in a public or shared SearchSpace.

Code license: Closed source
Last updated: 1 May 2015

Importing, transforming, storing and indexing data should be easy.

Catmandu provides a suite of Perl modules to ease the import, storage, retrieval, export and transformation of metadata records. Combine Catmandu modules with web application frameworks such as PSGI/Plack, document stores such as MongoDB and full text indexes such as Solr to create a rapid development environment for digital library services such as institutional repositories and search engines.

Code license: GNU GPL v3
Last updated: 22 Apr 2015

STACK is an extensible social media research toolkit designed to collect, process, and store data from online social networks. The toolkit is an ongoing project via the Syracuse University iSchool, and currently supports the Twitter Streaming API. Collecting from Facebook public pages and Twitter search API are under development. The toolkit architecture is modular and supports extending. Basic Linux / Mac command line skills needed.

To learn more: https://github.com/bitslabsyr/stack

Code license: Open source
Last updated: 21 Apr 2015

OHMS (Oral History Metadata Synchronizer) inexpensively and efficiently enhances access to oral history by providing users with word-level search capability and a time-correlated transcript or indexed interview connecting the textual search term to the corresponding moment in the recorded interview online.

OHMS is an open source, web-based application designed to improve the user experience you provide for oral history, no matter what CMS or repository you use. There are 2 main components of the OHMS system

Code license: Open source
Last updated: 6 Apr 2015

OmniGraffle is a comprehensive diagramming and drawing application. Drag and drop to create wireframes, flow charts, network diagrams, UI mockups, family trees, office layouts, etc. Upgrading to OmniGraffle Pro adds Visio support, shared layers, presentation mode, object-geometry controls, AppleScript and Actions support and more.

Code license: Closed source
Last updated: 6 Apr 2015

Oracle Database is a powerful and extensive relational database management system (RDBMS). There are restrictions on the free version of the software.
Features:

  • Supports symmetric multiprocessing (SMP)
  • Stores data logically in the form of tablespaces and physically in the form of datafiles
  • Transportable tablespaces
  • Advanced Queuing (AQ)
  • 64-bit database
  • Data Mining Option

Code license: Closed source
Last updated: 22 Mar 2015

OpenRefine (formerly Google Refine) is a tool for cleaning messy data (e.g. fixing inconsistencies), transforming it between different formats, and exploring data.

Code license: Open source, BSD
Last updated: 22 Mar 2015

Islandora is an open-source software framework designed to help institutions, organizations and their audiences collaboratively manage and discover digital assets using a best-practices framework. Islandora was originally developed by the University of Prince Edward Island's Robertson Library, but is now implemented and contributed to by an ever-growing international community.

Code license: Open source
Last updated: 19 Mar 2015

Time Flow is an open-source timeline tool built to help journalists analyze temporal data. The application offers several view modes (timeline, calendar, list, table) to help explore thousands of data points. It is not web-based: it is a desktop application that can run off a thumb drive, and it is built to handle large datasets and timeline events that may include approximate dates or date spans.

Code license: Open source
Last updated: 29 Dec 2014

Google SketchUp is easy-to-use free 3D modeling software.

Code license: Closed source
Last updated: 16 Feb 2015

Data Desk implements traditional statistical techniques using a simple graphic display interface for data exploration. The program focuses specifically on the visual exploration of data.

Code license: Closed source
Last updated: 29 Dec 2014

MATLAB allows matrix manipulations, plotting of functions and data, implementation of algorithms, creation of user interfaces, and interfacing with programs written in other languages, including C, C++, Java, and Fortran.

Code license: Closed source
Last updated: 29 Dec 2014

R

R is a free software environment for statistical computing and graphics. R can be run from the command line, or using any of the many graphical user interfaces available on a variety of platforms; these are listed as separate tools.

Code license: GPL
Last updated: 29 Jan 2015

Weka provides machine learning algorithms in Java for data mining and predictive modeling tasks. These algorithms can either be incorporated into other Java code or called from the Weka Workbench, a GUI environment.

Code license: Open source, GNU GPL
Last updated: 29 Dec 2014

960 Grid System is a CSS template that comes with corresponding Acorn, Fireworks, Flash, InDesign, GIMP, Inkscape, Illustrator, OmniGraffle, Photoshop, QuarkXPress, Visio, Exp Design, and printable templates to facilitate different stages of the web design process.

Code license: Open source, GNU GPL, MIT License
Last updated: 29 Dec 2014

Navicat for MySQL is an interface for working with MySQL databases, including importing data from CSV or Excel, exporting, reporting, querying, developing scripts, and general database exploration.

Navicat for MySQL can also be used with MariaDB databases.

Last updated: 29 Dec 2014

The Visual Understanding Environment (VUE) is concept mapping software that can integrate with multiple repositories to pull in, organize, and analyze data. It offers multiple features for advanced management of digital resources for teaching, learning, and research.

Last updated: 29 Dec 2014

xMod is a desktop application which can transform a repository of XML into a completely finished website.
The entire process can be set up and run to produce a basic website, assuming some prerequisites:

  • A set of valid XML files. These would normally comply with a TEI DTD.
  • A configuration script that indicates the relationship between files.
  • A 'personality pack' (CSS and image files) that determines the site's visual appearance. If these files are not present, the completed website falls back on a default look and feel.

Code license: Open source, GNU GPL, GNU GPL v2
Last updated: 29 Dec 2014

MAXQDA is a tool for qualitative data analysis, evaluation, and text analysis. You can export some or all of your data into reports in Word, Excel, XML, or image formats. The MAXQDA Multimedia Browser lets you code audio and video files directly without having to create a transcript. You can code your information however you like for easy retrieval and organization.

Code license: Closed source
Last updated: 29 Dec 2014

Create simple charts.

Last updated: 29 Dec 2014

Exhibit 3.0 is a publishing framework for large scale data-rich interactive Web pages. The beta version is scalable up to 100k items.

Last updated: 29 Dec 2014

Flare is an ActionScript library for creating visualizations that run in the Adobe Flash Player. From basic charts and graphs to complex interactive graphics, the toolkit supports data management, visual encoding, animation, and interaction techniques. Flare features a modular design that lets developers create customized visualization techniques without having to reinvent the wheel.

Last updated: 29 Dec 2014

Survey Monkey is a web-based survey creation and distribution site, with free and paid plans that allow users to create surveys and collect responses through a link, email, Facebook, or a survey embedded in a website or blog. Survey Monkey also allows for the collection and analysis of data.

Code license: Closed source
Last updated: 29 Dec 2014

LibLime Koha is a web-based, open source integrated library system (ILS) that has also been used for virtual library systems (e.g. recreating historic libraries). LibLime Koha offers libraries circulation policies, patron management modules, parent-child relationship for patron records, club and service management features, in-depth "holds" support, single click batch import "undo" option, EzProxy compatibility, self-checkout interface and more.

Code license: Open source, GNU GPL
Last updated: 29 Dec 2014

GNU Wget is a free software package for retrieving files using HTTP, HTTPS and FTP.

Code license: Open source, GNU GPL
Last updated: 29 Dec 2014

Fedora (Flexible Extensible Digital Object Repository Architecture) was originally developed by researchers at Cornell University as an architecture to store, manage, and access digital content in the form of digital objects. Fedora defines a set of abstractions for expressing digital objects, asserting relationships among digital objects, and linking behaviors to digital objects.

Code license: Open source, Apache License
Last updated: 29 Dec 2014

MediaWiki is a free software open source wiki package written in PHP, originally for use on Wikipedia and other Wikimedia Foundation projects. It is designed to be run on a large server farm for a website that gets millions of hits per day.

Code license: Open source, GNU GPL, GNU GPL v2
Last updated: 29 Dec 2014

Casual allows you to search for a concept or common word and produces a unique cloud of concepts. Results are randomly displayed as definitions, related tags, and pictures.

Code license: Open source
Last updated: 29 Dec 2014

Cluuz is a search engine that shows not only links to related pages, but also entities (people, companies, organizations) and images that are extracted from within the search results. In addition to the results, Cluuz displays a tag cloud of the most relevant entities extracted from returned results, as well as a semantic graph view of a cluster of terms.

Code license: Closed source
Last updated: 29 Dec 2014

TouchGraph is a Java application that creates network graphs of websites returned in Google searches.

Code license: Closed source
Last updated: 29 Dec 2014

Protovis composes custom views of data with simple marks such as bars and dots. Unlike low-level graphics libraries that quickly become tedious for visualization, Protovis defines marks through dynamic properties that encode data, allowing inheritance, scales and layouts to simplify construction.

Code license: Open source, BSD
Last updated: 29 Dec 2014

BuzzData was a website that allowed users to create accounts to share and follow datasets. Data was shared in public rooms under open licenses. The service included tools to help visualize and present data.

BuzzData was closed on August 1, 2013.

Last updated: 29 Dec 2014

All Our Ideas is a research project that seeks to develop a new form of social data collection by combining the best features of quantitative and qualitative methods. Using the power of the web, we are creating a data collection tool that has the scale, speed, and quantification of a survey while still allowing for new information to "bubble up" from respondents as happens in interviews, participant observation, and focus groups.

Code license: Open source, BSD
Last updated: 29 Dec 2014

Jekyll is a simple, blog-aware, static site generator. It takes a template directory containing raw text files in various formats, runs it through Textile or Markdown and Liquid converters, and creates a complete, static, ready-to-publish website suitable for serving with your favorite web server. Jekyll also happens to be the engine behind GitHub Pages, which means you can use Jekyll to host your project’s page, blog, or website from GitHub’s servers for free.

Code license: Open source, MIT License
Last updated: 29 Dec 2014

SpiderOak provides a private and secure online backup, sync, sharing, access and storage solution for all file and data types. It was designed with a 'Privacy-as-a-Platform' approach. The 'Zero-Knowledge' privacy cloud technology provides a central, private place to store data, free from surveillance by private and public entities. Data is fully encrypted end-to-end (not only during transit) and only the user is able to unlock it using their password. Passwords are never stored by SpiderOak and the company cannot see the names of files or folders in your account.

Code license: Closed source, GNU GPL v3
Last updated: 29 Dec 2014

DSpace is the software of choice for academic, non-profit, and commercial organizations building open digital repositories. It is free and easy to install "out of the box" and completely customizable to fit the needs of any organization.

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets. DSpace has an active community of developers and is used by thousands of institutions worldwide.

Last updated: 29 Dec 2014

Korbo is a powerful aggregation platform for gathering Linked Data objects relevant to your area of research into single workspaces or “baskets”.

Korbo is targeted primarily at developers who want to build applications on top of its API and make full use of the linked cultural data from sources such as Europeana, FreeBase and DBPedia.

Korbo is currently in the early stages of development, but you can already try out a demo version of the platform.

Code license: Open source, GNU GPL
Last updated: 29 Dec 2014

Insync extends Google Drive's web functionality to your desktop by integrating with Windows, Mac and Linux platforms. Insync allows for built-in sharing without a browser, multiple account support, on-demand shared file syncing, desktop notifications and more.

Code license: Closed source
Last updated: 29 Dec 2014

nanoc is a Ruby-based static site generator. It runs on your local computer and compiles documents written in formats such as Markdown, Textile, Haml… into a static web site consisting of simple HTML files, ready for uploading to any web server.

Code license: MIT License
Last updated: 29 Dec 2014

Circos is a software package for visualizing data and information. It visualizes data in a circular layout, which makes Circos ideal for exploring relationships between objects or positions. There are other reasons why a circular layout is advantageous, not the least being the fact that it is attractive.
Circos is ideal for creating publication-quality infographics and illustrations with a high data-to-ink ratio, richly layered data and pleasant symmetries. You have fine control over each element in the figure to tailor its focus points and detail to your audience.

Code license: GPL
Last updated: 29 Dec 2014

Nomenklatura is a reference data recon server. It is a service that allows users to define and manage lists of canonical entities (e.g. person or organization names) and aliases that connect to one of the canonical entities. This helps to clean up messy data in which a single entity may be referred to by many names. It includes a user interface, an API, and a reconciliation endpoint for OpenRefine for matching data from data sets with the canonical entries.
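The relationship between canonical entities and aliases can be sketched as a small lookup structure. This is an illustrative sketch only, not Nomenklatura's API or storage model: the `AliasIndex` class, its method names, and the normalisation rule (case-folding plus whitespace collapsing) are assumptions made for the example.

```python
from typing import Dict, Optional


class AliasIndex:
    """Hypothetical in-memory store of canonical entities and their aliases."""

    def __init__(self) -> None:
        # Maps a normalised name (canonical or alias) to the canonical name.
        self._canonical_by_key: Dict[str, str] = {}

    @staticmethod
    def _key(name: str) -> str:
        # Assumed normalisation: fold case and collapse runs of whitespace.
        return " ".join(name.casefold().split())

    def add_entity(self, canonical: str) -> None:
        """Register a canonical entity name."""
        self._canonical_by_key[self._key(canonical)] = canonical

    def add_alias(self, alias: str, canonical: str) -> None:
        """Connect an alias to an already registered canonical entity."""
        target = self._canonical_by_key.get(self._key(canonical))
        if target is None:
            raise KeyError(f"unknown canonical entity: {canonical!r}")
        self._canonical_by_key[self._key(alias)] = target

    def resolve(self, name: str) -> Optional[str]:
        """Return the canonical name for a messy input, or None if unknown."""
        return self._canonical_by_key.get(self._key(name))


index = AliasIndex()
index.add_entity("International Business Machines")
index.add_alias("IBM", "International Business Machines")
index.add_alias("I.B.M.", "IBM")  # an alias may be given via another alias

print(index.resolve("  ibm "))
print(index.resolve("Unknown Corp"))
```

Names that resolve to `None` are the ones a human reviewer would need to match or add as new canonical entities.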

Code license: Open source
Last updated: 29 Dec 2014

Statwing is an easy-to-use, web-based tool for data analysis and visualization. Upload data, select variables of interest, and Statwing automatically selects statistical tests and visualizations, then distills the results into plain English sentences (as well as traditional statistical output for those so inclined).

A free trial is available, as well as multiple pricing plans.

Code license: Closed source
Last updated: 29 Dec 2014

RAW

RAW is an open web app for creating custom vector-based visualisations using the D3.js library through a simple interface.
It is an open, customizable project that can be forked on GitHub. Primarily conceived as a tool for designers and vis geeks, RAW lets you export visualizations in vector (SVG) or raster (PNG) format and embed them in your web page.

Code license: GNU LGPL
Last updated: 29 Dec 2014

Plot.ly is a web-based graphing tool, free for educational (public) users, that combines leading-edge visualisations with a user-friendly, guided creation experience. Accepting data in a variety of forms, it leads the user through creation and sharing and then facilitates discussion around the output objects. The data, the code and any algorithm transformations are freely accessible to all users, and products are easily embedded in other documents for presentation purposes.
Plot.ly also offers its services on a paid basis to enterprise users.

Last updated: 29 Dec 2014
Code license: GNU Affero GPL v.3
Last updated: 29 Dec 2014

MapCraft is a tool for tracking mapping progress during a mapping party or other coordinated project taking place in a concentrated area. It takes the concept of a "cake diagram" and makes it more dynamic: the cake is shown as a set of clickable areas, and participants can take ownership of cake slices, comment on them, and chat with each other about them.

Code license: Open source
Last updated: 29 Dec 2014