HEURIST is an extremely flexible data management system designed specifically for Humanities data - see http://HeuristNetwork.org. It is available as a free web service for researchers (hosted at the University of Sydney Data Centre) or for local installation (Open Source). Any confident researcher can design, create, manage, analyse, visualise and publish their own richly-structured database(s) through a simple web interface, without programmers or consultants. Quite complex databases can be built in a few hours through borrowing structures and vocabularies published by other users. Databases can be designed and built incrementally, as existing data are not affected by changes in structure. Advanced features include record linking, drilldown facet searches, rule-based queries, custom reports, linked map-timelines, network visualisation, normalised spreadsheet import, crosstabulation, XML feeds, XSLT transforms.
Gephi is graphing software that provides a way to explore data through visualization and network analysis.
Topincs is a online database software. Given a data model, it allows
Recogito is an online platform for collaborative document annotation.
Recogito provides a personal workspace where you can upload, collect and organize your source materials - texts and images - and collaborate in their annotation and interpretation. Recogito enables you to make your work more visible on the Web more easily, and to expose the results of your research as Open Data.
Analyse-it Method Validation Edition provides the statistical analysis you need to validate and verify analytical and diagnostic methods to meet the demands of regulatory compliance. With support for 11 CLSI protocols, it lets you:
• Validate or verify analytical performance characteristics (precision, trueness, linearity, interferences, detection capability) of a measurement procedure to ensure they meet requirements for intended use or manufacturer's claims.
Analyse-it Standard Edition provides you with the powerful statistical analysis and regression you’d expect from an expensive statistics package, but without the complexity or the cost.
• Describe your data with a wide range of statistics.
• Visualize distributions, see trends and patterns, and spot outliers with powerful charts from histograms to scatter plots.
Analyse-it Quality Control & Improvement Edition provides the statistical analysis you need to understand processes, bring them under control, and find improvements that better your product.
• Bring processes under statistical control so they are stable and predictable with powerful Shewhart, cumulative sum, and moving average control charts. Include Xbar-R, Xbar-S, R, S, I-MR, time weighted CUSUM, UWMA, EMA plots, and WECO, Nelson, Montomery or custom rulesets.
A set of dataset management and statistical plugins for Microsoft Excel 2007, 2010 & 2013 including single and multiple linear regression, polynomial regression, and scatter plot with fit, fit confidence and prediction bands. The program can be used to generate visualizations and data reports. Requires Microsoft Excel.
Recollection is a platform developed by Zepheira for the Library of Congress National Digital Information Infrastructure and Preservation Program (NDIIPP), allowing users to create and share embeddable interfaces to digital cultural heritage collections. The Library of Congress released its latest version of Recollection as Viewshare, built to increase the ease of finding, using, and sharing the project's software.
TXM is a free and open-source cross-platform Unicode, XML & TEI based text analysis software, supporting Windows, Mac OS X and Linux. It is also available as a J2EE standard compliant portal software (GWT based) for online access with access control built in (see a demo portal: http://portal.textometrie.org/demo).
Weave (Web-based Analysis and Visualization Environment) is a visualization platform designed to enable visualization of any available data by anyone for any purpose. Weave is an application development platform supporting multiple levels of user proficiency — novice to advanced — as well as the ability to integrate, disseminate and visualize data at “nested” levels of geography.
The Science of Science (Sci2) Tool is a modular toolset supporting temporal, geospatial, topical, and network analysis and visualization of datasets at the micro (individual), meso (local), and macro (global) levels. Users of the tool can:
- Access science datasets online or load their own
- Perform different types of analysis with the most effective algorithms available
- Use different visualizations to interactively explore and understand specific datasets
- Share datasets and algorithms across scientific boundaries
CartoDB is a cloud based mapping, analysis and visualization engine that lets users build spatial applications for both mobile and the web. Users input tabular data and then construct an interactive visualisation through the web interface. It provides automatic georeferencing functionality and provides APIs for mobile data collection and dissemination.
Use is free for up to five tables; after that, there are monthly pricing plans.
Development was funded through EU and Spanish research programmes.
The DataTank is an open source tool that publishes data, stored in text-based files (e.g., CSV, XML, JSON) or in binary structures (e.g., SHP files, relational databases). The DataTank reads data from these structures and publishes them to the web using a URI as an identifier, providing these data in any format a user wants regardless of the original data structure. The DataTank requires a server with Apache2 or Nginx, mod rewrite enabled, PHP 5.4 or higher, Git, any database supported by Laravel 4.
Quadrigram describes itself as a "visual programming environment" for living data. It is a web-based tool for data visualization that allows the user to customize and publish interactive visualizations with a range of data types. Visualization possibilities range from basic charts and graphs (e.g., pie chart, bar graph), to more sophisticated visualizations for exploring complex datasets (e.g., networks, geo-data, zoomable tree map, quadrification, stacked flow).
The Altmetric Explorer is a powerful web app that allows you to track the conversations around scientific articles online. Altmetric collects and analyzes hundreds of thousands of postings about tens of thousands of articles and datasets each month. It makes this data available to end users through an intuitive user interface and to developers through an API.
Text 2 Mind Map is a web-based tool for mind mapping. Very basic interface and functionality. Requires users to structure information in a linear text outline, which it returns as a diagram.
Overview is a tool for analyzing large sets of documents. In includes a sophisticated search engine, word clouds, entity detection, and topic-based document clustering. If that’s not good enough, you can write your own plugins using the API. It is open source and you can run it on your own computer.
It was originally designed for investigative journalists, but it’s now also used for qualitative research, social media conversation analysis, legal document review, digital humanities, and more.
Overview is built to do several types of tasks:
Exploratree is a web-based library and editing application for "interactive thinking guides," which are templates useful for mind mapping, brainstorming, planning, and visualization. Originally developed for use in the classroom, to help students refine and focus their ideas, as well as manage plans to further their investigation. Thinking guides can be edited, printed, and downloaded directly from the browser.
A free iOS app for text analysis. Textal allows you to analyze documents, tweet streams, and webpages. Create clickable text clouds based on the source data that you choose. It comes pre-loaded with a large number of public domain texts. Text clouds are easily shareable via various Twitter and email.
yWorks is a powerful set of tools for creating diagrams using any number of frameworks. There are tools for working with HTML, FLEX, AJAX, Silverlight, Java and .NET.
yEd is also available from the yWorks site. This free graph editor can be used to create diagrams manually, or to import data for analysis.
NVivo is commercial software for qualitative analysis of unstructured data, in a range of formats and from diverse sources. Enables users to collect, organize, and analyze content from interviews, focus group discussions, surveys, audio, social media, videos, and webpages.
SentimentBuilder is an online tool that performs text analytics on emails, reviews, feedback, chat data or any unstructured texts via Natural Language Processing. It's the only tool where you can upload a file for processing and then visually view the results in a Sankey Flow Report to quickly identify trends, issues and strengths and then customize each view, save and share! Export any result for your own offline analysis! Try the Always Free version today and upload your own data or try one of our sample files.
Dataplot is free, public-domain software for statistical analysis, and non-linear modeling. It was developed by the National Insistute of Standards and Technology in the United States. It performs "scientific, engineering, statistical, mathematical, and graphical analysis" through the use of "an interactive, command-driven language/system with English-like syntax." It will function on Unix, Linux, Mac OS X, and Windows XP/VISTA/7 systems.
VisualEyes is web-based authoring tool developed at the University of Virginia to weave images, maps, charts, video, and data into highly interactive and compelling dynamic visualizations.
Microsoft Excel is spreadsheet software with calculation, graphing tools, and pivot table options for analyzing data. A cloud-hosted version is available as part of Office 365.
TimeRime is a web-based tool allowing people to create, view, and compare interactive timelines.
Lucidchart is a cloud-based diagramming and visual communication app with an intuitive drag-and-drop interface. Features include professional templates and an extensive shape library. Team users can edit the same file together in real time. With over four million users, Lucidchart is an excellent tool for making flowcharts, mockups, network diagrams, and more.
The ‘Stylo’ package provides easy-to-use implementations of various established analyses in the field of computational stylistics, including non-traditional authorship attribution, genre recognition, style development (“stylochronometry”), etc. The package includes a number of explanatory methods provided by the function stylo() (multidimensional scaling, principal component analysis, cluster analysis, bootstrap consensus trees).
SylvaDB is a graph database management system. It allows users with no knowledge in graph theory to model, collect, query, and analyze data in a network structure. SylvaDB provides tools for easy creation of schemas and modelling, automatic forms creation to input the data, collaborative features, a visual query editor, global and local search, reports charts generation, networks metrics, and visualizations tools.
Coggle is a web-based tool for non-linear structuring and visualization of information. Easy to create visually appealing diagrams with little to no technical expertise. Supports Markdown and LaTeX formatting (use LaTeX via the \\( \\) or \\[ \\] escape sequences). Users can add images by dragging and dropping them in the browser, view change history for each diagram and revert to previous states, and download their work as PDFs or images. Also enables real-time collaboration with others.
Possible use cases for Coggle may include:
A text-mining system for scientific literature. Textpresso's two major elements are (1) access to full text, so that entire articles can be searched, and (2) introduction of categories of biological concepts and classes that relate to objects (e.g., association, regulation, etc.) or describe one (e.g., methods, etc).
140kit provides a management layer for tweet collection and analysis.
Raw data cannot be passed through to the users, but any analytical process can be run across your dataset, and the data is held for as long as the user wants. When new analytical processes are created, they can be run on existing sets of data. 140kit does not claim any control of the analysis, however it retains ownership of the data collected.
AnSWR supports qualitative analysis of word-based data. This entails a set of methods for organizing, displaying, processing, summarizing, and interpreting information.
Last updated 9/23/2005.
Only available for Windows 2000 and Windows XP.
Find searches that correlate with real-world data: Google Correlate finds search patterns which correspond with real-world trends.
GRETL () s a cross-platform software package for econometric analysis, written in C. It features:
Whatizit can ingest up to 500,000 terms pasted into the input box and execute any of the pre-defined text analysis pipelines.
Weft QDA is a free and open-source tool for the analysis of textual data. You may import documents from plain text or PDF, apply character-level coding, category and document memos, retrieve coded text, apply simple coding statistics, apply free-text search, and export to HTML and CSV formats.
HyperRESEARCH enables users to code and retrieve, build theories, and conduct analyses of your data. You may work with text, graphics, audio and video sources.
WordSmith allows users to develop concordances, find keywords, and develop word lists from plain text files.
Qualrus is an innovative qualitative data analysis tool that helps you manage unstructured data. Additionally, Qualrus learns your coding trends, provides a visual semantic network display, and gives advice and technical support.
ImageJ is a Java open source image processing program designed for scientific multidimensional images. It is highly extensible, with thousands of plugins and macros for performing a wide variety of tasks, and a strong, established user base.
There are three major versions of ImageJ:
- ImageJ1 - The stable version, developed by Wayne Rasband at NIH since 1997
- ImageJ2 - Focuses on analysis of scientific multidimensional image data. Includes ImageJ1 with a compatibility layer
Minitab provides tools for statistical analysis and visualization. It includes tools for creating graphics, and working with variance, regression, reliability, sample size, time series, forecasting, equivalence tests, tables, simulations, and distributions.
MicrOsiris is a statistical and data management package for Windows. This freeware has been derived from OSIRIS IV, a statistics and data management package developed at the University of Michigan. It can import up to 10,000 variables from SPSS, SAS, STATA, UNESCO IDAMS, and Excel. It is distributed as freeware.
Lexos is an online tool that enables you to "scrub" (clean) your text(s), cut a text(s) into various size chunks, manage chunks and chunk sets, and choose from a suite of analysis tools for investigating those texts. Functionality includes building dendrograms, making graphs of rolling averages of word frequencies or ratios of words or letters, and playing with visualizations of word frequencies including word clouds and bubble visualizations.
Statistical Lab is a graphical user interface designed to make statistical analysis easier to understand. This interactive tool will connect and display data frames, frequency tables, random numbers or matrixes. Statistical Lab uses R to run calculations, conduct analyses and perform multiple simulations and manipulations.
Project Quincy allows users to trace the development of social networks and institutions over time and space using information about people, places and organizations. It is a Django application with a MySQL database that can be installed on a web server.
RSiena is a package for the R language that enables the statistical analysis of network data, including longitudinal network data, longitudinal data of networks and behavior, and cross-sectional network data. It provides the same functionality available in SIENA (Simulation Investigation for Empirical Network Analysis), Windows software which is no longer maintained.
Lynks provides an easy to use, in-browser tool that helps you to create your own networks. Lynks is an initiative by Centre for Innovation, part of Leiden University (Campus The Hague). The software has been developed in 2014 in co-creation, with expertise from Dr. Eelke Heemskerk from University of Amsterdam. The software development has been supported by the financial contributions from the European Union Fund for Regional Development (EFRO) and the Municipality of The Hague.
bubbl.us is a web-based mind mapping tool, useful for organizing ideas, brainstorming, analyzing relationships, and visualizing data. Simple interface, with basic and easily understood functionality. Free to try without creating an account. Also available as an iOS application for iPad.
Cross-platform app for analyzing text, video, and spreadsheet data (analyzing qualitative, quantitative, and mixed methods research)
Linguistic Inquiry and Word Count is a text analysis software program that calculates the degree to which people use different categories of words across a wide array of texts.
ANTHROPAC is a menu-driven DOS program for collecting and analyzing data on cultural domains. The program assists with the collection and analysis of structured qualitative and quantitative data, and provides analytical and multivariate tools.
ScraperWiki is an online tool to make that makes the process of data scraping simpler and more collaborative. Anyone can write a screen scraper using the online editor. In the free version, the code and data are shared with the world. Because it's a wiki, other programmers can contribute to and improve the code.
This software scans one or many Word DOCX, text and text-like files (e.g. HTML and XML files) and counts the number of occurrences of the different words or phrases. There is no limit on the size of an input text file. The words/phrases which are found can be displayed alphabetically or by frequency. The program can be told to allow or disallow words with numerals, hyphens, apostrophes, underscores or colons, to ignore words which are short or which occur infrequently, and to ignore words (e.g., common words such as 'the', a.k.a. stop words) contained in a specified file.
This software scans a Word DOCX file or a text file (including HTML and XML files) with text encoded via ANSI or UTF-8 and counts the frequencies of different words. The words which are found and displayed can be ordered alphabetically or by frequency.
OmniGraffle is a comprehensive diagramming and drawing application. Drag and drop to create wireframes, flow charts, network diagrams, UI mockups, family trees, office layouts, etc.. Upgrading to OmniGraffle Pro adds Visio support, shared layers, presentation mode, object-geometry controls, AppleScript and Actions support and more.
CollateX is a Java software for collating textual sources, for example, to produce a critical apparatus. As of January 2012 the project was at an early stage of development and lacked thorough documentation.
JGAAP is software designed for textual analysis, text categorization, and authorship attribution
This package allows users to train topic models in MALLET and load results directly into R.
TAMS Analyzer is a program that works with TAMS to let you assign ethnographic codes to passages of a text just by selecting the relevant text and double clicking the name of the code on a list. It then allows you to extract, analyze, and save coded information.
"TextSTAT is a simple programme for the analysis of texts. It reads plain text files (in different encodings) and HTML files (directly from the internet) and it produces word frequency lists and concordances from these files. This version includes a web-spider which reads as many pages as you want from a particular website and puts them in a TextSTAT-corpus. The new news-reader, too, puts news messages in a TextSTAT-readable corpus file.
TextSTAT reads MS Word and OpenOffice files. No conversion needed, just add the files to your corpus...
Time Flow is an open-source timeline built to help journalists analyze temporal data. The application offers several view modes--timeline, calendar, list, table--to help explore thousands of data points. It is not a web-based tool--it is a desktop application that can run off a thumb-drive and is built to handle large datasets, and timeline events that may include approximate dates or date spans.
Data Desk implements traditional statistical techniques using a simple graphic display interface for data exploration. The program focuses specifically on the visual exploration of data.
MATLAB allows matrix manipulations, plotting of functions and data, implementation of algorithms, creation of user interfaces, and interfacing with programs written in other languages, including C, C++, Java, and Fortran.
The Visual Understanding Environment (VUE) is concept mapping software that can integrate with multiple repositories to pull in, organize, and analyze data. Multiple features for advanced management of digital resources for teaching, learning, and research.
MAXQDA is a tool for qualitative data analysis, evaluation, and text analysis. You can export parts or all data into reports in Word, Excel, XML, or Images. The MAXQDA Multimedia Browser enables to code audio and video files directly without having to create a transcript. You can code your information however you like for easy retrieval and organization.
The main programs that comprise the Information processor are called the analyst server and query or knowledge processor. The analyst program can be called from a command line, from an html form, or through a TCP/IP socket protocol. The query processor can be accessed with any browser using HTML commands. It analyzes text and allows the user to search it.
Software for creating data dashboards. Many of the sample galleries portray corporate financial data.
Flare is an ActionScript library for creating visualizations that run in the Adobe Flash Player. From basic charts and graphs to complex interactive graphics, the toolkit supports data management, visual encoding, animation, and interaction techniques. Flare features a modular design that lets developers create customized visualization techniques without having to reinvent the wheel.
eXist-db is an open source database management system that stores XML data according to the XML data model and features efficient, index-based XQuery processing.
The Versioning Machine displays multiple versions of text encoded according to TEI Guidelines and allows for comparisons of annotation and introductory materials. This is a text editor and allows editors "to immediately see the consequences of their editorial decisions." This tool does not appear to have been updated since 2011.
SARIT (Search and Retrieval of Indic Texts" is a collection of electronic editions of Sanskrit and other Indian-language texts that have dated and embedded notes about their change history. You can perform a text search, retrieval and analysis of works in SARIT, as well as download all the texts and convert them to PDF, HTML, etc.
Qiqqa is a research management software that allows you to organize large numbers of papers; find new papers to read and new information about papers you already have; review materials and create annotation reports. Qiqqa has several PDF tools that also allow you to convert from PDFs to text, and use a clipboard function to cut and paste text into your document.
FieldWorks consists of software tools that help you manage linguistic and cultural data. FieldWorks supports tasks ranging from the initial entry of collected data through to the preparation of data for publication
Pliny is a scholarly note-taking and annotation tool. It may be used with both digital (web pages, images, PDF files) and non-digital (books, printed articles) materials, run as a desktop application on the user's computer. Pliny is useful for taking and managing annotations and notes while reading, as well as subsequently developing and presenting an interpretation.
Protovis composes custom views of data with simple marks such as bars and dots. Unlike low-level graphics libraries that quickly become tedious for visualization, Protovis defines marks through dynamic properties that encode data, allowing inheritance, scales and layouts to simplify construction.
LATtice lets you explore and compare texts across entire corpora but also allows you to “drill down” to the level of individual LATs (language action types) to ask exactly what rhetorical categories make texts similar or different.
Gliffy is a tool that makes it easy to create, share, and collaborate on a wide range of diagrams. Gliffy works directly in your browser and can be used to build flow charts, organization charts, network diagrams, workflows, technical drawings, and wireframes.
Google Scholar Citations lets you track citations to your publications, check who is citing your publications, graph your citations over time, compute citation metrics, and view publications by colleagues.
All Our Ideas is a research project that seeks to develop a new form of social data collection by combining the best features of quantitative and qualitative methods. Using the power of the web, we are creating a data collection tool that has the scale, speed, and quantification of a survey while still allowing for new information to "bubble up" from respondents as happens in interviews, participant observation, and focus groups.
Bookworm enables you to graphically explore lexical trends in repositories of digitized texts.
Voyant Tools is a web-based reading and analysis environment for digital texts.
Create a survey to measure and record your students' progress against curriculum outcomes. Use Likert questions to rate students based on a rubric and text questions to record qualitative data to share with parents and faculty. **This item is being upgraded to PollDaddy: Your login details and account will remain the same, you will now login at Polldaddy.com.
Kaleidoscope is one of the world's best tools for spotting differences in images and text, and now it supports merging of files and folders, too. Kaleidoscope integrates directly with Git, Subversion, Mercurial, and Bazaar to fit perfectly in your workflow.
Circos is a software package for visualizing data and information. It visualizes data in a circular layout — this makes Circos ideal for exploring relationships between objects or positions. There are other reasons why a circular layout is advantageous, not the least being the fact that it is attractive.
Circos is ideal for creating publication-quality infographics and illustrations with a high data-to-ink ratio, richly layered data and pleasant symmetries. You have fine control each element in the figure to tailor its focus points and detail to your audience.
Statwing is an easy-to-use, web-based tool for data analysis and visualization. Upload data, select variables of interest, and Statwing automatically selects statistical tests and visualizations, then distills the results into plain English sentences (as well as traditional statistical output for those so inclined).
Free trial available, as well as multiple pricing plans:
NodeBox is an application for creating 2D graphics and visualizations. It provides a visual and process-based editor for an underlying Python-based analysis and visualisation package. It is developer-described as a generative design app and this really taps into the serendipitous nature of the environment. The user constructs models and can tweak them in real time via the interface and see the resulting changes too the output.
It has been described as being "similar to Processing, but without all the interactivity".