HEURIST is an extremely flexible data management system designed specifically for Humanities data - see http://HeuristNetwork.org. It is available as a free web service for researchers (hosted at the University of Sydney Data Centre) or for local installation (Open Source). Any confident researcher can design, create, manage, analyse, visualise and publish their own richly-structured database(s) through a simple web interface, without programmers or consultants. Quite complex databases can be built in a few hours through borrowing structures and vocabularies published by other users. Databases can be designed and built incrementally, as existing data are not affected by changes in structure. Advanced features include record linking, drilldown facet searches, rule-based queries, custom reports, linked map-timelines, network visualisation, normalised spreadsheet import, crosstabulation, XML feeds, XSLT transforms.
The OpenZoom SDK is a free and open source toolkit for Flash for developing Zoomable User Interfaces (ZUIs) for high-resolution images.
A software application that is used for analysing and visualising multi-volume seismic data.
- Visualization and analysis of 2D and 3D seismic data in a single survey
- 2D and 3D horizon tracking including auto-tracking, plane-by-plane, line and manual tracking
- On-the-fly calculation and visualization of various attributes and filters
- Plug-in architecture
QGIS is a user-friendly Open Source Geographic Information System (GIS) licensed under the GNU General Public License. QGIS is an official project of the Open Source Geospatial Foundation (OSGeo). It runs on Linux, Unix, Mac OSX, Windows and Android and supports numerous vector, raster, and database formats and functionalities.
GRASS GIS is free and open source software used for geospatial data management and analysis, image processing, graphics/maps production, spatial modeling, and visualization. GRASS is currently used in academic and commercial settings around the world, as well as by many governmental agencies and environmental consulting companies.
The goal of the Alpheios project is to help people learn how to learn languages as efficiently and enjoyably as possible, and in a way that best helps them understand their own literary heritage and culture, and the literary heritage and culture of other peoples throughout history. One of the principal tools, a Firefox plugin, allows a reader to browse a web page with Latin, ancient Greek, or Arabic, click on a word, and get a definition and morphological analysis of the word.
NewRadial is an interactive visualization environment that uses an adapter system to display and combine content from remotely-served or locally situated databases. Although initially designed for use with image-based databases, NewRadial’s capabilities have been extended to handle the manipulation and annotation, in a visual field, of text-based databases.
Serendip-o-matic extracts keywords from your Zotero library or from text that you've pasted into the web interface, and finds related content in locations such as the Digital Public Library of America (DPLA), Europeana, and Flickr Commons, including photographs, documents, maps and other primary sources.
JabRef is a graphical application for managing bibliographical databases. JabRef is designed specifically for BibTeX bases, but can import and export many other bibliographic formats. JabRef runs on all platforms and requires Java 1.6 or newer.
LilyPond is text-based music engraving software, with specific text input conventions.
Audacity is a free, easy-to-use and multilingual audio editor and recorder. Basic features, as listed on their website, include:
- Record live audio.
- Record computer playback on any Windows Vista or later machine.
- Convert tapes and records into digital recordings or CDs.
- Edit WAV, AIFF, FLAC, MP2, MP3 or Ogg Vorbis sound files.
- Cut, copy, splice or mix sounds together.
- Change the speed or pitch of a recording.
A software application that enables a user to search, manipulate and publish large SGML/XML documents. Anastasia was developed within an academic context to enable the manipulation of a single large mark-up document or a set of documents. It uses two methods to interpret the structure of a mark-up document: first, it uses pattern-matching algorithms to process a hierarchical tree, similar to other XML software applications; second, it interprets the document structure as a series of sequential 'events' which must be processed.
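The difference between the tree view and the event view of a mark-up document can be illustrated with a minimal Python sketch. It uses the standard library's ElementTree rather than Anastasia itself, and the sample document and function names are assumptions made for illustration only.

```python
import io
import xml.etree.ElementTree as ET

# Hypothetical sample document, not taken from Anastasia.
SAMPLE = "<doc><p n='1'>First <hi>bold</hi> line</p><p n='2'>Second line</p></doc>"

def tree_view(xml_text):
    """Parse the whole document into a tree, then match a pattern against it."""
    root = ET.fromstring(xml_text)
    # Pattern matching on the hierarchy: every <p> that is a child of the root.
    return [p.get("n") for p in root.findall("./p")]

def event_view(xml_text):
    """Read the same document as a flat sequence of start/end events."""
    stream = io.BytesIO(xml_text.encode("utf-8"))
    return [(event, elem.tag)
            for event, elem in ET.iterparse(stream, events=("start", "end"))]

if __name__ == "__main__":
    print(tree_view(SAMPLE))
    for event in event_view(SAMPLE):
        print(event)
```

The tree view suits queries about hierarchy, while the event view processes elements in document order, which can be convenient when structures overlap or the document is too large to hold comfortably in memory.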
Chicken (originally called Chicken of the VNC) is a VNC (Virtual Network Computing) client that allows you to display and interact with a remote computer's screen from your own machine.
PhiloLine is an add-on for the Philologic text retrieval engine that provides a sequence alignment algorithm for humanities text analysis designed to identify "similar passages" in large collections of texts.
Philomine is an extension to the Philologic text retrieval engine that supports a variety of machine learning, text mining, and document clustering tasks.
A Python-based XML web publishing framework which enables dynamic pipelining of XSLT transformations. Data is processed by an XML pipeline composed of several WSGI applications and middleware components.
- Apache Cocoon Sitemap 1.0 compatible
- WSGI modularity
- URI pattern matching
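A pipeline of WSGI applications and middleware with URI pattern matching might look roughly like the following sketch. It is not this framework's code: the class and function names are hypothetical, and plain Python functions stand in for the XSLT stylesheets so that only the standard library is needed.

```python
import re
from wsgiref.util import setup_testing_defaults

def source_app(environ, start_response):
    """Innermost application: produces the raw XML document."""
    start_response("200 OK", [("Content-Type", "application/xml")])
    return [b"<doc><title>hello</title></doc>"]

class Transform:
    """Middleware that rewrites the body produced by the wrapped application.

    A real pipeline stage would apply an XSLT stylesheet here (assumption:
    a plain function stands in for it in this sketch).
    """
    def __init__(self, app, func):
        self.app = app
        self.func = func

    def __call__(self, environ, start_response):
        captured = {}

        def capture(status, headers, exc_info=None):
            # Hold back the inner response until the body has been rewritten.
            captured["status"] = status
            captured["headers"] = headers

        body = b"".join(self.app(environ, capture))
        new_body = self.func(body)
        headers = [(k, v) for k, v in captured["headers"]
                   if k.lower() != "content-length"]
        headers.append(("Content-Length", str(len(new_body))))
        start_response(captured["status"], headers)
        return [new_body]

class Router:
    """Dispatch on the request path using regular-expression patterns."""
    def __init__(self, routes):
        self.routes = [(re.compile(pattern), app) for pattern, app in routes]

    def __call__(self, environ, start_response):
        path = environ.get("PATH_INFO", "")
        for pattern, app in self.routes:
            if pattern.fullmatch(path):
                return app(environ, start_response)
        start_response("404 Not Found", [("Content-Type", "text/plain")])
        return [b"not found"]

def upper_title(body):
    return body.replace(b"hello", b"HELLO")

def to_html(body):
    return body.replace(b"<doc>", b"<html>").replace(b"</doc>", b"</html>")

# Two transformation stages wrapped around the source, mounted on a URI pattern.
pipeline = Transform(Transform(source_app, upper_title), to_html)
application = Router([(r"/docs/\w+\.html", pipeline)])

def call(app, path):
    """Invoke a WSGI app in-process and return (status, body)."""
    environ = {}
    setup_testing_defaults(environ)
    environ["PATH_INFO"] = path
    result = {}

    def start_response(status, headers, exc_info=None):
        result["status"] = status

    body = b"".join(app(environ, start_response))
    return result["status"], body

if __name__ == "__main__":
    print(call(application, "/docs/intro.html"))
```

Because each stage is an ordinary WSGI callable, stages can be reordered or swapped without touching the others, which is the modularity the feature list refers to.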
The Entity Authority Tool Set (EATS) is a web application for recording, editing, using and displaying authority information about entities. It is designed to allow multiple authorities to each maintain their own independent data, while operating on a common base so that information about the same entity is all in one place. EATS also comes with client tools for automatically looking up entities in a text by name and adding appropriate TEI markup.
- A web API for importing and exporting entity data
From the website:
Specify is a database platform for museum and herbarium research data. It manages species and specimen information for computerizing biological collections, tracking museum specimen transactions, linking images to specimen records and publishing catalog data to the Internet. Specify is written in Java for Windows, Mac OS X, and Linux computers and uses the relational data manager, MySQL, as its data engine. Specify, Java, and MySQL are free and open-source.
Pundit is a semantic annotation and augmentation tool. It enables users to create structured data while annotating web pages.
Annotations range from simple comments to semantic links to web of data entities (such as Freebase.com and Dbpedia.org), to fine-grained cross-references and citations. Pundit can be configured to include custom controlled vocabularies. In other words, annotations can refer to precise entities and concepts as well as express precise relations among entities and contents.
Superfastmatch is designed to find exact duplicates of text strings between documents.
CulturalAnalytics is an R package containing functions for statistical analysis and plotting of image properties, including statistics such as the standard deviation and mean in the RGB and HSV color spaces, image entropy and histograms in greyscale (intensity) and color, and for plotting color clouds and image scatter charts.
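The kind of image statistics described here can be sketched in a few lines. The example below is in Python rather than R and does not use or reproduce the CulturalAnalytics API; the function names are hypothetical, pixels are assumed to be (r, g, b) tuples in the range 0..255, and greyscale intensity is assumed to be the plain average of the three channels.

```python
import colorsys
import math
from collections import Counter
from statistics import mean, pstdev

def channel_stats(pixels):
    """Mean and population standard deviation per channel in RGB and HSV.

    `pixels` is a list of (r, g, b) tuples with values in 0..255.
    RGB statistics are on the 0..255 scale, HSV statistics on 0..1.
    """
    hsv = [colorsys.rgb_to_hsv(r / 255, g / 255, b / 255) for r, g, b in pixels]
    stats = {}
    for names, rows in (("rgb", pixels), ("hsv", hsv)):
        for i, name in enumerate(names):
            values = [row[i] for row in rows]
            stats[name] = (mean(values), pstdev(values))
    return stats

def grey_entropy(pixels):
    """Shannon entropy (in bits) of the greyscale intensity histogram."""
    # Assumption: intensity is the unweighted average of the three channels.
    grey = [round((r + g + b) / 3) for r, g, b in pixels]
    counts = Counter(grey)
    total = len(grey)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

if __name__ == "__main__":
    image = [(0, 0, 0), (255, 255, 255), (255, 0, 0), (0, 0, 255)]
    print(channel_stats(image))
    print(grey_entropy(image))
```

Computing one such feature vector per image is what makes it possible to place a whole collection on a scatter chart, for example mean brightness against mean saturation.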
SwiftRiver is free and open source web-based software for real-time filtering, curation, and qualitative analysis of social media data (Twitter, etc.).
Open Journal Systems (OJS) is a journal management and publishing system. The Public Knowledge Project (the sponsor of OJS) is a multi-university initiative developing free, open source software and conducting research to improve the quality and reach of scholarly publishing.
Philologic is a full-text search, retrieval and analysis tool with support for TEI-Lite XML/SGML, Unicode encoding, plaintext, Dublin Core/HTML, and DocBook.
Plone is a powerful, flexible, open source Content Management System (CMS) built on top of Zope application server and CMF.
- Flexible and adaptable workflow
- Free add-ons
- Versioning, history and reverting content
- Support for multiple mark up formats
- Multilingual content management
- RSS feed support
- WebDAV and FTP support
- Integrates with Active Directory, Salesforce, LDAP, SQL, Web Services and Oracle
Online web survey tool written in PHP using MySQL, MSSQL or Postgres database. Multilingual site with demo, feature list, documentation [Open Source, GPL3]
CiteULike is a free service to help you to store, organise and share the scholarly papers you are reading. When you see a paper on the web that interests you, you can click one button and have it added to your personal library. CiteULike automatically extracts the citation details, so there's no need to type them in yourself. It all works from within your web browser so you can access it from any computer with an Internet connection. CiteULike supports annotation and rating of items, and upload of attachments (e.g. PDF file). (Attachments are only accessible privately by individual users).
Omeka is a content management system designed for the display of library, museum, archives, and scholarly collections and exhibitions.
Perl is a high-level, general-purpose, interpreted, dynamic programming language. Originally developed for text manipulation, it is now used for a wide range of tasks including graphics programming, system administration, network programming, applications that require database access and CGI programming on the Web.
- Borrows features from C, shell scripting (sh), AWK, and sed
- Powerful text processing facilities
- Flexibility and adaptability
- Support for multiple programming paradigms
Zenodo builds and operates a simple and innovative service that enables researchers, scientists, EU projects and institutions to share, preserve and showcase multidisciplinary research results (data and publications) that are not part of the existing institutional or subject-based repositories of the research communities.
A statistical natural language parser for analyzing text to determine its grammatical structure.
The Stanford Part-of-Speech Tagger includes English, Arabic, Chinese, and German tagger modules.
BuddyPress is a variant of WordPress that includes social networking features.
A free (under the GNU General Public License) toolkit for the development of document image recognition systems.
- Custom dictionaries may be created to assist with analysis of specific record types
- Extensible functionality
- Optical character recognition (OCR) toolkit plugin
Project Quincy allows users to trace the development of social networks and institutions over time and space using information about people, places and organizations. It is a Django application with a MySQL database that can be installed on a web server.
FreeMind is Java-based mind mapping software.
MDID is software for teaching and learning with digital images, with tools for discovering, aggregating, and presenting digital media in a variety of learning spaces.
Greenstone is a suite of software for building and distributing digital library collections. It also allows users to publish to the internet or CD-ROM. Software interface and documentation available in English, French, Spanish, Russian, and Kazakh.
HTTrack provides an easy-to-use interface for downloading websites (including HTML, images, and other files) or updating a copy of a previously downloaded site.
Drupal is an extremely flexible general content management system with numerous plugins that provide scholar-oriented functionality.
A digital repository software package that may be used to accept, manage and publish digital objects. It is widely used in academia as a system to manage academic research papers, electronic theses and other distinct digital resources. EPrints offers an extensible plug-in architecture, enabling data processing activities to be tailored to the requirements of the institution.
Free and open source music notation program. From website:
MuseScore runs on Windows, MacOS, and Linux, and is available in over 40 different languages.
- Create sheet music with WYSIWYG editor
- Listen to your score with computer playback
- Share & print your score
- Work the way you like
- Get help
Commentpress is a theme and plugin for WordPress that enables granular public commenting on texts.
TAMS Analyzer is a program that works with TAMS to let you assign ethnographic codes to passages of a text just by selecting the relevant text and double clicking the name of the code on a list. It then allows you to extract, analyze, and save coded information.
Frogr is a small application for the GNOME desktop that allows users to manage their accounts in the Flickr image hosting website. It supports all the basic Flickr features, including uploading pictures, adding descriptions, setting tags and managing sets and groups pools.
The latest stable version of Frogr (0.10) currently features a basic flickr uploader with the following features:
GIMP is image editing software, much like Photoshop. It is a multi-platform software application primarily used for image composition and editing. The basic tool may be augmented by plug-ins and extensions that allow the use of new file formats, effects filters and batch processing capabilities. GIMP was initially created to manipulate raster images, but has been extended to provide limited vector image and moving image support. A number of free extensions are available in the plugin registry.
PAIR is a sequence alignment algorithm for humanities text analysis designed to identify "similar passages" in large collections of texts. In addition to a Philologic add-on, PAIR is available as Text::Pair, a generalized Perl module that supports one-against-many comparisons. A corpus is indexed and incoming texts are compared against the entire corpus for text reuse.
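The one-against-many workflow (index a corpus, then compare each incoming text against the index) can be approximated with overlapping word n-grams, often called shingles. This is a simplified sketch of that general idea, not PAIR's or Text::Pair's actual algorithm or API; all names and parameters below are assumptions.

```python
import re
from collections import defaultdict

def shingles(text, n=3):
    """Return the set of overlapping n-word sequences in `text`."""
    words = re.findall(r"\w+", text.lower())
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}

def build_index(corpus, n=3):
    """Map each shingle to the ids of the corpus documents that contain it."""
    index = defaultdict(set)
    for doc_id, text in corpus.items():
        for sh in shingles(text, n):
            index[sh].add(doc_id)
    return index

def find_reuse(index, incoming, n=3, min_shared=2):
    """Compare one incoming text against the whole corpus via the index."""
    hits = defaultdict(int)
    for sh in shingles(incoming, n):
        for doc_id in index.get(sh, ()):
            hits[doc_id] += 1
    # Keep only documents sharing at least `min_shared` shingles.
    return {doc_id: count for doc_id, count in hits.items() if count >= min_shared}

if __name__ == "__main__":
    corpus = {
        "a": "the quick brown fox jumps over the lazy dog",
        "b": "an entirely different sentence about cats",
    }
    index = build_index(corpus)
    print(find_reuse(index, "he said the quick brown fox jumps high"))
```

Indexing the corpus once means each incoming text costs only a lookup per shingle, regardless of how many documents the corpus holds.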
960 Grid System is a CSS template that comes with corresponding Acorn, Fireworks, Flash, InDesign, GIMP, Inkscape, Illustrator, OmniGraffle, Photoshop, QuarkXPress, Visio, Exp Design, and printable templates to facilitate different stages of the web design process.
Weka provides machine learning algorithms in Java for data mining and predictive modeling tasks. These algorithms can either be incorporated into other Java code or called from the Weka Workbench, a GUI environment.
The Open Harvester Systems is a free metadata indexing system that allows users to create a searchable index of the metadata from Open Archives Initiative (OAI)-compliant archives, such as sites using Open Journal Systems (OJS) or Open Conference Systems (OCS). It can harvest OAI metadata in a variety of schemas, including unqualified DC, the PKP (Open Journal Systems/Open Conference Systems) Dublin Core extension, MODS, and MARCXML.
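Harvesting unqualified Dublin Core over OAI-PMH comes down to requesting ListRecords with the oai_dc metadata prefix and reading the dc elements out of each record. The sketch below shows that step in Python; it is not Open Harvester Systems code, the function names are hypothetical, and the sample response and example.org address are invented for illustration.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Namespaces defined by OAI-PMH 2.0 and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

def list_records_url(base_url, metadata_prefix="oai_dc"):
    """Build a ListRecords request URL for an OAI-PMH endpoint."""
    query = urlencode({"verb": "ListRecords", "metadataPrefix": metadata_prefix})
    return base_url + "?" + query

# Invented sample response; a real harvester would fetch this over HTTP.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:article/1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>A sample article</dc:title>
          <dc:creator>Doe, Jane</dc:creator>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

def index_records(xml_text):
    """Return {identifier: {"title": [...], "creator": [...]}} for each record."""
    root = ET.fromstring(xml_text)
    index = {}
    for record in root.iterfind(".//oai:record", NS):
        identifier = record.findtext("oai:header/oai:identifier", namespaces=NS)
        index[identifier] = {
            "title": [t.text for t in record.iterfind(".//dc:title", NS)],
            "creator": [c.text for c in record.iterfind(".//dc:creator", NS)],
        }
    return index

if __name__ == "__main__":
    print(list_records_url("https://example.org/oai"))
    print(index_records(SAMPLE_RESPONSE))
```

A complete harvester would also follow resumption tokens to page through large result sets, which this sketch leaves out.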
Integrated Content Environment (ICE) was an open source project of the Learning Resources Development (LRD) unit at the University of Southern Queensland. The content management system allowed users to convert content authored in Microsoft Word or OpenOffice.org Writer into self-contained course websites using the IMS format.
The ICE authoring environment enabled:
Calibre is a free and open source ebook library management application, including options for syncing to devices and converting between a large number of formats. Calibre also has a built-in e-book editor for EPUB and AZW3 formats.
Windows .NET based open source Digital Asset Management solution designed for medium size preservation, cataloguing, media archiving and batch transcoding.
MantisBT is a free popular web-based bugtracking system written in the PHP scripting language. The most common use of MantisBT is to track software defects. However, MantisBT is often configured by users to serve as a more generic issue tracking system and project management tool.
- event-driven-plug-in system
- works with MySQL, MS SQL, PostgreSQL, SQLite, Oracle and IBM DB2 databases
- RSS Feeds
- Customisable workflow
- Wiki integration
- Chat integration
xMod is a desktop application which can transform a repository of XML into a completely finished website.
The entire process can be set up and run to produce a basic website, assuming some prerequisites:
- A set of valid XML files. These would normally comply with a TEI DTD.
- A configuration script that indicates the relationship between files
- A 'personality pack' (CSS and image files) that determine its visual appearance. However if they are not present, the completed website falls back on a default look and feel.
"In the WordHoard environment, texts are annotated or tagged by morphological, lexical, prosodic, and narratological criteria. They are mediated through a 'digital page' or user interface that lets scholarly but non-technical users explore the greatly increased query potential of textual data kept in such a form."
Denemo is a music notation editor with a graphical user interface that allows users to enter notation for typesetting by the LilyPond music engraver.
Flare is an ActionScript library for creating visualizations that run in the Adobe Flash Player. From basic charts and graphs to complex interactive graphics, the toolkit supports data management, visual encoding, animation, and interaction techniques. Flare features a modular design that lets developers create customized visualization techniques without having to reinvent the wheel.
eXist-db is an open source database management system that stores XML data according to the XML data model and features efficient, index-based XQuery processing.
The Virtual Lightbox enables online image comparison, with features like an image-centric whiteboard. There are two versions, application and applet, which have different functionalities.
GNU Wget is a free software package for retrieving files using HTTP, HTTPS and FTP.
LibLime Koha is a web-based, open source integrated library system (ILS) that has also been used for virtual library systems (e.g. recreating historic libraries). LibLime Koha offers libraries circulation policies, patron management modules, parent-child relationship for patron records, club and service management features, in-depth "holds" support, single click batch import "undo" option, EzProxy compatibility, self-checkout interface and more.
PDFsam can split and merge batches of PDFs.
CamStudio is free and open source screencasting software that saves the video as AVI files, though a Flash converter is included.
CHET-C, or Chapel Hill Electronic Text-Converter, is a browser based software tool designed to convert digital texts that employ standard epigraphic conventions such as the Leiden sigla into EpiDoc-compliant XML files.
The tool can be accessed online at http://www.stoa.org/projects/epidoc/stable/chetc-js/chetc.html. Fragments of epigraphic text using standard sigla (e.g. Leiden convention markup) are pasted into the tool and EpiDoc-compliant XML is generated.
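The conversion from sigla to markup can be pictured as a set of rewrite rules. The following Python sketch handles only two Leiden conventions, abbreviations expanded in round brackets and lost text restored in square brackets, and maps them to the EpiDoc elements commonly used for them. It is an illustration under those assumptions, not CHET-C's rule set, and its output is not guaranteed to match what the tool generates.

```python
import re

def leiden_to_epidoc(text):
    """Convert a tiny subset of Leiden sigla to EpiDoc-style XML (sketch only)."""
    # Abbreviation with editorial expansion, e.g. Imp(erator).
    text = re.sub(
        r"(\w+)\((\w+)\)",
        r"<expan><abbr>\1</abbr><ex>\2</ex></expan>",
        text,
    )
    # Lost text restored by the editor, e.g. Caes[ar].
    text = re.sub(
        r"\[([^\[\]]+)\]",
        r'<supplied reason="lost">\1</supplied>',
        text,
    )
    # Wrap the fragment in an anonymous block.
    return "<ab>" + text + "</ab>"

if __name__ == "__main__":
    print(leiden_to_epidoc("Imp(erator) Caes[ar]"))
```

Real inscriptions combine many more sigla (erasures, gaps of known or unknown length, uncertain letters), so a production converter needs a fuller rule set and careful ordering of the rules.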
A structured text editor that may be used to create, edit, validate and convert XML and SGML documents. EpcEdit contains an integrated validating parser, an editor for CALS and HTML tables, an attribute editor and an element manipulation tool.
A software application for the playback of audio recordings. SoundScriber offers specific functionality for researchers that wish to transcribe a recording. It was originally developed for use in the Michigan Corpus of Academic Spoken English (MICASE) project and released for use by academics performing similar work.
- Audio playback via installed audio codecs (e.g. Wav, MP3)
- Variable speed playback
XSugar is a proof-of-concept tool for mapping textual content between a flat file schema and XML format. It performs static analysis to establish whether transformations between the two formats are bi-directional, enabling content that has been converted into an XML format to be re-exported to the original flat file structure, or vice versa. To validate the conversion, a schema must exist for both the source and destination formats, e.g. a bespoke XFlat-encoded XML document that defines the structure of a class of flat files, and an XML schema.
A software application that enables relational databases to be created, managed and queried. The database management system enables multiple users to access a database through an appropriate interface. As an open source tool, MySQL underpins a number of free software projects, such as WordPress, phpBB and other software built on a LAMP infrastructure. Although widely used, there are a number of performance issues that limit its use in some environments. For example, it is unable to use multiple CPU cores to process a single query, potentially limiting its use as a data warehouse.
MediaWiki is a free software open source wiki package written in PHP, originally for use on Wikipedia and other Wikimedia Foundation projects. It is designed to be run on a large server farm for a website that gets millions of hits per day.
Xournal is an application for note-taking, sketching, and journaling.
WikiMindMap is a mind mapping tool that allows you to browse wiki content. It uses an interactive interface based on a system of nodes and brackets. Maps may be exported into FreeMind.
Bluefish is a powerful editor targeted towards programmers and web designers, with many options to write websites, scripts and programming code. Bluefish supports many programming and markup languages, and it focuses on editing dynamic and interactive websites.
Improvise is a fully implemented Java software architecture and user interface that enables users to build and browse highly coordinated visualizations interactively. By coupling a shared-object coordination model with a declarative visual query language, users gain precise control over how navigation and selection affect the appearance of data across multiple views, using a potentially infinite number of variations on well-known coordination patterns such as synchronized scrolling, overview+detail, brushing, drill-down, and semantic zoom.
Korbo is a powerful aggregation platform for gathering Linked Data objects relevant to your area of research into single workspaces or “baskets”.
Korbo is targeted primarily at developers who want to build applications on top of its API and make full use of the linked cultural data from sources such as Europeana, FreeBase and DBPedia.
Korbo is currently in the early stages of development, but you can already try out a demo version of the platform.
TemaTres is an open source vocabulary server, web application to manage and exploit vocabularies, thesauri, taxonomies and formal representations of knowledge.
Windows tool for annotating, editing, and sharing screen shots and images.
Org mode is for keeping notes, maintaining TODO lists, doing project planning, and authoring with a fast and effective plain-text system.
Org is not a standalone application but a plug-in for GNU Emacs, and it is now included in some Emacs distributions. As an Emacs plug-in, it is a line-mode, text-based note manager rather than a GUI tool.
Ptolemaic is a computer application for music visualization and analysis written in the Java programming language. The software is designed to aid in the analysis of all types of Western music using established analytical techniques, including tonal functional analysis (Harrison 1994), pitch-class set analysis (Forte 1973), hierarchical linear analysis (Schenker 1935, Jones 2002), tonal pitch-space analysis on the Tonnetz (Riemann 1915), and transformation analysis (Lewin 1987).
With ediarum researchers can comfortably transcribe, encode and edit manuscripts in TEI-XML, as well as publish their results in an online or print edition. The solution, developed by TELOTA, is based on three software components: exist-db, Oxygen XML Author, and ConTeXt. These are combined, supplemented with additional functions, and tailored to fit a project's needs.
Annotation Studio is an open source, web-based annotation application that integrates a powerful set of textual interpretation tools behind an intuitive and easy-to-use interface. Users can upload their own texts, and annotate with styled text, video, images, and weblinks. To date, the project has been used with great success in disciplines such as Writing, Literature, Foreign Languages, Anthropology, Film and Media Studies, and others at institutions including Harvard, Yale, Stanford, MIT, Barnard College, and Washington University.