HEURIST is an extremely flexible data management system designed specifically for Humanities data - see http://HeuristNetwork.org. It is available as a free web service for researchers (hosted at the University of Sydney Data Centre) or for local installation (Open Source). Any confident researcher can design, create, manage, analyse, visualise and publish their own richly-structured database(s) through a simple web interface, without programmers or consultants. Quite complex databases can be built in a few hours through borrowing structures and vocabularies published by other users. Databases can be designed and built incrementally, as existing data are not affected by changes in structure. Advanced features include record linking, drilldown facet searches, rule-based queries, custom reports, linked map-timelines, network visualisation, normalised spreadsheet import, crosstabulation, XML feeds, XSLT transforms.
Gephi is graphing software that provides a way to explore data through visualization and network analysis.
CoolTool is an online platform for market research. Collaborative, easy-to-use platform allows you to do absolutely everything, from creating stunning professional surveys to running affordable biometric research. CoolTool NeuroLab includes eye tracking, EEG, emotions measurement and mouse traking solutions. All of which are easily integrated into your questionnaire. This allows you to conduct neuromarketing projects with an ease of cloud technology.
Recogito is an online platform for collaborative document annotation.
Recogito provides a personal workspace where you can upload, collect and organize your source materials - texts and images - and collaborate in their annotation and interpretation. Recogito enables you to make your work more visible on the Web more easily, and to expose the results of your research as Open Data.
trust your unique day will bring you bunches of joy, love and fun. You merit them a great deal. Appreciate!
Have a great birthday. I wish your consistently to be loaded with heaps of adoration, chuckling, satisfaction and the glow of daylight.
May this birthday be loaded with heaps of glad hours furthermore your existence with numerous upbeat birthdays, that are yet to come. Upbeat birthday.
ANNIS is an open source, cross platform (Linux, Mac, Windows), web browser-based search and visualization architecture for complex multi-layer linguistic corpora with diverse types of annotation. ANNIS, which stands for ANNotation of Information Structure, was originally designed to provide access to the data of the SFB 632 - “Information Structure: The Linguistic Means for Structuring Utterances, Sentences and Texts”. It has since then been extended to a large number of projects annotating a variety of phenomena.
Onodo is a user friendly tool for doing basic visualisation and analysis within a browser making it cross-platform right out of the box. Data can be entered via their intuitive, in-browser node and edge creation tables or imported from a file. As data is added it is instantly previewed and displayed. The visual appearance can be tweaked on the fly and customised in a variety of ways (including applying a unique image or URL to each node). Once created, visualisations can be embedded and shared outside of the Onodo browser environment.
Scripto is an open-source tool for community transcription of documents, images, and multimedia files. Registered users are permitted to view digital files and transcribe them with an easy-to-use toolbar. The tool includes a versioning history and editorial controls to make public contributions more manageable, and supports the transcription of a wide range of file types.
Recollection is a platform developed by Zepheira for the Library of Congress National Digital Information Infrastructure and Preservation Program (NDIIPP), allowing users to create and share embeddable interfaces to digital cultural heritage collections. The Library of Congress released its latest version of Recollection as Viewshare, built to increase the ease of finding, using, and sharing the project's software.
TXM is a free and open-source cross-platform Unicode, XML & TEI based text analysis software, supporting Windows, Mac OS X and Linux. It is also available as a J2EE standard compliant portal software (GWT based) for online access with access control built in (see a demo portal: http://portal.textometrie.org/demo).
The FAIMS Mobile Platform (http://www.fedarch.org) is an open source, generalised system for digital data collection on Android. It works offline and helps record free text, multimedia, structured or spatial data with ample opportunity for the capture of metadata and certainty components of the captured data. It needs to be customised (via an xml definition document) for particular field/lab workflows. As a server-client system it facilitates simultaneous operation by multiple users.
The OpenZoom SDK is a free and open source toolkit for Flash for developing Zoomable User Interfaces (ZUIs) for high-resolution images.
Geospatial Data Abstraction Library (GDAL) is a translator library for vector and raster geospatial data formats that is released under an X/MIT style Open Source license by the Open Source Geospatial Foundation.
ColorBrewer is a web tool for selecting color schemes for thematic maps, most usually for choropleth maps. It includes 35 basic schemes with different numbers of classes for over 250 possible versions. Each scheme has CMYK, RGB, Hex, Lab, and AV3 (HSV) specs for the colors. The software is designed simply to list color specs for a scheme you find useful so you are able to create these colors in the mapping software you are using.
Ushahidi is an open source software for information collection, visualization and interactive mapping, to build tools for democratizing information, increasing transparency and lowering the barriers for individuals to share their stories. It has been used for election monitoring, crisis and emergency response, civil society, city/community building, arts and even where to find the best burger.
Crowdmap: A simple map-making tool, built on an open API, that allows for collaborative mapping
A software application that is used for analysing and visualising multi-volume seismic data.
- Visualization and analysis of 2D and 3D seismic data in a single survey
- 2D and 3D horizon tracking including auto-tracking, plane-by-plane, line and manual tracking
- On-the-fly calculation and visualization of various attributes and filters
- Plug-in architecture
Weave (Web-based Analysis and Visualization Environment) is a visualization platform designed to enable visualization of any available data by anyone for any purpose. Weave is an application development platform supporting multiple levels of user proficiency — novice to advanced — as well as the ability to integrate, disseminate and visualize data at “nested” levels of geography.
Viewshare is a free web application for creating interfaces and visualizations of cultural heritage collections. It can create interactive maps, timelines, facets, tag clouds, histograms, and image galleries. The intended users of Viewshare are individuals managing and creating access to digital collections of cultural heritage materials. Viewshare is offered as a software as a service (SaaS), email firstname.lastname@example.org to request a free account.
GapVis is an interface for exploring and reading texts that reference ancient places. It includes maps and data visualizations that show what locations are referred to a different points in the narrative and allows you to dig into the details to learn more.
QGIS is a user-friendly Open Source Geographic Information System (GIS) licensed under the GNU General Public License. QGIS is an official project of the Open Source Geospatial Foundation (OSGeo). It runs on Linux, Unix, Mac OSX, Windows and Android and supports numerous vector, raster, and database formats and functionalities.
GRASS GIS is free and open source software used for geospatial data management and analysis, image processing, graphics/maps production, spatial modeling, and visualization. GRASS is currently used in academic and commercial settings around the world, as well as by many governmental agencies and environmental consulting companies.
The Science of Science (Sci2) Tool is a modular toolset supporting temporal, geospatial, topical, and network analysis and visualization of datasets at the micro (individual), meso (local), and macro (global) levels. Users of the tool can:
- Access science datasets online or load their own
- Perform different types of analysis with the most effective algorithms available
- Use different visualizations to interactively explore and understand specific datasets
- Share datasets and algorithms across scientific boundaries
Open source map design studio for creating and styling web tile maps, based on either provided free MapBox Geodata, or data imported as a variety of file formats. It uses the CartoCSS styling language to customize interactive visualizations, with powerful control of symbols, zoom levels, labels, and other cartographic details. The finished maps can be exported either as images or tiled web maps. Though the TileMill software is free, there may be a fee to host the created tiles on MapBox or elsewhere.
The goal of the Alpheios project is to help people learn how to learn languages as efficiently and enjoyably as possible, and in a way that best helps them understand their own literary heritage and culture, and the literary heritage and culture of other peoples throughout history. One of the principal tools, a Firefox plugin, allows a reader to browse a web page with Latin, ancient Greek, or Arabic, click on a word, and get a definition and morphological analysis of the word.
WorldMap is a web-based map collaboration platform hosted at Harvard, which allows anyone to store, organize, visualize, edit, collaborate, and publish geospatial information. Upload your own map layers or use maps others have contributed.
The system attempts to fill the growing gap between powerful desktop-bound mapping applications, and lightweight web map solutions with limited storage capacity.
NewRadial is an interactive visualization environment that uses an adapter system to display and combine content from remotely-served or locally situated databases. Although initially designed for use with image-based databases, NewRadial’s capabilities have been extended to handle the manipulation and annotation, in a visual field, of text-based databases.
Serendip-o-matic extracts keywords from your Zotero library or from text that you've pasted into the web interface, and finds related content in locations such as the Digital Public Library of America (DPLA), Europeana, and Flickr Commons, including photographs, documents, maps and other primary sources.
Using Google Spreadsheets as the backend database is awesome because it is easy to use, share and collaborate with." -- website
CartoDB is a cloud based mapping, analysis and visualization engine that lets users build spatial applications for both mobile and the web. Users input tabular data and then construct an interactive visualisation through the web interface. It provides automatic georeferencing functionality and provides APIs for mobile data collection and dissemination.
Use is free for up to five tables; after that, there are monthly pricing plans.
Development was funded through EU and Spanish research programmes.
The DataTank is an open source tool that publishes data, stored in text-based files (e.g., CSV, XML, JSON) or in binary structures (e.g., SHP files, relational databases). The DataTank reads data from these structures and publishes them to the web using a URI as an identifier, providing these data in any format a user wants regardless of the original data structure. The DataTank requires a server with Apache2 or Nginx, mod rewrite enabled, PHP 5.4 or higher, Git, any database supported by Laravel 4.
StoryMapJS is a project by KnightLab which aims to help journalists and historians tell stories by using maps using a straight-forward and simple authoring tool. It is constructed around a framework of highlighting the locations of a series of events. As an editing tool, it is relatively intuitive and easy to use.
Freebase "is an open, Creative Commons Attribution (aka CC-BY) licensed collection of structured data," and a "platform for accessing and manipulating that data" via API. Almost 40 million entities and assertions about those entities are stored within a graph database. The database was built by pulling in open data and relies on community contribution to stay updated. Freebase is part of the semantic web and emits Linked Open Data (via RDF) for all its entities.
Annotum is an open-source, open-process, open-access scholarly authoring and publishing platform based on WordPress.
JabRef is a graphical application for managing bibliographical databases. JabRef is designed specifically for BibTeX bases, but can import and export many other bibliographic formats. JabRef runs on all platforms and requires Java 1.6 or newer.
LilyPond is text-based music engraving software, with specific text input conventions.
Beautiful Soup is a library, written in the Python programming language, for pulling specific pieces of data out of HTML and XML files. It is especially suitable when working with data files that aren't well-formed, or are otherwise difficult to parse.
Saves programmers hours or days of work on quick-turnaround screen scraping projects.
Bookworm is a tool that visualizes language usage trends in repositories of digitized texts in a simple and powerful way. It is a tool for culturomic exploration through the observation of chronological trends for words and phrases in large digitized collections of textual documents with metadata facets.
Overview is a tool for analyzing large sets of documents. In includes a sophisticated search engine, word clouds, entity detection, and topic-based document clustering. If that’s not good enough, you can write your own plugins using the API. It is open source and you can run it on your own computer.
It was originally designed for investigative journalists, but it’s now also used for qualitative research, social media conversation analysis, legal document review, digital humanities, and more.
Overview is built to do several types of tasks:
I would consider myself to be a visual learner. I surround myself with gifs and memes, and when it comes to academia I gravitate towards infographics, mind-maps and charts to observe key pieces of information. So it’s not surprising that I chose a graphics based tool to review. In my area of study (Digital Humanities and IT), we cross paths with a myriad of visualisations every day, in many different formats.
Audacity is a free, easy-to-use and multilingual audio editor and recorder. Basic features, as listed on their website, include:
- Record live audio.
- Record computer playback on any Windows Vista or later machine.
- Convert tapes and records into digital recordings or CDs.
- Edit WAV, AIFF, FLAC, MP2, MP3 or Ogg Vorbis sound files.
- Cut, copy, splice or mix sounds together.
- Change the speed or pitch of a recording.
The LC Newspaper Viewer is an open-source web application that understands how to model newspaper data created according to a set of technical guidelines, with the goal of publishing an online archive like Chronicling America.
A software application that enables a user to search, manipulate and publish large SGML/XML documents. Anastasia was developed within an academic context to enable the manipulation of a single, large mark-up documents or a set of documents. It utilises two methods to interpret the structure of a mark-up document: First, it uses pattern-matching algorithms to process a hierarchical tree, similar to other XML software applications; Second, it interprets the document structure as a series of sequential 'events' which must be processed.
Chicken (originally called Chicken of the VNC) is a VNC client, or Virtual Network Computing, that allows you to display and interact with a remote computer while displaying it on a remote screen.
Philomine is an extension to the Philologic text retrieval engine that supports a variety of machine learning, text mining, and document clustering tasks.
PhiloLine is an add-on for the Philologic text retrieval engine that provides a sequence alignment algorithm for humanities text analysis designed to identify "similar passages" in large collections of texts.
The Hopscotch Model and web tool constitutes a tool to help researchers to thoroughly accomplish with the stages included in the design of qualitative research studies, as well as a conceptual model for faculty teaching research methods to design comprehensive syllabi in the field of qualitative research. The model helps potential researchers to incorporate the theoretical background behind qualitative research, and the intrinsic complexity of the multiple technical procedures that could be followed within the many different traditions in the field.
Soundflower is a MacOS system extension that allows applications to pass audio to other applications. Soundflower is easy to use, it simply presents itself as an audio device, allowing any audio application to send and receive audio with no other support needed.
Example: Play a YouTube video, and use Soundflower to route the audio signal into recording software such as Audacity.
Initially supported and developed by Cycling '74, the project now lives on Google Code where is has received updates from members there.
Sigil is a free, open source, multi-platform e-book editor, designed for editing books in EPUB format.
- Full UTF-16 support and full EPUB 2 spec support
- Multiple views: code view (complete control over directly editing EPUB syntax), book view (WYSIWYG), and preview view
- Table of contents generator, metadata editor, multi-language user interface, spell checking tool, EPUB compliance validator, support for find and replace
Combined with the Leptonica Image Processing Library Tesseract can read a wide variety of image formats and convert them to text in over 40 languages.
This code is a raw OCR engine. It has no output formatting and no UI. It can detect fixed pitch vs proportional text. Nevertheless in 1995 this engine was in the top 3 in terms of character accuracy, and it compiles and runs on both Linux and Windows. Training code is included in the open source release.
The core developer on the project is Ray Smith (theraysmith).
A Python-based XML web publishing framework which enables dynamic pipelining of XSLT transformations. Data is processed by an XML pipeline composed of several WSGI applications and middleware components.
- Apache Cocoon Sitemap 1.0 compatible
- WSGI modularity
- URI pattern matching
The Entity Authority Tool Set (EATS) is a web application for recording, editing, using and displaying authority information about entities. It is designed to allow multiple authorities to each maintain their own independent data, while operating on a common base so that information about the same entity is all in one place. EATS also comes with client tools for automatically looking up entities in a text by name and adding appropriate TEI markup.
- A web API for importing and exporting entity data
Specify is a database platform for museum and herbarium research data. It manages species and specimen information for computerizing biological collections, tracking museum specimen transactions, linking images to specimen records and publishing catalog data to the Internet. Specify is written in Java for Windows, Mac OS X, and Linux computers and uses the relational data manager, MySQL, as its data engine. Specify, Java, and MySQL are free and open-source.
Pundit is a semantic annotation and augmentation tool. It enables users to create structured data while annotating web pages.
Annotations span from simple comments to semantic links to web of data entities (as Freebase.com and Dbpedia.org), to fine granular cross-references and citations. Pundit can be configured to include custom controlled vocabularies. In other words, annotations can refer to precise entities and concepts as well as express precise relations among entities and contents. Read more on semantically structured annotations
Yāna provides a free, open-source template that open access journals can use as the basis of their own multi-platform mobile applications. The Yāna template provides the basic functionality required by any journal operating on a mobile platform.
Superfastmatch is designed to find exact duplicates of text strings between documents.
Unlock Text is a powerful geoparser that can search text hosted on the web in txt or html format for references to locations. These locations are then returned ready for use in your results page, web map or any other application.
The Unlock Text API provides access to two parsers, the Edinburgh Geoparser from the Edinburgh Language Technology Group and the CLAVIN parser.
CulturalAnalytics is an R package containing functions for statistical analysis and plotting of image properties, including statistics such as the standard deviation and mean in the RGB and HSV color spaces, image entropy and histograms in greyscale (intensity) and color, and for plotting color clouds and image scatter charts.
SwiftRiver is free and open source web-based software for real-time filtering, curation, and qualitative analysis of social media data (Twitter, etc.)
EXMARaLDA is a system for working with oral corpora on a computer. It consists of a transcription and annotation tool (Partitur-Editor), a tool for managing corpora (Corpus-Manager) and a query and analysis tool (EXAKT).
EXMARaLDA supports the time-aligned transcription of digital audio or video, a flexible annotation for freely choosable categories, the systematic documentation of a corpus through metadata, a flexible visualisation of transcription data in different layouts and document formats, as well as the computer-assisted querying of transcription, annotation and metadata.
Joomla is an open source content management system (CMS), enabling users to build websites and applications.
Dataplot is free, public-domain software for statistical analysis, and non-linear modeling. It was developed by the National Insistute of Standards and Technology in the United States. It performs "scientific, engineering, statistical, mathematical, and graphical analysis" through the use of "an interactive, command-driven language/system with English-like syntax." It will function on Unix, Linux, Mac OS X, and Windows XP/VISTA/7 systems.
DH Press (originally called diPH) is a toolkit conceived as an easy-to-use WordPress plugin which allows potentially every kind of user to visualise and mashup historic and geographic information, documents and various types of multimedia content to develop digital humanities project.
Philologic is a full-text search, retrieval and analysis tool with support for TEI-Lite XML/SGML, Unicode encoding, plaintext, Dublin Core/HTML, and DocBook.
Plone is a powerful, flexible, open source Content Management System (CMS) built on top of Zope application server and CMF.
- Flexible and adaptable workflow
- Free add-ons
- Versioning, history and reverting content
- Support for multiple mark up formats
- Multilingual content management
- RSS feed support
- WebDAV and FTP support
- Integrates with Active Directory, Salesforce, LDAP, SQL, Web Services, LDAP and Oracle
Online web survey tool written in PHP using MySQL, MSSQL or Postgres database. Multilingual site with demo, feature list, documentation [Open Source, GPL3]
Zotero [zoh-TAIR-oh] is a free, easy-to-use tool to help users collect, organize, cite, and share their research sources Users can add PDFs, images, audio and video files, snapshots of web pages, and really anything else. Zotero automatically indexes the full-text content of their libraries, enabling them to find exactly what they're looking for with just a few keystrokes.
KORA is an digital repository that allows institutions to ingest, manage, and deliver digital objects and metadata.
VisualEyes is web-based authoring tool developed at the University of Virginia to weave images, maps, charts, video, and data into highly interactive and compelling dynamic visualizations.
TimeMapper is an open source timeline tool that takes information in a Google spreadsheet and uses it to generate a timeline that can then be published, shared, or embedded on the web. It utilizes components from other open source tools like TimelineJS and Leaflet maps. It is simple to use, but not easily customizable.
Omeka is a content management system designed for the display of library, museum, archives, and scholarly collections and exhibitions.
Perl is a high-level, general-purpose, interpreted, dynamic programming language. Originally developed for text manipulation, it is now used for a wide range of tasks including graphics programming, system administration, network programming, applications that require database access and CGI programming on the Web.
- C, shell scripting (sh), AWK, and sed
- Powerful text processing facilities
- Flexibility and adaptability
- Support for multiple programming paradigms
Map Warper is a tool for digitally aligning ("rectifying") historical maps to match today's precise maps. It is used publicly by the NYPL to crowdsource georectification of their own library of digitised historical maps.
In the wider version developed by Tim Waters user supplied maps can be georectified for subsequent use in your own mapping projects.
RStudio is an integrated development environment (IDE) for R. It is available in both open source and consumer versions, and can run either on your desktop, or through a browser connected to RStudio Server. Features include syntax highlighting, code completion, smart indentation, and an interactive debugger.
Editors' Notes is an open-source, web-based tool for recording, organizing, preserving, and opening access to research notes, built with the needs of documentary editing projects, archives, and library special collections in mind.
Cirilo is an application developed for content preservation and data curation in FEDORA-based repository systems. Content preservation and data curation include object creation and management, versioning, normalization and standards, and the choice of data formats. The client offers functionalities which are especially prone to be used as tools for mass operations on FEDORA objects, such as ingest or replacement processes.
JuxtaposeJS helps tell stories by comparing two image frames, including photos and gifs. It provides a means, by a draggable pane, to see the changes and visually analyse the differences between the two images.
It’s an adaptable storytelling tool that is ideal for highlighting then/now stories that explain slow changes over time (growth of a city skyline, regrowth of a forest, etc.) or before/after stories that show the impact of single dramatic events (natural disasters, protests, wars, etc.).
Open Conference Systems (OCS) is a free Web publishing tool that can create a complete Web presence for scholarly conferences. It is a highly flexible management and publishing system that can be downloaded for free and installed on a local Web server. It has been designed to reduce the time and energy devoted to the clerical and managerial tasks associated with managing a conference, while improving the record-keeping and efficiency of editorial processes.
Bibliopedia will perform advanced data-mining and cross-referencing of scholarly literature to create a humanities-centered collaboratory. As a prototype, it will search resources including JSTOR and Library of Congress for metadata about scholarly articles and books that mention the famed medieval travel narrative The Travels of Sir John Mandeville, examine the articles and books for citations, then save the results in a publicly accessible database.
From the web site:
A statistical natural language parser for analyzing text to determine its grammatical structure.
A text-mining system for scientific literature. Textpresso's two major elements are (1) access to full text, so that entire articles can be searched, and (2) introduction of categories of biological concepts and classes that relate to objects (e.g., association, regulation, etc.) or describe one (e.g., methods, etc).
The Natural Language Toolkit (NLTK) is an open source Python library for text analysis and natural language processing. NLTK can tokenize strings (create a list of words from a set of characters), identify parts of speech, and perform operations based on a word's context.
OxGarage is a web, and RESTful, service to manage the transformation of documents between a variety of formats. The majority of transformations use the Text Encoding Initiative format as a pivot format.
OxGarage is based on the Enrich Garage Engine developed by Poznan Supercomputing and Networking Center and Oxford University Computing Services for the ENRICH project.
See the conversion matrix for details.
From the web site:
music21: A toolkit for computer-aided musicology
The Stanford Part-of-Speech Tagger includes English, Arabic, Chinese, and German tagger modules.
Zotpress is a WordPress plugin that enables users to easily add Zotero citations to a WordPress site by using a special syntax that turns a Zotero citation key into a well-formatted inline citation and bibliography, based on the data stored in Zotero.
GRETL () s a cross-platform software package for econometric analysis, written in C. It features:
A general-purpose high-level programming language that places an emphasis upon code readability. Python supports a number of development models, including object oriented, imperative, and functional design. It provides automatic memory management and a fully dynamic type system.
- Very clear, readable syntax
- Strong introspection capabilities
- Intuitive object orientation
- Natural expression of procedural code
A free (under the GNU General Public License) toolkit for the development of document image recognition systems.
- Custom dictionaries may be created to assist with analysis of specific record types
- Extensible functionality
- Optical character recognition (OCR) toolkit plugin
Scrapy is an open source programming library for web crawling and web page text extraction, written in Python. You can make calls to Scrapy code from within your own scripts and applications to automate the task of extracting data from websites.
You would typically use Scrapy to automate the task of visiting one or more web pages, on a website to which you have access. You could alternately use it to invoke web-based Application Programming Interfaces (APIs).
Tabula lets you upload a (text-based) PDF file into a simple web interface and magically pull tabular data into CSV format.
The HTML5 boilerplate is a set of HTML/CSS/JS templates for building websites. It offers cross-browser compatibility, caching and compression rules, mobile browser optimizations, graceful degradation, and IE-specific classes.
ImageJ is a Java open source image processing program designed for scientific multidimensional images. It is highly extensible, with thousands of plugins and macros for performing a wide variety of tasks, and a strong, established user base.
There are three major versions of ImageJ:
- ImageJ1 - The stable version, developed by Wayne Rasband at NIH since 1997
- ImageJ2 - Focuses on analysis of scientific multidimensional image data. Includes ImageJ1 with a compatibility layer
Users can upload photos and organize them into albums, and they can search photos that have been posted in public albums and filter the results by license (any Creative Commons license, licenses that allow commercial use, licenses that allow remixing).
Lexos is an online tool that enables you to "scrub" (clean) your text(s), cut a text(s) into various size chunks, manage chunks and chunk sets, and choose from a suite of analysis tools for investigating those texts. Functionality includes building dendrograms, making graphs of rolling averages of word frequencies or ratios of words or letters, and playing with visualizations of word frequencies including word clouds and bubble visualizations.
The Bamboo Services platform, currently in development, provides a number of useful features for tool developers who want to make their tool available as a service, including authentication and authorization options and a result-caching service that allows users to start long-running automated analysis jobs and receive a notification when their data is ready.
The HubZero Bamboo Work Space is an instance of HubZero enhanced with the combined functionality of the Bamboo Services Platform, Content Interoperability Hub, and Person Service built-in.
Project Quincy allows users to trace the development of social networks and institutions over time and space using information about people, places and organizations. It is a Django application with a MySQL database that can be installed on a web server.
RSiena is a package for the R language that enables the statistical analysis of network data, including longitudinal network data, longitudinal data of networks and behavior, and cross-sectional network data. It provides the same functionality available in SIENA (Simulation Investigation for Empirical Network Analysis), Windows software which is no longer maintained.
HUBzero is a web publication platform and content management system designed to facilitate collaboration on research and learning. In addition to standard blog and discussion features, HUBzero's most distinctive traits are a built-in environment that can run interactive software that scholars have developed within the browser, a tool development area, and the ability to share data and documents privately between members of the hub.
This is a frames-based page turner servlet with search functionality using XPATH.
Tumblr is a blogging/microblogging platform with a focus on data sharing between individual blogs. Users can create and disseminate data in a visual or HTML editor, using standard Tumblr posting formats: text, photo, quote, link, chat, audio, and video.
Built-in customization tools allow users to manipulate the appearance of their blog with little knowledge of web development. Tumblr also provides a CSS/HTML customization panel for more advanced users, including theme documentation and the ability to upload/alter theme asset files (e.g., CSS, JS).
FreeMind is Java-based mind mapping software.
MDID is software for teaching and learning with digital images, with tools for discovering, aggregating, and presenting digital media in a variety of learning spaces.
Greenstone is a suite of software for building and distributing digital library collections. It also allows users to publish to the internet or CD-ROM. Software interface and documentation available in English, French, Spanish, Russian, and Kazakh.
CoCoCo is an application for collecting, cataloging, and assessing the quality of user-submitted text or uploaded-file contributions.
Cytoscape is a platform for complex network analysis, visualization, and annotation.
Graphviz is open source software for graph visualization, representing structural information as diagrams of abstract graphs and networks. The package includes web and interactive graphical interfaces, and auxiliary tools, libraries, and language bindings.
Heritrix is web crawler used by the Internet Archive, which provides a web-based user interface after initial configuration on a Linux machine. Also used by the Library of Congress, Heritrix captures metadata in the Web ARChive (WARC) format.
HTTrack provides an easy-to-use interface for downloading websites-- including HTML, images, and other files-- or update a copy of a previously-downloaded site.
Drupal is an extremely flexible general content management system with numerous plugins that provide scholar-oriented functionality.
Juxta is an open-source cross-platform desktop tool for comparing and collating multiple witnesses to a single textual work. The software allows you to set any of the witnesses as the base text, to add or remove witness texts, to switch the base text at will, and to annotate Juxta-revealed comparisons and save the results. New in version 1.6.5 is the ability to upload your comparison sets to a free online workspace called Juxta Commons where you can analyze your data privately or choose to share visualizations of your work with anyone on the web.
LiveJournal is a community publishing platform, with features characteristic of both blogging and social networking platforms. The site is longstanding, originally established in 1999 as a blogging platform and online community built around personal journals. Today comprises more than 50 million journals, with topical focuses such as politics, entertainment, fashion, literature, and design.
FromThePage is free software that allows volunteers to transcribe handwritten documents on-line. It's easy to index and annotate subjects within a text using a simple, wiki-like mark-up. Users can discuss difficult writing or obscure words within a page to refine their transcription. The resulting text is hosted on the web, making documents easy to read and search.
The ScholarPress Researcher WordPress plugin displays a user's Zotero library within WordPress.
Global Translator automatically translates WordPress sites into a variety of user-chosen languages, using one of four translation engines (Google Translation Engine, Babel Fish, Promt, FreeTranslations).
Sophie is an electronic tool for authoring, collaborating, reading, and publishing rich media documents in networked environments. Built in Java it runs on a variety of platforms.
It does not support either the epub or mobi formats instead using its own internal format.
Development of the project seems to have stalled
A digital repository software package that may be used to accept, manage and publish digital objects. It is widely used in academia as a system to manage academic research papers, electronic theses and other distinct digital resources. EPrints offers an extensible plug-in architecture, enabling data processing activities to be tailored to the requirements of the institution.
NB is designed for faculty and students to collaboratively annotate PDFs in an online environment. The faculty member can upload course material (e.g. readings), and make it available to their students. Students can then create public and private annotations on those PDFs. NB includes a feature for faculty and (optionally) students to download the resulting fully-annotated PDFs.
Free and open source music notation program. From website:
MuseScore runs on Windows, MacOS, and Linux, and is available in over 40 different languages.
- Create sheet music with WYSIWYG editor
- Listen to your score with computer playback
- Share & print your score
- Work the way you like
- Get help
VennMaker provides an interactive platform for compiling, generating, visualising and analysing relationship data.
STACK is an extensible social media research toolkit designed to collect, process, and store data from online social networks. The toolkit is an ongoing project via the Syracuse University iSchool, and currently supports the Twitter Streaming API. Collecting from Facebook public pages and Twitter search API are under development. The toolkit architecture is modular and supports extending. Basic Linux / Mac command line skills needed.
To learn more: https://github.com/bitslabsyr/stack
MorphAdorner is a Java command-line program which acts as a pipeline manager for processes performing morphological adornment of words in a text. Currently MorphAdorner provides methods for adorning text with standard spellings, parts of speech and lemmata. MorphAdorner also provides facilities for tokenizing text, recognizing sentence boundaries, and extracting names and places.
OHMS (Oral History Metadata Synchronizer) inexpensively and efficiently enhances access to oral history by providing users with word-level search capability and a time-correlated transcript or indexed interview connecting the textual search term to the corresponding moment in the recorded interview online.
OHMS is an open source, web-based application designed to improve the user experience you provide for oral history, no matter what CMS or repository you use. There are 2 main components of the OHMS system
Commentpress is a theme and plugin for WordPress that enables granular public commenting on texts.
This package allows users to train topic models in MALLET and load results directly into R.
TAMS Analyzer is a program that works with TAMS to let you assign ethnographic codes to passages of a text just by selecting the relevant text and double clicking the name of the code on a list. It then allows you to extract, analyze, and save coded information.
TiddlyWiki is a reusable personal web notebook. It allows anyone to create personal hypertext documents that can be published on the Web, and also search and tag content. The developers write, "TiddlyWiki is designed to be non-linear, structuring content with stories, tags, hyperlinks, and other features. You can organise and retrieve your notes in ways that conform to your personal thought patterns, rather than feel chained to one preset organisational structure. You can use TiddlyWiki as a single file that you view and edit through any web browser, whether you are online or offline.
Ruby on Rails is a web development framework written in the Ruby language. It is intended to be used with an Agile development methodology that is used by web developers for rapid development.
OpenRefine (formerly Google Refine) is a tool for cleaning messy data (e.g. fixing inconsistencies), transforming it between different formats, and exploring data.
Islandora is an open-source software framework designed to help institutions and organizations and their audiences collaboratively manage, and discover digital assets using a best-practices framework. Islandora was originally developed by the University of Prince Edward Island's Robertson Library, but is now implemented and contributed to by an ever-growing international community.
Extensive set of tools to allow collaborative transcription of manuscript pages in TEI-compliant XML.
Features of T-PEN through version 1.2 [from project blog]
Zoom Tool in Transcription User Interface: Holding CTRL+SHIFT will result in a magnified image of the current line being transcribed.
VARD 2 is an interactive piece of software produced in Java designed to assist users of historical corpora in dealing with spelling variation, particularly in Early Modern English texts. The tool is intended to be a pre-processor to other corpus linguistic methods such as keyword analysis, collocations and annotation (e.g. POS and semantic tagging), the aim being to improve the accuracy of these tools
AGTK is a suite of software components for building tools for annotating linguistic signals, time-series data which documents any kind of linguistic behavior (e.g. audio, video). The internal data structures are based on annotation graphs. Annotation Graphs are a formal framework for representing linguistic annotations of time series data.
CorpusSearch 2 allows users to construct and search syntactically annotated corpora, including finding and counting lexical and syntactic patterns, correcting systemic errors, and coding linguistic features.
The software is released under Mozilla Public License 1.1 (MPL 1.1) .
Aelfred is a bare-bones Java XML parser. It has not been updated since 2002, and is dependent on JDK 1.4, which is very outdated.
Inkscape is an open-source vector based imaging programme similar to Adobe Illustrator, Corel Draw, Freehand, or Xara X.
Anki is flashcard program to aid in rote memorisation.
Anyone who needs to remember things in their daily life can benefit from Anki. Since it is content-agnostic and supports images, audio, videos and scientific markup (via LaTeX), the possibilities are endless.
Learning a language
Studying for medical and law exams
Memorizing people's names and faces
Brushing up on geography
Mastering long poems
Even practicing guitar chords!
Content curation and topic discovery website based primarily on publishers the user follows through social media.
Time Flow is an open-source timeline built to help journalists analyze temporal data. The application offers several view modes--timeline, calendar, list, table--to help explore thousands of data points. It is not a web-based tool--it is a desktop application that can run off a thumb-drive and is built to handle large datasets, and timeline events that may include approximate dates or date spans.
Frogr is a small application for the GNOME desktop that allows users to manage their accounts in the Flickr image hosting website. It supports all the basic Flickr features, including uploading pictures, adding descriptions, setting tags and managing sets and groups pools.
The latest stable version of Frogr (0.10) currently features a basic flickr uploader with the following features:
GIMP is image editing software, much like Photoshop. It is a multi-platform software application primarily used for image composition and editing. The basic tool may be augmented by plug-ins and extensions that allow the use of new file formats, effects filters and batch processing capabilities. GIMP was initially created to manipulate raster images, but has been extended to provide limited vector image and moving image support. A number of free extensions are available in the plugin registry.
The UVic Image Markup Tool allows you to "describe and annotate images, and store the resulting data in TEI XML files," all within a "simple enough interface that it can be used by people with little or no experience in editing XML code." Designed to be Windows-only, but can be successfully run on Linux using Wine. Guenther Goerz has created a Mac version of the Image Markup Tool using Wine Bottler (it's a large download).
Weka provides machine learning algorithms in Java for data mining and predictive modeling tasks. These algorithms can either be incorporated into other Java code or called from the Weka Workbench, a GUI environment.
PAIR is a sequence alignment algorithm for humanities text analysis designed to identify "similar passages" in large collections of texts. In addition to a Philologic add-on, PAIR is available as Text::Pair, a generalized Perl module that supports one-against-many comparisons. A corpus is indexed and incoming texts are compared against the entire corpus for text reuse.
960 Grid System is a CSS template that comes with corresponding Acorn, Fireworks, Flash, InDesign, GIMP, Inkscape, Illustrator, OmniGraffle, Photoshop, QuarkXPress, Visio, Exp Design, and printable templates to facilitate different stages of the web design process.
ScholarPress Vitaware enables you to build a CV in Wordpress using Zotero.
cue.language is a Java library that has tokenizing (words/sentences/ngram), string counting, language guessing, and stop word detection capabilities.
MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
The Visual Understanding Environment (VUE) is concept mapping software that can integrate with multiple repositories to pull in, organize, and analyze data. Multiple features for advanced management of digital resources for teaching, learning, and research.
Integrated Content Environment (ICE) was an open source project of the Learning Resources Development (LRD) unit at the University of Southern Queensland. The content management system allowed users to convert content authored in Microsoft Word or OpenOffice.org Writer into self-contained course websites using the IMS format.
The ICE authoring environment enabled:
Calibre is a free and open source ebook library management application, including options for syncing to devices and converting between a large number of formats. Calibre also has a built-in e-book editor for EPUB and AZW3 formats.
Windows .NET based open source Digital Asset Management solution designed for medium size preservation, cataloguing, media archiving and batch transcoding.
Solr is an open source enterprise search platform from the Apache Lucene project. It operates as a standalone full-text search server within an appropriate servlet container, such as Tomcat. Solr uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it easy to use from virtually any programming language.
Apache Lucene is a Java-based high-performance text search engine library.
MantisBT is a free popular web-based bugtracking system written in the PHP scripting language. The most common use of MantisBT is to track software defects. However, MantisBT is often configured by users to serve as a more generic issue tracking system and project management tool.
- event-driven-plug-in system
- works with MySQL, MS SQL, PostgreSQL, SQLite, Oracle and IBM DB2 databases
- RSS Feeds
- Customisable workflow
- Wiki integration
- Chat integration
A software tool for the manipulation of audio recordings. Wavesurfer provides basic audio editing function, such as excision, copying, pasting, zero-crossing adjustment, and effects such as fading, normalization, echo, inversion, reversal, replacement with silence, and DC-removal.
xMod is a desktop application which can transform a repository of XML into a completely finished website.
The entire process can be setup and run to produce a basic website assuming some prerequisites:
- A set of valid XML files. These would normally comply with a TEI DTD.
- a configuration script that indicates the relationship between files
- A 'personality pack' (CSS and image files) that determine its visual appearance. However if they are not present, the completed website falls back on a default look and feel.
"In the WordHoard environment, texts are annotated or tagged by morphological, lexical, prosodic, and narratological criteria. They are mediated through a 'digital page' or user interface that lets scholarly but non-technical users explore the greatly increased query potential of textual data kept in such a form."
Denemo is a music notation editor with a graphical user interface that allows users to enter notation for typesetting by the LilyPond music engraver.
Flare is an ActionScript library for creating visualizations that run in the Adobe Flash Player. From basic charts and graphs to complex interactive graphics, the toolkit supports data management, visual encoding, animation, and interaction techniques. Flare features a modular design that lets developers create customized visualization techniques without having to reinvent the wheel.
eXist-db is an open source database management system that stores XML data according to the XML data model and features efficient, index-based XQuery processing.
"The Virtual Lightbox for Museums and Archives (VLMA) is an educational tool for collecting and reusing in a structured fashion the online contents of museums and archives with visual components. With VLMA, you can browse and search collections, construct personal collections, export these collections to xml or Impress presentation format, annotate them, and share your collections with other VLMA users."
The Virtual Lightbox enables online image comparison, with features like an image-centric whiteboard. There are two versions, application and applet, which have different functionalities.
The Bamboo Content Interoperability Hub (CI gub) is an effort to largely automate the time-consuming process of downloading and compiling data from different repositories and archives and standardizing some of the format differences.
The Bamboo Person Service can help scholars access all the resources (tools, collections, and shared data) they have permissions for due to their institutional affiliations, scholarly society membership, association with research groups, etc. By connecting accounts from these different groups (e.g. a Google account for a research group, a university ID, and a generic username for a scholarly society), scholars will be able to use resources provided by any of these groups, in any work space or tool environment within the Bamboo ecosystem.
LibLime Koha is a web-based, open source integrated library system (ILS) that has also been used for virtual library systems (e.g. recreating historic libraries). LibLime Koha offers libraries circulation policies, patron management modules, parent-child relationship for patron records, club and service management features, in-depth "holds" support, single click batch import "undo" option, EzProxy compatibility, self-checkout interface and more.
GNU Wget is a free software package for retrieving files using HTTP, HTTPS and FTP.
Transformer can load Unicode-encoded text files and transform them in various ways, including trying to rescue text files created in word processing programs that no longer run (e.g. old versions of Word.)
PostgreSQL is a powerful, open source object-relational database system running on all major platforms. Support for native programming interfaces for C/C++, Java, .Net, Perl, Python, Ruby, Tcl and ODBC among others.
PDFsam can split and merge batches of PDFs.
Chrome is a free browser, similar to Firefox, with numerous free extensions.
CamStudio is free and open source screencasting software that saves the video as AVI files, though a Flash converter is included.
SARIT (Search and Retrieval of Indic Texts" is a collection of electronic editions of Sanskrit and other Indian-language texts that have dated and embedded notes about their change history. You can perform a text search, retrieval and analysis of works in SARIT, as well as download all the texts and convert them to PDF, HTML, etc.
CHET-C, or Chapel Hill Electronic Text-Converter, is a browser based software tool designed to convert digital texts that employ standard epigraphic conventions such as the Leiden sigla into EpiDoc-compliant XML files.
The tool can be accessed online at http://www.stoa.org/projects/epidoc/stable/chetc-js/chetc.html. Fragments of epigraphic text using standard sigla (eg Leiden convention markup) are pasted into the tool and Epidoc compliant XML is generated.
A software tool for the creation of 3D interactive environments. It may be used to model and interact with physical objects that currently or have previously existed in the real world, or create virtual environments that have not previously encountered. VirTool player software is available for Microsoft Windows, Apple MacOS X, Nintendo Wii and Microsoft Xbox 360.
A structured text editor that may be used to create, edit, validate and convert XML and SGML documents. EpcEdit contains an integrated validating parser, an editor for CALS and HTML tables, an attribute editor and an element manipulation tool.
Xalan is an XSLT processor for transforming XML documents into HTML, text, or other XML document types. It implements XSL Transformations (XSLT) Version 1.0 and XML Path Language (XPath) Version 1.0.
- Conversion between structured markup formats
- Stylesheet validation
Fedora (Flexible Extensible Digital Object Repository Architecture) was originally developed by researchers at Cornell University as an architecture to store, manage, and access digital content in the form of digital objects. Fedora defines a set of abstractions for expressing digital objects, asserting relationships among digital objects, and linking behaviors to digital objects.
A software application for the playback of audio recordings. SoundScriber offers specific functionality for researchers that wish to transcribe a recording. It was originally developed for use in the Michigan Corpus of Academic Spoken English (MICASE) project and released for use by academics performing similar work.
- Audio playback via installed audio codecs (e.g. Wav, MP3)
- Variable speed playback
A software application that enables relational databases to be created, managed and queried. The database management system enables multiple users to access a database through an appropriate interface. As an open source tool, MySQL underpins a number of free software projects, such as WordPress, phpBB and other software built on a LAMP infrastructure. Although widely used, there are a number of performance issues that limit its use in some environments. For example, it is unable to use multiple CPU cores to process a single query, potentially limiting its use as a data warehouse.
XSugar is a proof of concept tool for mapping textual content between a flat file schema and XML format. It performs statistical analysis to establish if transformations between the two formats are bi-directional, enabling content that has been converted into an XML format to be re-exported to the original flat file structure, or vice-versa. To validate the conversion, a schema must exist for source and destination formats, e.g. a bespoke XFlat encoded XML document that contains a definition of the structure of a class of flat files, an XML schema.
A software suite for displaying, converting and editing still images stored in a raster image format. Image manipulation may be performed via the command line, API libraries, or through a simply graphical user interface.
- Software control through API and command line
- Format conversion
- Image transformation
- Transparency support
- Format identification
- Text to image conversion
- Read, process, or write mega-, giga-, or tera-pixel image sizes
- Distributed pixel cache
- Perceptual hash
Pattern is a Python web mining module with tools for data retrieval (Google + Twitter + Wikipedia API, web spider, HTML DOM parser), text analysis (rule-based shallow parser, WordNet interface, syntactical + semantical n-gram search algorithm, tf-idf + cosine similarity + LSA metrics) and data visualization (graph networks).
BibDesk allows you to edit and manage your bibliography. It keeps track of the bibliographic information, associated files, and web links.
Bibus is a bibliographic and reference management software. As with other such tools, Bibus allows one to search, edit, and sort bibliographic records. In addition, Bibus contains features that makes it unique among open source and even commercial bibliographic databases:
Citeline was a web-based service that assisted with the publishing of bibliographies and citation collections. Users could create interactive exhibits and share your data online. Additionally, users could also import Zotero and BibTex files. Citeline was one of MIT's Simile project that has since been retired.
IssueLab is an online research archive in the nonprofit sector. In addition to archiving social policy research by nonprofits at no cost, IssueLab distributes this research online with a broad audience via e-newsletters, RSS feeds, and social media. Additionally, IssueLab advocates for the broad use of open licenses and the overall adoption of open access standards, to allow people to find and reuse nonprofit materials.
MediaWiki is a free software open source wiki package written in PHP, originally for use on Wikipedia and other Wikimedia Foundation projects. It is designed to be run on a large server farm for a website that gets millions of hits per day.
OpenOffice is an open-source office software suite for word processing, spreadsheets, presentations, graphics and databases.
Journler is a daily notebook and entry based information manager. Scholars, teachers, students, writers, and everyday users may use this on a daily basis to integrate their notebook content to other sources of media such as audio and video.
The site has not been updated since 201. It looks like Journaler is now available open source though the option to purchase is still displayed.
The Annotator's Workbench is a software tool that enables users to create a collection from a set of existing video files, segment that collection, create annotations and assign vocabulary terms to the segments, and control access. This tool is part of the Ethnographic Video for Institution and Analysis (EVIA) Digital Archive Project at Indiana University and the University of Michigan.
Annozilla allows you to view and create annotations associated with a web page. To use Annozilla you need to install the packages and get set up with an account with the W3C test server.
Digress.it is a WordPress plug-in that allows paragraph-level commenting in the margins of a text. Use digress.it as a collaboration tool for blog-style comment threads off individual paragraphs. Host your project on digress.it or download it for use with your own WordPress installation.
Project Pad is web-based system for media annotation and collaboration for teaching and learning and scholarly applications. Project Pad provides tools for browsing and working with audio, video, and images from digital repositories. The user may organize and annotate excerpts within their own "online notebook." Available as a standalone web application or set of Sakai tools.
Scribe is a note-taking program that allows you to manage your research notes, quotes, thoughts, contacts, published and archival sources, digital images, outlines, timelines, and glossary entries. You can create, organize, index, search, link and cross-reference your note and source cards.
Further information about Scribe can also be found at http://echo.gmu.edu/toolcenter-wiki/index.php?title=Scribe, 23.05.2014.
Xournal is an application for note-taking, sketching, and journaling.
Carrot2 is an open source search results clustering engine. It automatically organizes small collections of documents into categories.
Notational Velocity is open-source software that allows you to search across thousands of notes. You may create a new note or view or edit an existing note.
Pliny is a scholarly note-taking and annotation tool. It may be used with both digital (web pages, images, PDF files) and non-digital (books, printed articles) materials, run as a desktop application on the user's computer. Pliny is useful for taking and managing annotations and notes while reading, as well as subsequently developing and presenting an interpretation.
Casual allows you to search for a concept or common word and produces a unique cloud of concepts. Results are randomly displayed as definitions, related tags, and pictures.
WikiMindMap is a mindmapping tool that allows you to browse in wiki content. It uses an interactive interface based on a system of nodes and brackets. Maps may be exported into FreeMind.
Protovis composes custom views of data with simple marks such as bars and dots. Unlike low-level graphics libraries that quickly become tedious for visualization, Protovis defines marks through dynamic properties that encode data, allowing inheritance, scales and layouts to simplify construction.
Open Attribute is a suite of tools that makes it simple for anyone to copy and paste the correct attribution for any CC licensed work. These tools will query the metadata around a CC-licensed object and produce a properly formatted attribution that users can copy and paste wherever they need to.
VIVO is an open source semantic web application originally developed and implemented at Cornell. When installed and populated with researcher interests, activities, and accomplishments, it enables the discovery of research and scholarship across disciplines at that institution and beyond. VIVO supports browsing and a search function which returns faceted results for rapid retrieval of desired information.
Xena is free and open source software developed by the National Archives of Australia to aid in the long term preservation of digital records. Xena is an acronym meaning Xml Electronic Normalising for Archives. Xena is a component of the Digital Preservation Software Platform (DPSP). Xena software aids digital preservation by performing two important tasks: detecting the file formats of digital objects and converting digital objects into open formats for preservation.
TEI Boilerplate is a lightweight solution for publishing styled TEI (Text Encoding Initiative) P5 content directly in modern browsers. With TEI Boilerplate, TEI XML files can be served directly to the web without server-side processing or translation to HTML.
Prism is a tool for crowdsourcing interpretation. Welcome to our experiment in crowd-sourcing and visualizing many readings of a common set of texts.
ReDBox is a metadata registry application for describing research data.
HandBrake is an open-source, GPL-licensed, multiplatform, multithreaded video transcoder
Bookworm enables you to graphically explore lexical trends in repositories of digitized texts.
All Our Ideas is a research project that seeks to develop a new form of social data collection by combining the best features of quantitative and qualitative methods. Using the power of the web, we are creating a data collection tool that has the scale, speed, and quantification of a survey while still allowing for new information to "bubble up" from respondents as happens in interviews, participant observation, and focus groups.
Jekyll is a simple, blog aware, static site generator. It takes a template directory containing raw text files in various formats, runs it through Textile or Markdown and Liquid converters, and creates a complete, static ready-to-publish website suitable for serving with your favorite web server. Jekyll also happens to be the engine behind GitHub Pages, which means you can use Jekyll to host your project’s page, blog, or website from GitHub’s servers for free.
Neatline is a tool for the creation of interlinked timelines and maps as interpretive expressions of the literary or historical content of archival collections.
Pandoc can convert documents in reStructuredText, textile, HTML, or LaTeX formats to a variety of other formats including XHTML, PDF, EPUB, docx, odt, and more.
Markdown is a text-to-HTML conversion tool for web writers. Markdown allows you to write using an easy-to-read, easy-to-write plain text format, then convert it to structurally valid XHTML (or HTML).
Bluefish is a powerful editor targeted towards programmers and web designers, with many options to write websites, scripts and programming code. Bluefish supports many programming and markup languages, and it focuses on editing dynamic and interactive websites.
DSpace is the software of choice for academic, non-profit, and commercial organizations building open digital repositories. It is free and easy to install "out of the box" and completely customizable to fit the needs of any organization.
DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets. DSpace has an active community of developers and is used by thousands of institutions worldwide.
OpenETD is an open source, web-based software application for managing the submission, approval, and distribution of electronic theses and dissertations (ETDs).
Blacklight is an open source Ruby on Rails gem that provides a discovery interface for any Solr index. Blacklight provides a default user interface which is customizable via the standard Rails (templating) mechanisms. Blacklight accommodates heterogeneous data, allowing different information displays for different types of objects and features faceted browsing, relevance based searching, bookmarkable items, permanent URLs for every item, and user tagging of items.
The Dataverse Network is an application to publish, share, reference, extract and analyze research data. It facilitates making data available to others, and allows to replicate others work. Researchers and data authors get credit, publishers and distributors get credit, affiliated institutions get credit.
Search Flickr for photos, sort according to license types. Contains commercial as well as Creative Commons licensed photos.
Substance is a fully web-based document authoring and publishing platform.
Voyant Tools is a web-based reading and analysis environment for digital texts.
Improvise is a fully-implemented Java software architecture and user interface that enables users to build and browse highly-coordinated visualizations interactively. By coupling a shared-object coordination model with a declarative visual query language, users gain precise control over how navigation and selection affects the appearance of data across multiple views, using a potentially infinite number of variations on well-known coordination patterns such as synchronized scrolling, overview+detail, brushing, drill-down, and semantic zoom.
This online tool can be used for a wide variety of annotation tasks, including visualization and collaboration.
brat is designed in particular for structured annotation, where the notes are not freeform text but have a fixed form that can be automatically processed and "interpreted" by a computer. brat also supports the annotation of n-ary associations that can link together any number of other annotations participating in specific roles. brat also implements a number of features relying on natural language processing techniques to support human annotation efforts.
Korbo is a powerful aggregation platform for gathering Linked Data objects relevant to your area of research into single workspaces or “baskets”.
Korbo is targeted primarily at developers who want to build applications on top of its API and make full use of the linked cultural data from sources such as Europeana, FreeBase and DBPedia.
Korbo is currently in the early stages of development, but you can already try out a demo version of the platform.
PyBossa is a free, open-source, platform for creating and running crowd-sourcing applications that utilise online assistance in performing tasks that require human cognition, knowledge or intelligence such as image classification, transcription, geocoding and more.
Archivematica is a digital preservation system that is designed to maintain standards-based, long-term access to collections of digital objects. It uses a micro-services design pattern to provide an integrated suite of software tools that allows users to process digital objects from ingest to access in compliance with the ISO-OAIS functional model. Users monitor and control the micro-services via a web-based dashboard. Archivematica uses METS, PREMIS, Dublin Core and other best practice metadata standards.
Android app that scans the ISBN barcode on a book and pulls the bibliographic metadata into your Zotero library.
Qnotero provides lightning quick access to users' Zotero references. It lives in the system tray and allows users to search through their references by author and/ or year of publication. If a PDF file is attached to a reference, users can open it directly from within Qnotero.
Zandy provides users with access to their Zotero library through a simple interface. Users can browse and modify the items in their libraries, add new items, view attachments, take and edit item notes, search their libraries, and add webpages from the Android browser.
The cross-platform Advene application allows users to easily create comments and analyses of video documents, through the definition of time-aligned annotations and their mobilisation into
automatically-generated or user-written comment views (HTML documents). Annotations can also be used to modify the rendition of the audiovisual document, thus providing virtual montage, captioning, navigation... capabilities. Users can exchange their comments/analyses in the form of Advene packages, independently from the video itself.
The Protege Project offers WebProtege and Protege Desktop, which are free and open source ontology editing applications.
TemaTres is an open source vocabulary server, web application to manage and exploit vocabularies, thesauri, taxonomies and formal representations of knowledge.
SimpleTCT is a simplified management environment designed to assist in textual comparison. Users can display the contents of .rtf files, define themes, highlight passages and add personal notes as required. A document may then be exported containing all of the selected passages, organised thematically, situated alongside the relevant notes.
Meld is a visual diff and merge tool targeted at developers. Meld helps you compare files, directories, and version controlled projects. It provides two- and three-way comparison of both files and directories, and has support for many popular version control systems.
Participad is a WordPress plugin that allows multiple people to edit the same WP content at the same time. Powered by Etherpad Lite, Participad gives you: notepads for collaborative notetaking; synchronous authoring of any content in the WordPress Dashboard; front-end editing. You can download it from the WordPress plugin repository.
Participad has three modules:
Designed for journalists, a web-based service for working with primary sources that features search and analysis, highlighting and annotation, and document sharing. Uses Tesseract for OCR and allows users to upload documents and search other its document collection.
Digitate is a free application designed for use on the iOS platform, specifically on iPad devices. The application allows scholars and enthusiasts with an interest in the visual and material elements of a cultural artefact to make notes and annotations directly on an image of such an artefact.
For example, a literary scholar might use it to annotate the material or bibliographic elements of a rare text or first edition, while an art historian might do the same on an image of a painting.
Nomenklatura is a reference data recon server. It is a service that allows users to define and manage manage lists of canonical entities (e.g. person or organization names) and aliases that connect to one of the canonical entities. This helps to clean up messy data in which a single entity may be referred to by many names.It includes a user interface, an API, and a reconciliation endpoint for OpenRefine for matching data from data sets with the canonical entries.
From the website: NodeXL is a free, open-source template for Microsoft® Excel® 2007 and 2010 that makes it easy to explore network graphs. With NodeXL, you can enter a network edge list in a worksheet, click a button and see your graph, all in the familiar environment of the Excel window. (http://nodexl.codeplex.com/)
Ptolemaic is a computer application for music visualization and analysis written in the Java programming language. The software is designed to aid in the analysis of all types of Western music using established analytical techniques, including tonal functional analysis (Harrison 1994), pitch-class set analysis (Forte 1973), hierarchical linear analysis (Schenker 1935, Jones 2002), tonal pitch-space analysis on the Tonnetz (Riemann 1915), pitch-class set analysis (Forte 1973), and transformation analysis (Lewin 1987).
(from web page)
Historical musical pieces make their way to us through multiple documents and it often happens that multiple sources introduce differences and variants in the music. meiView is an experimental web application designed to display 15–16th century music and provide a dynamic mechanism for the user to select which variant they want to see.
meiView is an open source software licensed under Apache 2.0. See the source code on
(from web page)
Augmented Notes integrates scores and audio files to produce interactive multimedia websites in which measures of the score are highlighted in time with music.
To get started making your own website, upload mp3 and ogg versions of your audio file, images of the pages of your score, and, if you have one, an MEI file containing measure boundaries, then hit submit below. Augmented Notes will help you mark the measures on the score and assign them times, and will output a zip file of everything you need to have your own website.
Bokeh is a Python interactive visualization library for large datasets that natively uses the latest web technologies. Its goal is to provide elegant, concise construction of novel graphics in the style of Protovis/D3, while delivering high-performance interactivity over large data to thin clients.
With ediarum researchers can comfortably transcribe, encode and edit manuscripts in TEI-XML, as well as publish their results in an online or print edition. The solution, developed by TELOTA, is based on three software components: exist-db, Oxygen XML Author, and ConTeXt. These are combined, supplemented with additional functions, and tailored to fit a project's needs.
Annotation Studio is an open source, web-based annotation application that integrates a powerful set of textual interpretation tools behind an intuitive and easy-to-use interface. Users can upload their own texts, and annotate with styled text, video, images, and weblinks. To date, the project has been used with great success in disciplines such as Writing, Literature, Foreign Languages, Anthropology, Film and Media Studies, and others at institutions including Harvard, Yale, Stanford, MIT, Barnard College, and Washington University.
ManageWP is a paid service used to manage multiple WordPress instances across multiple servers. It provides dashboard monitoring of performance as well as batch updating and management of themes, plugins, comments, etc. It provides facility for syndication of material across multiple sites as well as aggregated analytics and cloning of WordPress instances.
Ghost is a free, open source publishing platform. Also available as a hosted service for a monthly subscription cost.
MapCraft is a tool for tracking mapping progress during a mapping party or other coordinated project taking place in a concentrated area. It shows a cake diagram as a bunch of clickable areas, with the ability to take ownership of cake slices, as well as commenting on them, and chatting to eachother about it. It takes the concept of a "cake diagram" and makes it more dynamic.