Google Maps is a web mapping service application that includes street maps, satellite images, street view perspectives, as well as web functions such as routing and geocoding. The API can be used outside of the normal Google Maps interface for other projects.
BatchGeo is an online service that maps address data as points. The cut and paste interface makes it easy to convert a spreadsheet of street addressed into a map can be embedded or downloaded as a KML file. A limited number of addresses can be mapped for free; large files require a subscription.
Beautiful Soup is a library, written in the Python programming language, for pulling specific pieces of data out of HTML and XML files. It is especially suitable when working with data files that aren't well-formed, or are otherwise difficult to parse.
Saves programmers hours or days of work on quick-turnaround screen scraping projects.
The LC Newspaper Viewer is an open-source web application that understands how to model newspaper data created according to a set of technical guidelines, with the goal of publishing an online archive like Chronicling America.
capella-scan can "OCR" music scores from PDF or common image formats and output the results in MusicXML for use with common music editing software.
epub-tools is a collection of Python tools for generating and managing epub documents from Word, RTF, DocBook, TEI and FictionBook.
Voyeur is a web-based text analysis environment where users can apply a wide variety of tools to any text they import.
Zoho provides a drag-and-drop interface for creating database-driven applications, such as forms.
Open Journal Systems (OJS) is a journal management and publishing system. Public Knowledge Project (the sponsor of OJS) is a multi-university initiative developing (free) open source software and conducting research to improve the quality and reach of scholarly publishing
DEVONthink is a database that helps users organize, manage and collaborate on digital files, including Office files, links, e-mails, research data and PDFs.
Philologic is a full-text search, retrieval and analysis tool with support for TEI-Lite XML/SGML, Unicode encoding, plaintext, Dublin Core/HTML, and DocBook.
Microsoft Sharepoint is an environment for sharing documents with collaborators, using granular permissions. Sharepoint can tightly integrated with Microsoft Office (e.g. Office documents can be saved directly to Sharepoint, some Sharepoint installations allow web-based editing using the cloud-hosted Office 365. Sharepoint is commonly used to host collaborative workspaces, data management system, wikis and blogs.
- Extensive integration with Microsoft Office System programs
RStudio is an integrated development environment (IDE) for R. It is available in both open source and consumer versions, and can run either on your desktop, or through a browser connected to RStudio Server. Features include syntax highlighting, code completion, smart indentation, and an interactive debugger.
SharpEye is music scanning/"OCR" software that can convert an image of a score into an editable format such as MusicXML.
OxGarage is a web, and RESTful, service to manage the transformation of documents between a variety of formats. The majority of transformations use the Text Encoding Initiative format as a pivot format.
OxGarage is based on the Enrich Garage Engine developed by Poznan Supercomputing and Networking Center and Oxford University Computing Services for the ENRICH project.
See the conversion matrix for details.
PDFtoMusic Pro converts PDFs created by other music notation programs to MusicXML scores.
Anthologize is a WordPress plugin that allows users to outline, order, and edit content into a single volume that can be exported as PDF, TEI or epub.
Scripto is an engine for crowdsourcing the transcription of content that can be integrated with a custom transcription GUI and existing CMS.
Heritrix is web crawler used by the Internet Archive, which provides a web-based user interface after initial configuration on a Linux machine. Also used by the Library of Congress, Heritrix captures metadata in the Web ARChive (WARC) format.
SiteSucker is OSX and iOS software that can download an entire website, including images and videos.
HTTrack provides an easy-to-use interface for downloading websites-- including HTML, images, and other files-- or update a copy of a previously-downloaded site.
Afloat is a utility that adds new window management functionality to OSX, including keeping a window on top, or turning a window into an overlay on the screen. This can be particularly useful in full-screen view (e.g. during a presentation) when you want to keep another small window, such as a Twitter client, visible somewhere on the screen.
Juxta is an open-source cross-platform desktop tool for comparing and collating multiple witnesses to a single textual work. The software allows you to set any of the witnesses as the base text, to add or remove witness texts, to switch the base text at will, and to annotate Juxta-revealed comparisons and save the results. New in version 1.6.5 is the ability to upload your comparison sets to a free online workspace called Juxta Commons where you can analyze your data privately or choose to share visualizations of your work with anyone on the web.
Evernote is note-taking software in the cloud, with options for private and shared notebooks. Users can take text notes, and upload files to attach them to notes. Evernote has built-in OCR for images with printed or handwritten text. A premium account allows access to notebooks offline, as well as more storage and embedded PDF search.
Leximancer is text analysis software that can create topic and concept based network visualizations and includes a sentiment analyzer.
Netvibes offers a free personal web dashboard for following feeds, friends and using the provided apps. A premium account includes functionality for analytics, tagging, curation, alerts, sentiment analysis, and search.
Global Translator automatically translates WordPress sites into a variety of user-chosen languages, using one of four translation engines (Google Translation Engine, Babel Fish, Promt, FreeTranslations).
Sophie is an electronic tool for authoring, collaborating, reading, and publishing rich media documents in networked environments. Built in Java it runs on a variety of platforms.
It does not support either the epub or mobi formats instead using its own internal format.
Development of the project seems to have stalled
Scrivener is software for writing that includes virtual index cards, outlining, version control, import/export options, and scriptwriting features, and provides a management system for notes and documents plus support for document metadata.
It allows the creation of documents from sub documents, ebook (epub and Kindle/mobi) and TeX and LaTeX export as well as ODF, PDF and Microsoft Word exports.
A Linux version is in beta, and an iOS version is reportedly under development
R is a free software environment for statistical computing and graphics. R can be run from the command line, or using any of the many graphical user interfaces available on a variety of platforms; these are listed as separate tools.
Navicat for MySQL is an interface for working with MySQL databases, including importing data from CSV or Excel, exporting, reporting, querying, and for developing scripts etc, and general database exploration
Navicat for MySQL can also be used with MariaDB databases
SEASR provides an environment for developing data flows that ingest data, process it through a series of transformations and analytics, and send the data to a results viewer.
MONK is a digital environment designed to help humanities scholars discover and analyze patterns in the texts they study.
The Visual Understanding Environment (VUE) is concept mapping software that can integrate with multiple repositories to pull in, organize, and analyze data. Multiple features for advanced management of digital resources for teaching, learning, and research.
GitHub is a web-based repository service which offers the distributed revision control and source code management (SCM) functionality of GIT with a graphical user interface, desktop, and mobile integration. It also provides collaboration tools such as access control, wikis, task management, code review, bug tracking, and feature requests. It offers free accounts, often used to host opensource software projects, and private (paid) repositories.
Image Map Tool allows you to upload an image (or specify the URL of an image found online) and turn it into a clickable image map.
Integrated Content Environment (ICE) was an open source project of the Learning Resources Development (LRD) unit at the University of Southern Queensland. The content management system allowed users to convert content authored in Microsoft Word or OpenOffice.org Writer into self-contained course websites using the IMS format.
The ICE authoring environment enabled:
HyperPo is a user-friendly text exploration and analysis program that allows users to import texts or use texts available online (in English or French), and provides frequency lists of characters, words and series of words, color-coding to indicate repetition, KWIC, co-occurrence and distribution lists, and the ability to simultaneously compare data from multiple texts.
MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
Calibre is a free and open source ebook library management application, including options for syncing to devices and converting between a large number of formats. Calibre also has a built-in e-book editor for EPUB and AZW3 formats.
Text Fixer allows users to copy and paste a Word document into a box and convert it to clean HTML.
Twapper Keeper lets users create an archive of tweets based on hashtag, keyword, or person, for them to review online.
PhotoScore takes an image of a music score-- including handwritten scores-- and outputs it in an editable format, including MusicXML.