Transcription

What kind of data should the tool work with?

Recogito is an online platform for collaborative document annotation.

Recogito provides a personal workspace where you can upload, collect and organize your source materials - texts and images - and collaborate in their annotation and interpretation. Recogito enables you to make your work more visible on the Web more easily, and to expose the results of your research as Open Data.

Code license: Open source, Apache License
Last updated: 21 Dec 2016

The goal of the Alpheios project is to help people learn how to learn languages as efficiently and enjoyably as possible, and in a way that best helps them understand their own literary heritage and culture, and the literary heritage and culture of other peoples throughout history. One of the principal tools, a Firefox plugin, allows a reader to browse a web page with Latin, ancient Greek, or Arabic, click on a word, and get a definition and morphological analysis of the word.

Code license: Open source, GNU GPL
Last updated: 7 Jun 2016

An (optical character recognition) engine for creating editable and searchable electronic files from scanned paper documents, PDFs and digital photographs.
Features:

  • Recognition of Digital Camera and Mobile Phone Camera Images
  • Comprehensive Language Support
  • Complete Integration with Popular Office Applications
  • PDF conversion, archiving and security
Code license: Closed source
Last updated: 17 May 2016

DM is an environment for the study and annotation of images and texts. It is a suite of tools, enabling scholars to gather and organize the evidence necessary to support arguments based in digitized resources. DM enables users to mark fragments of interest in manuscripts, print materials, photographs, etc. and provide commentary on these resources and the relationships among them.

Last updated: 1 May 2016

EXMARaLDA is a system for working with oral corpora on a computer. It consists of a transcription and annotation tool (Partitur-Editor), a tool for managing corpora (Corpus-Manager) and a query and analysis tool (EXAKT).

EXMARaLDA supports the time-aligned transcription of digital audio or video, a flexible annotation for freely choosable categories, the systematic documentation of a corpus through metadata, a flexible visualisation of transcription data in different layouts and document formats, as well as the computer-assisted querying of transcription, annotation and metadata.

Code license: Open source
Last updated: 21 Oct 2015

Omeka is a content management system designed for the display of library, museum, archives, and scholarly collections and exhibitions.

Code license: Open source, GNU GPL
Last updated: 2 Aug 2015

InqScribe is a low-cost, easy-to-use software for creating transcripts from video or audio files. It has a lot of simple customization features which make it easy to use, but not a lot of extra stuff that might bog a new user down. It's best for transcription, but it can also be used for creating subtitles for videos (it can be set up to export a video with the transcript overlaid on it during playback) and even for coding data (best for codes that do not need durations associated with them). It exports to a range of formats (like CSV) for easy analysis.

Last updated: 14 Jun 2015

FromThePage is free software that allows volunteers to transcribe handwritten documents on-line. It's easy to index and annotate subjects within a text using a simple, wiki-like mark-up. Users can discuss difficult writing or obscure words within a page to refine their transcription. The resulting text is hosted on the web, making documents easy to read and search.

Code license: Open source, GNU Affero GPL
Last updated: 2 May 2015

OHMS (Oral History Metadata Synchronizer) inexpensively and efficiently enhances access to oral history by providing users with word-level search capability and a time-correlated transcript or indexed interview connecting the textual search term to the corresponding moment in the recorded interview online.

OHMS is an open source, web-based application designed to improve the user experience you provide for oral history, no matter what CMS or repository you use. There are 2 main components of the OHMS system

Code license: Open source
Last updated: 6 Apr 2015

Extensive set of tools to allow collaborative transcription of manuscript pages in TEI-compliant XML.

Features of T-PEN through version 1.2 [from project blog]

Zoom Tool in Transcription User Interface: Holding CTRL+SHIFT will result in a magnified image of the current line being transcribed.

Last updated: 17 Mar 2015

Praat is software for the phonetic analysis of speech, including support for articulatory and speech synthesis.

Code license: GNU GPL v2
Last updated: 19 Feb 2015

AGTK is a suite of software components for building tools for annotating linguistic signals, time-series data which documents any kind of linguistic behavior (e.g. audio, video). The internal data structures are based on annotation graphs. Annotation Graphs are a formal framework for representing linguistic annotations of time series data.

Code license: Open source
Last updated: 11 Feb 2015

Dragon Dictation is a voice recognition application that allows you to speak and instantly see your text content from email messages to blog posts on your iPad, iPhone, or iPod Touch.

Code license: Closed source
Last updated: 29 Dec 2014

Express Scribe is a professional audio player software for PC or Mac that assists in the transcription of audio recordings.

Code license: Closed source
Last updated: 29 Dec 2014

eLaborate is an online work environment in which scholars can upload scans, transcribe and annotate text, and publish the results as on online text edition which is freely available to all users.

Code license: GNU GPL v3
Last updated: 29 Dec 2014

OpenETD is an open source, web-based software application for managing the submission, approval, and distribution of electronic theses and dissertations (ETDs).

Code license: Open source, GNU GPL v3
Last updated: 29 Dec 2014
CSV
Subscribe to Transcription