text analysis

What kind of data should the tool work with?

Textable is an open source program for text analysis. It offers a set of basic text-analytic components (e.g. import text from files, segment into words, measure segment diversity, etc.), which the user combines using a visual interface to build custom analytic workflows.

Code license: GNU GPL v3
Last updated: 20 Aug 2017

Con DiscoverText, los usuarios pueden importar datos desde una amplia variedad de fuentes (como Facebook &amp, inicio de Twitter, texto simple, Word, Excel, comentarios públicos de YouTube, blogs o wikis, PDF, entre otros), codificarlos y generar nubes de palabras e informes.

Code license: Closed source
Last updated: 24 Feb 2017

AroniSmartIntelligence™ is an application that performs text analytics on RSS articles, reviews, feedback, chat data or other unstructured texts organized into sub-folders. The output may be further input into other advanced statistical analytics or data mining modules available in AroniSmartIntelligence™, including regression analysis, econometrics, segmentation and Bayesian models.

Code license: Closed source
Last updated: 18 Mar 2016

TAToo is an embeddable Flash widget that displays TAPOR analytics for the page on which it resides.

Code license: Apache License
Last updated: 23 Feb 2016

The TAPoR Portal is an online environment where users can keep track of texts they want to study (uploaded or available online), learn about and try different tools, and run tools on texts.

Last updated: 23 Feb 2016

A graphical user interface tool for Latent Dirichlet Allocation topic modeling.

Last updated: 17 Feb 2016

El programa MONK workbench provee 525 obras de la Literatura de América del siglo XVIII y XIX y 37 obras de teatro y 5 obras de poesía escritas por William Shakespeare, junto con herramientas que permiten la investigación literaria a través del descubrimiento, el análisis y la visualización de patrones.

Last updated: 12 Aug 2015

Este producto puede filtrar o brindar formato a contenido de texto. También incluye un organizador de enlaces y documentos y capacidades de búsqueda. Por esto, es posible que sea más apropiado denominarlo como un sistema de administración de textos. Si el usuario tiene una gran cantidad de documentos guardados en la computadora y enlaces en línea que usa, esta es una aplicación de utilidad que le permite navegar en el entorno con más facilidad.

Code license: GNU GPL v3
Last updated: 15 Jun 2015

"Linguistic Inquiry and Word Count (LIWC) es un software de análisis de datos para consultas lingüísticas y conteo de palabras... LIWC permite calcular el grado de uso de diversas categorías de palabras a través de un gran conjunto de textos". Análisis web limitado disponible.

Last updated: 23 May 2015

Whatizit puede adquirir hasta 500.000 términos que se copian en el cuadro de texto, para ejecutar cualquier operación de segmentación de análisis de texto predefinida.

Last updated: 23 May 2015

El Analizador Macro-Etimológico (Macro-Etymological Analyzer, en inglés) es una aplicación web para el análisis de textos que busca todas las palabras de un texto dado en la red global etimológica (Etymological Wordnet) y genera estadísticas respecto de la etimología macro del texto, organizando las palabras según la raíz lingüística. Por ejemplo, puede analizar una novela y calcular las proporciones de palabras de origen anglosajón o afroasiático.

Code license: GNU GPL v3
Last updated: 20 May 2015

Lexos es una herramienta en línea que permite al usuario "restregar" (limpiar) los textos, cortar fragmentos en diferentes tamaños, manejar los fragmentos y conjuntos de fragmentos, y elegir a partir de una gama de herramientas de análisis para investigar estos textos.Sus funciones incluyen la construcción de dendogramas, la realización de gráficos sobre promedios móviles de frecuencias de palabras o proporciones de palabras o letras, y el juego con las visualizaciones de las frecuencias de palabras, como nubes de palabras y visualización en burbujas.

Code license: Open source
Last updated: 17 May 2015

AntWordProfiler es un software libre para el análisis de texto. En este caso analiza la frecuencia de palabras.

Last updated: 9 May 2015

Juxta es una herramienta de escritorio multiplataforma y de código abierto para comparar y compilar varios testimonios, convirtiéndolos en un único texto. El programa permite configurar cualquiera de los testimonios como texto de base, agregar o quitar testimonios, cambiar el texto base, hacer anotaciones a las comparaciones generadas por Juxta y guardar los resultados.

Code license: Open source, Creative Commons
Last updated: 4 May 2015

Quirkos es un software de análisis de texto para principiantes interesados en investigación cualitativa. Permite usar visualizaciones en tiempo real como interfaz . Admite operaciones de codificación-recuperación , búsquedas y consultas (queries) sobre datos. Sumado a ello, visualiza relaciones entre tópicos y temas.
Puedes encontrar más información en el siguiente link: http://www.quirkos.com/qualitative-data-analysis-software.html

Code license: Closed source
Last updated: 3 May 2015

A software tool for performing concordance – the analysis of a set of words within its immediate context - on a body of text. The tool performs full concordance, reading and analysing each and every word in a text. It was initially written for the analysis of English texts, but has since been extended to cater for other Western languages. Limited support is also provided for text in East Asian scripts, such as Chinese and Korean.

Features:

Code license: Closed source
Last updated: 11 Feb 2015

CATMA (Computer Aided Textual Markup & Analysis) is a free, open source markup and analysis tool from the University of Hamburg's Department of Languages, Literature and Media. It incorporates three interactive modules: (1) The tagger enables flexible and individual textual markup and markup editing. (2) The analyzer incorporates a query language and predefined functions. It also includes a query builder that allows users to construct queries from combinations of pre-defined questions while allowing for manual modification for more specific questions.

Code license: GNU GPL v3
Last updated: 29 Dec 2014

MONK is a digital environment designed to help humanities scholars discover and analyze patterns in the texts they study.

Last updated: 29 Dec 2014

HyperPo is a user-friendly text exploration and analysis program that allows users to import texts or use texts available online (in English or French), and provides frequency lists of characters, words and series of words, color-coding to indicate repetition, KWIC, co-occurrence and distribution lists, and the ability to simultaneously compare data from multiple texts.

Last updated: 29 Dec 2014

text analytic and data extraction framework: data and semantic analytics in a suite of business applications.

Last updated: 29 Dec 2014

Basis provides natural language processing technology for the analysis of unstructured multilingual text.

Last updated: 29 Dec 2014

The main programs that comprise the Information processor are called the analyst server and query or knowledge processor. The analyst program can be called from a command line, from an html form, or through a TCP/IP socket protocol. The query processor can be accessed with any browser using HTML commands. It analyzes text and allows the user to search it.

Code license: Closed source
Last updated: 29 Dec 2014

The Versioning Machine displays multiple versions of text encoded according to TEI Guidelines and allows for comparisons of annotation and introductory materials. This is a text editor and allows editors "to immediately see the consequences of their editorial decisions." This tool does not appear to have been updated since 2011.

Last updated: 29 Dec 2014

CollateX-based text collation client. CollateX, run on an server independent from the URL above, is a powerful, fully automatic, baseless text collation engine for multiple witnesses. A second collation technique, ncritic, provides a slightly different baseless text collation. Each engine complements each other nicely. The user can use different files, even URLs, then output the result in GraphML, TEI, JSON, HTML, or SVG. Fuzzy matching is an option.

Last updated: 29 Dec 2014

LATtice lets you explore and compare texts across entire corpora but also allows you to “drill down” to the level of individual LATs (language action types) to ask exactly what rhetorical categories make texts similar or different.

Last updated: 29 Dec 2014

Bookworm enables you to graphically explore lexical trends in repositories of digitized texts.

Code license: Open source
Last updated: 29 Dec 2014

Voyant Tools is a web-based reading and analysis environment for digital texts.

Code license: Open source
Last updated: 29 Dec 2014

Kaleidoscope is one of the world's best tools for spotting differences in images and text, and now it supports merging of files and folders, too. Kaleidoscope integrates directly with Git, Subversion, Mercurial, and Bazaar to fit perfectly in your workflow.

Last updated: 29 Dec 2014

Meld is a visual diff and merge tool targeted at developers. Meld helps you compare files, directories, and version controlled projects. It provides two- and three-way comparison of both files and directories, and has support for many popular version control systems.

Code license: Open source, GNU GPL v2
Last updated: 29 Dec 2014

The Tesserae project aims to provide a flexible and robust web interface for exploring intertextual parallels.

Last updated: 29 Dec 2014

TVE is an interactive Java tool for exploring the effect of window size on three common linguistic measures: type-token ratio, proportion of hapax legomena, and average word length. In addition, TVE can cluster the text fragments according to a user-given set of words by applying principal component analysis (PCA).

Last updated: 29 Dec 2014

Annotation Studio is an open source, web-based annotation application that integrates a powerful set of textual interpretation tools behind an intuitive and easy-to-use interface. Users can upload their own texts, and annotate with styled text, video, images, and weblinks. To date, the project has been used with great success in disciplines such as Writing, Literature, Foreign Languages, Anthropology, Film and Media Studies, and others at institutions including Harvard, Yale, Stanford, MIT, Barnard College, and Washington University.

Code license: Open source, GNU GPL, GNU GPL v2
Last updated: 29 Dec 2014
CSV
Subscribe to text analysis