TL;DR

Voyant Tools is a browser-based text analysis platform. Simply paste or upload text data to instantly perform word cloud generation, KWIC (Key Word In Context) analysis, co-occurrence analysis, topic modeling, TF-IDF calculations, and more. It supports Japanese morphological analysis and is widely used as a standard text mining tool in Digital Humanities.

What is Voyant Tools?

Voyant Tools is an open-source text analysis environment developed by Stéfan Sinclair and Geoffrey Rockwell. With over 20 years of history since its first version in 2003, it is one of the most widely used text analysis tools in the DH field.

Key features include:

  • Fully browser-based: Just visit voyant-tools.org to get started
  • 24+ analysis tools: Word clouds, frequency analysis, KWIC, co-occurrence networks, trend graphs, and more
  • Multilingual support: Handles English, Japanese, Chinese, Korean, and many other languages
  • Corpus sharing: Analysis results can be shared via URL, enabling reproducible research
  • API access: Programmatic access is also available

Key Analysis Features

Cirrus (Word Cloud)

Visualizes high-frequency words as a word cloud. Customizable stopword lists allow appropriate filtering for any target language.

KWIC (Concordance)

Displays how specific keywords are used within their context. Analyze usage patterns while viewing surrounding text.

Shows how word frequencies change throughout a document as line graphs. Track multiple terms simultaneously to analyze thematic shifts within texts.

Visualizes co-occurrence relationships between words as a network graph, revealing which words frequently appear together.

Topics (Topic Modeling)

LDA-based topic modeling automatically extracts latent topics from text corpora.

How to Use

Basic Workflow

  1. Visit Voyant Tools
  2. Paste text into the input area or upload files (TXT, PDF, HTML, XML, etc.)
  3. Click “Reveal” to display multiple analysis panels simultaneously
  4. Adjust each panel’s settings to deepen your analysis
  5. Use the export button to download images or data

Analyzing Japanese Text

Voyant Tools supports Japanese morphological analysis. When uploading Japanese text, set the language to “Japanese” for proper segmentation. Note that accuracy for specialized terminology and proper nouns may be limited, so reviewing results as you proceed is recommended.

Practical Applications in DH Research

Comparative Literary Analysis

Load multiple literary works as a corpus to analyze vocabulary differences and shared expression patterns across works. For example, quantitatively demonstrate how Natsume Soseki’s vocabulary evolved between his early and late periods.

Historical Document Analysis

Track discourse changes around specific social issues over time using digitized newspaper archives. Combining the Trends tool with KWIC enables simultaneous quantitative and qualitative analysis.

Academic Literature Survey

Apply topic modeling to a large collection of research paper abstracts to grasp overall research trends in a field.

Comparison with Other Tools

FeatureVoyant ToolsAntConcKH Coder
EnvironmentBrowserDesktopDesktop
Japanese supportYesLimitedExtensive
VisualizationRichBasicRich
ProgrammingNot requiredNot requiredNot required
SharingURL sharingNoneNone

Conclusion

Voyant Tools is a powerful platform that makes rich text analysis accessible without specialized knowledge or programming skills. It is particularly well-suited for quickly grasping an overview of text data in the early stages of DH research. The URL-based corpus sharing feature also ensures research reproducibility.

References