TL;DR
Voyant Tools is a browser-based text analysis platform. Simply paste or upload text data to instantly perform word cloud generation, KWIC (Key Word In Context) analysis, co-occurrence analysis, topic modeling, TF-IDF calculations, and more. It supports Japanese morphological analysis and is widely used as a standard text mining tool in Digital Humanities.
What is Voyant Tools?
Voyant Tools is an open-source text analysis environment developed by Stéfan Sinclair and Geoffrey Rockwell. With over 20 years of history since its first version in 2003, it is one of the most widely used text analysis tools in the DH field.
Key features include:
- Fully browser-based: Just visit voyant-tools.org to get started
- 24+ analysis tools: Word clouds, frequency analysis, KWIC, co-occurrence networks, trend graphs, and more
- Multilingual support: Handles English, Japanese, Chinese, Korean, and many other languages
- Corpus sharing: Analysis results can be shared via URL, enabling reproducible research
- API access: Programmatic access is also available
Key Analysis Features
Cirrus (Word Cloud)
Visualizes high-frequency words as a word cloud. Customizable stopword lists allow appropriate filtering for any target language.
KWIC (Concordance)
Displays how specific keywords are used within their context. Analyze usage patterns while viewing surrounding text.
Trends (Frequency Progression)
Shows how word frequencies change throughout a document as line graphs. Track multiple terms simultaneously to analyze thematic shifts within texts.
Links (Co-occurrence Network)
Visualizes co-occurrence relationships between words as a network graph, revealing which words frequently appear together.
Topics (Topic Modeling)
LDA-based topic modeling automatically extracts latent topics from text corpora.
How to Use
Basic Workflow
- Visit Voyant Tools
- Paste text into the input area or upload files (TXT, PDF, HTML, XML, etc.)
- Click “Reveal” to display multiple analysis panels simultaneously
- Adjust each panel’s settings to deepen your analysis
- Use the export button to download images or data
Analyzing Japanese Text
Voyant Tools supports Japanese morphological analysis. When uploading Japanese text, set the language to “Japanese” for proper segmentation. Note that accuracy for specialized terminology and proper nouns may be limited, so reviewing results as you proceed is recommended.
Practical Applications in DH Research
Comparative Literary Analysis
Load multiple literary works as a corpus to analyze vocabulary differences and shared expression patterns across works. For example, quantitatively demonstrate how Natsume Soseki’s vocabulary evolved between his early and late periods.
Historical Document Analysis
Track discourse changes around specific social issues over time using digitized newspaper archives. Combining the Trends tool with KWIC enables simultaneous quantitative and qualitative analysis.
Academic Literature Survey
Apply topic modeling to a large collection of research paper abstracts to grasp overall research trends in a field.
Comparison with Other Tools
| Feature | Voyant Tools | AntConc | KH Coder |
|---|---|---|---|
| Environment | Browser | Desktop | Desktop |
| Japanese support | Yes | Limited | Extensive |
| Visualization | Rich | Basic | Rich |
| Programming | Not required | Not required | Not required |
| Sharing | URL sharing | None | None |
Conclusion
Voyant Tools is a powerful platform that makes rich text analysis accessible without specialized knowledge or programming skills. It is particularly well-suited for quickly grasping an overview of text data in the early stages of DH research. The URL-based corpus sharing feature also ensures research reproducibility.