Structured and Unstructured Data
- Blake’s Songs of Innocence (http://www.glyndwr.ac.uk/rdover/blake/songsinn.htm)
- Blake’s Songs of Innocence TEI Mark-up (http://www.teibyexample.org/examples/TBED04v00.htm)
- Jeremy Bentham Handwritten (http://www.teibyexample.org/examples/TBED06v00.htm)
- Perseus Project (http://www.perseus.tufts.edu/)
- Civil War Washington (http://civilwardc.org/)
Metadata
- Simple Definition of Metadata
- NYPL Digital Gallery Image Metadata sample
- http://www.niso.org/publications/press/UnderstandingMetadata.pdf
- Quilt Index (http://www.quiltindex.org/)
- http://quiltindex.kora.matrix.msu.edu/login.php
Sites
- Mining the Dispatch, http://dsl.richmond.edu/dispatch/pages/home
- Old Bailey Online, http://www.oldbaileyonline.org/
- Cameron Blevins, Topic Modeling Martha Ballard’s Diary (series of posts), http://historying.org/2010/04/01/topic-modeling-martha-ballards-diary/
- 8000 Canadians (http://themacroscope.org/interactive/dcbnet/)
- Explanation (http://www.themacroscope.org/?page_id=70)
- Newton (http://webapp1.dlib.indiana.edu/newton/)
- The Historian’s Macroscope: Big Digital History (http://www.themacroscope.org/)
Tools (Fireside example text)
- n-Gram Viewer, https://books.google.com/ngrams/
- Bookworm, http://bookworm.culturomics.org/
- Voyant Tools, http://voyant-tools.org/
- Overview, http://overview.ap.org/
- TAPOR – Text Analysis Tools (http://taporware.ualberta.ca/~taporware/textTools/)
- Topic Modeling in the Browser, http://mimno.infosci.cornell.edu/jsLDA/
Other Sites
- Wordle (http://www.wordle.net/)
- Text Encoding Initiative (structured data) (http://www.tei-c.org/index.xml)
- TEI Examples (http://www.teibyexample.org/)
- TEI Databases (http://www.tei-c.org/Activities/Projects/)
- Project Gutenberg (text examples) (http://www.gutenberg.org/)
- Voyeur – Text Analysis Tools (http://hermeneuti.ca/voyeur)
- Voyeur Tools (http://voyeurtools.org/tool/Links/)
- Mallet – Machine Learning for Language Toolkit (http://mallet.cs.umass.edu/)
- Text Arc (http://www.textarc.org/)
- Book Lamp (http://labs.booklamp.org/sentexp/)
- Docuburst (http://vialab.science.uoit.ca/docuburst/welcome.php)
- Textgrid (http://www.textgrid.de/en)
- Digging into Data (http://www.diggingintodata.org/)
References
- Fred Gibbs, “Getting Started in Text Mining,” http://fredgibbs.net/courses/etc/getting-started-with-text-mining
- Miriam Posner, “Very Basic Strategies for Interpreting Results from the Topic Modeling Tool,” http://miriamposner.com/blog/very-basic-strategies-for-interpreting-results-from-the-topic-modeling-tool/
- John Burrows, “Textual Analysis,” A Companion to Digital Humanities, http://nora.lis.uiuc.edu:3030/companion/view?docId=blackwell/9781405103213/9781405103213.xml&chunk.id=ss1-4-4&toc.depth=1&toc.id=ss1-4-4&brand=9781405103213_brand
- Basic introduction to text mining principles and terminology: http://www.cch.kcl.ac.uk/legacy/teaching/av1000/textanalysis/method.html
- Shlomo Argamon et al., “Gender, Race, and Nationality in Black Drama, 1950-2006: Mining Differences in Language Use in Authors and Their Characters,” Digital Humanities Quarterly 3, no. 2 (2009), http://digitalhumanities.org/dhq/vol/3/2/000043/000043.html
- Lauren Klein and Jacob Eisenstein, “Reading Thomas Jefferson with TopicViz: Towards a Thematic Method for Exploring Large Cultural Archives,” Scholarly and Research Communication 4, no. 3 (2013), http://src-online.ca/index.php/src/article/view/121/259