[Dhcenterstartup] text mining software

Eric Lease Morgan emorgan at nd.edu
Tue Aug 27 19:02:17 CEST 2013


What sorts of text mining software do y'all support / use in your digital humanities centers?

We here in the Hesburgh Libraries at the University of Notre Dame have all but opened a place called the Center For Digital Scholarship. We are / will be providing a number of different services to a number of different audiences. These services include, but are not necessarily limited to:

  * data management consultation
  * data analysis and visualization
  * geographic information systems support
  * text mining investigations
  * referrals to other "centers" across campus

I am expected to support the text mining investigations. I have traditionally used open source tools to do my work. Many of these tools require some sort of programming in order to exploit them. To some degree I am expected to mount text mining software on our local Windows and Macintosh computers here in our Center. I am familiar with the lists of tools available at Bamboo as well as Hermeneuti.ca. [0, 1] TAPoRware is good too, but a bit long in the tooth. [2]

Do you know of other sets of tools to choose from? Are you familiar with SAS® Text Analytics, STATISTICA Data Miner, or RapidMiner? [3, 4, 5]

[0] Bamboo Dirt - http://dirt.projectbamboo.org
[1] Hermeneuti.ca - http://hermeneuti.ca/voyeur/tools
[2] TAPoRware - http://taporware.ualberta.ca
[3] Text Analytics - http://www.sas.com/text-analytics/
[4] Data Miner - http://www.statsoft.com/Products/STATISTICA/Data-Miner/
[5] RapidMiner - http://rapid-i.com/content/view/181/190/

--
Eric Lease Morgan, Digital Initiatives Librarian
Hesburgh Libraries
University of Notre Dame

574/631-8604





