I will be posting my progress on a data mining project for my Historian’s Craft course. I will post the data as the project progresses; so far it is only preliminary. I am data mining Susanna Moodie’s Roughing It in the Bush (1852) and Henry David Thoreau’s An Excursion to Canada (1853). The purpose is to “not read,” or “distant read,” these two separate works from roughly the same time and place, with similar themes, in order to discover patterns between them. The platform I will be using for data mining is Voyant Tools. The new version is still in beta, but it is truly a great tool. For example, the word frequency tool has an option to filter out common words such as “the” and “is” that would otherwise dominate the counts.
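To give a rough idea of what that filtering does, here is a minimal sketch in Python. It is not how Voyant Tools works internally; the function name, the tiny stop-word list, and the sample sentence are all made up for illustration and do not come from either book.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (an assumption for this sketch;
# real tools ship much longer lists).
STOP_WORDS = {"the", "is", "a", "an", "and", "of", "to", "in", "it"}

def word_frequencies(text, stop_words=STOP_WORDS):
    """Count how often each word occurs, skipping the stop words."""
    # Lowercase the text and pull out runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(w for w in words if w not in stop_words)

# Invented sample sentence, not a quotation from Moodie or Thoreau.
sample = "The bush is wild, and the bush is lonely."
print(word_frequencies(sample).most_common(3))
```

Without the filter, “the” and “is” would tie with “bush” at the top of the list; with it, only the words that carry meaning are counted.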
I will post my data along the way, as well as my process and final work. I expect some very interesting conclusions!
A list of links to the appropriate pages: