Data Mining History & the Canadian Landscape

I will be posting my progress on a data mining project for my Historian’s Craft course. I will post the data as the project progresses (so far it is only preliminary). I am data mining Susanna Moodie’s Roughing It in the Bush (1852) and Henry David Thoreau’s An Excursion to Canada (1853). The purpose is to “not read,” or “distant read,” these two separate works, which share roughly the same time, place, and theme, in order to discover patterns between them. The platform I am using for data mining is Voyant Tools. The new version is still in beta, but it is truly a great tool. For example, the word frequency tool has an option to filter out common words such as “the” and “is”.
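For anyone curious what filtering out common words looks like in practice, here is a minimal sketch in Python. It is not how Voyant Tools works internally; the tiny stop-word list, the function name word_frequencies, and the sample sentence are all made up for illustration.

```python
import re
from collections import Counter

# Assumed, deliberately tiny stop-word list for illustration only;
# real tools ship with much longer lists.
STOP_WORDS = {"the", "is", "a", "an", "and", "of", "to", "in", "it", "was"}

def word_frequencies(text, stop_words=STOP_WORDS):
    """Count how often each word occurs, skipping stop words."""
    # Lowercase the text and pull out runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(w for w in words if w not in stop_words)

# Hypothetical sample sentence, not a quotation from either book.
sample = "The bush is wild and the bush is cold."
print(word_frequencies(sample).most_common(3))
```

Without the filter, “the” and “is” would tie with “bush” at the top of the list, which is why removing them makes the words that matter easier to see.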

I will post my data along the way, as well as my processes and final work. I predict some very interesting conclusions!

A list of links to the appropriate pages:

Word Frequency

Word Representation (frequency by size)

Raw Graph of Words

Case Study: Thoreau, Moodie, & Metaphors

The Grand Finale
