When Albany was in print

albany google ngram 2

Thoughtful chin scratch...

Update: Alex set us straight -- see his comment about searching for "Albany" vs. "albany." The new graph is above (the old graph is after the jump).

The capital-A Albany graph makes more sense. It was a little weird that there was little or no mention of Albany prior to 1780 -- the city had already been incorporated for almost 100 years by that point.
_____

Google recently opened up access to a massive dataset collected from digitized books -- some 500 billion words from 5.2 million books. The hope is this dataset will enable interesting sorts of quantitative research for the humanities.

The simplest of this sort of research involves word counts. The Google dataset includes counts for how often a word was used in the texts over time. Go ahead -- you can try it for yourself.

Roland emailed us today after doing a search for "Albany." The results are in the graph above.

You have to figure there was a spike toward the end of the 1700s because of New York becoming a state. Any thoughts on the reasons for the spike around 1850 (Civil War lead-up?) -- or 1950, or since 1988?

(Thanks, Roland!)

graph: Google

Previously posted graph:

albany google ngram graph

Comments

You should note that this search is case sensitive and the results for albany and Albany are very different. If you search them both simultaneously

(http://ngrams.googlelabs.com/graph?content=albany,Albany&year_start=1700&year_end=2000&corpus=0&smoothing=3)

you will see that people very rarely use albany as opposed to the usually correct Albany.

It's still interesting to see the stark peaks in the lowercase version of the graph. The post-1988 increase seems to be universal across lowercase versions of proper nouns, which might lead you to believe the quality of editing has decreased over the last quarter century or so. That's just one possible explanation.

I wonder - have they taken into consideration the number of books representing a particular year or time period? You're bound to get more mentions if 1) more books were printed in those years or 2) the sample that populated the database overselected books from those years. Just saying.

@Barold: Look here. Results are normalized for publication volume over time.

@Alex: Yes, the meaningful query is "Albany". Far as I know, the Ngram query syntax doesn't support conditional operators, so you can't do something like "Albany|albany" ... yet. OTOH, many of the datasets are available, and the graphical output is just Google Charts, so a motivated diver could mash something up for a special-interest query without too much trouble.

LQ
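The two ideas in the comment above, normalizing by publication volume and merging case variants by hand, can be sketched in a few lines of Python. This is only an illustration: the counts below are made up, the function names are hypothetical, and nothing here assumes the actual layout of Google's downloadable files.

```python
from collections import defaultdict


def combine_variants(counts_by_variant):
    """Sum per-year counts across spellings such as "Albany" and "albany".

    counts_by_variant maps each spelling to a {year: count} dict.
    """
    combined = defaultdict(int)
    for yearly_counts in counts_by_variant.values():
        for year, count in yearly_counts.items():
            combined[year] += count
    return dict(combined)


def relative_frequency(counts, total_words_by_year):
    """Divide each year's count by the total words published that year.

    Years with no (or zero) total are skipped rather than guessed at.
    """
    return {
        year: counts[year] / total_words_by_year[year]
        for year in sorted(counts)
        if total_words_by_year.get(year)
    }


if __name__ == "__main__":
    # Made-up numbers, for illustration only.
    counts = {
        "Albany": {1850: 900, 1950: 1200},
        "albany": {1850: 100, 1950: 300},
    }
    total_words = {1850: 1_000_000, 1950: 3_000_000}

    combined = combine_variants(counts)
    for year, share in relative_frequency(combined, total_words).items():
        print(year, combined[year], share)
```

With these invented figures the raw number of mentions rises between the two years while the share of all words falls, which is why a normalized graph can look very different from a simple tally.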

WRT anomalies that don't have obvious maps to Albany, NY history, remember there's an Albany, Georgia that maybe generates 19th-century attention at various times.

Also, Albany, NY was a top-ten U.S. population metro from 1810 to 1860, so doings here would show spikes in published mention that we don't see in today's much larger nation and literary corpus.

LQ

I've been trying to figure out whether this is anything other than a stupid web trick since they came out with it. It's not random, because Google's books come from certain collections (though at this point a vast array of them). It's not comprehensive, either. And it's highly repetitive -- Google Books often has several editions of the same book (scanned from different collections), and none of the related books you would expect to find. In the end it seems pretty meaningless.

Not only was Albany a top 10 city, as Lou Quillio mentions, but it was a major industrial center, central to canal and rail, and much more tightly tied to New York City than it is today. No mystery that it would have rated mentions in the books of the day.

Yes, but print now has taken on a cyber dimension in so many ways.

Print is the past! Internet is the future!

One small part of the new mosaic = http://www.nysm.nysed.gov/albany/gallery.html

Best holiday wishes,
Steve Bielinski

just fyi: there are at least a dozen Albanys in the US alone

@Carl - I've been trying to figure out whether this is anything other than a stupid web trick since they came out with it.

You know, it is what it is, fully disclosed: a big sloppy pile of indexed written matter, tagged by publication date. The user decides for herself what her query results mean, whether she's a skilled humanities researcher (the target audience) or my mom.

The only way to mistake one's query results is to mistake what the data is: a judgment-free pile.

LQ
