When Albany was in print

albany google ngram 2

Thoughtful chin scratch...

Update: Alex set us straight -- see his comment about searching for "Albany" vs. "albany." The new graph is above (the old graph is after the jump).

The capital-A Albany graph makes more sense. It was a little weird that there was little or no mention of Albany prior to 1780 -- the city had already been been incorporated for almost 100 years by that point.

Google recently opened up access to a massive dataset collected from digitized books -- some 500 billion words from 5.2 million books. The hope is this dataset will enable interesting sorts of quantitative research for the humanities.

The simplest of this sort of research involves word counts. The Google dataset includes counts for how often a word was used in the texts over time. Go ahead -- you can try it for yourself.

Roland emailed us today after doing search for "Albany." The results are in the graph above.

You have to figure there was spike toward the end of the 1700s because of New York becoming a state. Any thoughts on the reasons for the spike around 1850 (Civil War lead up?) -- or 1950, or since 1988?

(Thanks, Roland!)

graph: Google

Previously posted graph:

albany google ngram graph


You should note that this search is case sensitive and the results for albany and Albany are very different. If you search them both simultaneously


you will see that people very rarely use albany as opposed to the usually correct Albany.

Its still interesting to see the peaks stark peaks in the lowercase version of the graph. The post 1988 increase seems to be universal across lowercase versions of proper nouns, which might lead you to believe the quality of editing has decreased over the last quarter century or so. Thats just one possible explanation.

I wonder - have they taken into consideration the number of books representing a particular year or time period? You're bound to get more mentions if 1) more books were printed in those years or 2) the sample that populated the database overselected books from those years. Just saying.

@Barold: Look here. Results are normalized for publication volume over time.

@Alex: Yes, the meaningful query is "Albany". Far as I know, the Ngram query syntax doesn't support conditional operators, so you can't do something like "Albany|albany" ... yet. OTOH, many of the datasets are available, and the graphical output is just Google Charts, so a motivated diver could mash something up for a special-interest query without too much trouble.


WRT anomalies that don't have obvious maps to Albany, NY history, remember there's an Albany, Georgia that maybe generates 19th-century attention at various times.

Also, Albany, NY was a top-ten U.S. population metro from 1810 to 1860, so doings here would show spikes in published mention that we don't see in today's much larger nation and literary corpus.


I've been trying to figure out whether this is anything other than a stupid web trick since they came out with it. It's not random, because Google's books come from certain collections (though at this point a vast array of them). It's not comprehensive, either. And it's highly repetitive -- Google books often has several editions of the same book (scanned from different collections), and none of related books you would expect to find. In the end it seems pretty meaningless.

Not only was Albany a top 10 city, as Lou Quillio mentions, but it was a major industrial center. central to canal and rail, and much more tightly tied to New York City than it is today. No mystery that it would have rated mentions in the books of the day.

Yes, but print now has taken on a cyber dimension in so many ways.

Print is the past! Internet is the future!

One small part of the new mosaic = http://www.nysm.nysed.gov/albany/gallery.html

Best holiday wishes,
Steve Bielinski

just fyi: there are at least a dozen Albanys in the US alone

@Carl - I've been trying to figure out whether this is anything other than a stupid web trick since they came out with it.

You know, it is what it is, fully disclosed: a big sloppy pile of indexed written matter, tagged by publication date. The user decides for herself what her query results mean, whether she's a skilled humanities researcher (the target audience) or my mom.

The only way to mistake one's query results is to mistake what the data is: a judgment-free pile.


Say Something!

We'd really like you to take part in the conversation here at All Over Albany. But we do have a few rules here. Don't worry, they're easy. The first: be kind. The second: treat everyone else with the same respect you'd like to see in return. Cool? Great, post away. Comments are moderated so it might take a little while for your comment to show up. Thanks for being patient.

What's All Over Albany?

All Over Albany is for interested and interesting people in New York's Capital Region. In other words, it's for you. It's kind of like having a smart, savvy friend who can help you find out what's up. Oh, and our friends call us AOA.


Recently on All Over Albany

Today's moment of winter

Beige on gray sway along the Indian Pond on the UAlbany campus. There was a rumor that it was something like 70 degrees at some... (more)

Chris Gibson at Bethlehem Public Library

Former Congressman Chris Gibson will be at the Bethlehem Public Library March 3 to talk about his book Rally Point. The talk is free and... (more)

Kirsten Gillibrand on Desus and Mero and The Late Show

More evidence that Kirsten Gillibrand is becoming a national figure: KG appeared on both the Late Show with Stephen Colbert and Viceland's Desus &... (more)

NY Maple Weekend 2018

March is almost here, and that means maple syrup in New York. Maple farms around the state area again participating in two "Maple Weekends" this... (more)

Morning Blend

Guns + The governors of New York, Connecticut, New Jersey, and Rhode Island say their states are forming a coalition -- States for Gun Safety... (more)

Recent Comments

Definitely needs a mural. Another spot that is begging for one is the Delaware Ave Price Chopper in Albany. The side along Elm St is a huge gray wall that takes up most of the block!

Good places for a large company picnic?

...has 9 comments, most recently from frank

Kirsten Gillibrand on Desus and Mero and The Late Show

...has 1 comment, most recently from Bullwinkle

Dylan Ratigan has jumped into the pool of challengers to Elise Stefanik

...has 6 comments, most recently from ace

Flooding begins to recede along Mohawk, missed fax might have prevented Jay Street fire, Capital Region private college applications rising

...has 1 comment, most recently from Lauren

The Glynn Mansion and the story of Martin Glynn

...has 1 comment, most recently from BS