Researchers Tap Google Books to Create the Word Cloud for Human History

By Andrew Moseman | December 17, 2010 11:34 am

NYC_Public_LibraryHumanity’s legacy of millions upon millions of books represents an unparalleled reservoir of data, precisely detailing the changes in language and culture over the centuries. Now, if only a search engine giant were digitizing that history…

Oh, right. Google has been doing just that, and now scientists are beginning to tap that treasure trove of data.

From Ed Yong:

Together with over 40 university libraries, the internet titan has thus far scanned over 15 million books, creating a massive electronic library that represents 12% of all the books ever published. All the while, a team from Harvard University, led by Jean-Baptiste Michel and Erez Lieberman Aiden have been analysing the flood of data.

Their first report is available today. Although it barely scratches the surface, it’s already a tantalising glimpse into the power of the Google Books corpus. It’s a record of human culture, spanning six centuries and seven languages. It shows vocabularies expanding and grammar evolving. It contains stories about our adoption of technology, our quest for fame, and our battle for equality. And it hides the traces of tragedy, including traces of political suppression, records of past plagues, and a fading connection with our own history.

Do yourself a favor and check out the rest of Ed’s extensive post—including fascinating examples like the “half-life” of any given year being mentioned in literature—over at Not Exactly Rocket Science. And try out Google’s search to see the prevalence of any phrases or phrases over the years.

Related Content:
DISCOVER: The Dawn of Urban Civilization: Writing, Urban Life, and Warfare
80beats: The Brains of Storytellers and Their Listeners Actually Sync Up
Not Exactly Rocket Science: New Languages Evolve in Rapid Bursts
Not Exactly Rocket Science: The Evolution of Past Tense: How Verbs Change Over Time

Image: Wikimedia Commons (New York Public Library)

CATEGORIZED UNDER: Mind & Brain, Technology
  • Edwin Harns

    I have to thank you to the efforts you have invest scripting this blog. I’m hoping an identical high-grade website post from your business around the upcoming also. In fact your creative writing skills has encouraged me to obtain my own personal web site now. Actually the blogging is spreading its wings quickly. Your write is a good example of it.

NEW ON DISCOVER
OPEN
CITIZEN SCIENCE
ADVERTISEMENT

Discover's Newsletter

Sign up to get the latest science news delivered weekly right to your inbox!

80beats

80beats is DISCOVER's news aggregator, weaving together the choicest tidbits from the best articles covering the day's most compelling topics.
ADVERTISEMENT

See More

ADVERTISEMENT
Collapse bottom bar
+

Login to your Account

X
E-mail address:
Password:
Remember me
Forgot your password?
No problem. Click here to have it e-mailed to you.

Not Registered Yet?

Register now for FREE. Registration only takes a few minutes to complete. Register now »