Data Scraping Wikipedia with Google Spreadsheets (ouseful.wordpress.com)
33 points by iamelgringo on Oct 15, 2008 | 7 comments



"So to recap, we have scraped some data from a wikipedia page into a Google spreadsheet using the =importHTML formula, published a handful of rows from the table as CSV, consumed the CSV in a Yahoo pipe and created a geocoded KML feed from it, and then displayed it in a Yahoo map."

Hilarious!
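
For anyone who would rather do the last hop of that chain in a script, here is a minimal Python sketch of the CSV-to-KML step. It is not what the article does: the article geocodes place names in a Yahoo pipe, whereas this sketch assumes the published CSV already has `name`, `lat` and `lon` columns. Those column names, the `csv_to_kml` helper and the sample row are all made up for illustration.

```python
import csv
import io
from xml.sax.saxutils import escape


def csv_to_kml(csv_text):
    """Turn CSV rows with name, lat, lon columns (assumed) into a KML string."""
    rows = csv.DictReader(io.StringIO(csv_text))
    placemarks = []
    for row in rows:
        # KML writes coordinates as longitude first, then latitude.
        placemarks.append(
            "<Placemark><name>{}</name>"
            "<Point><coordinates>{},{}</coordinates></Point></Placemark>".format(
                escape(row["name"]), float(row["lon"]), float(row["lat"])
            )
        )
    return (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        '<kml xmlns="http://www.opengis.net/kml/2.2"><Document>'
        + "".join(placemarks)
        + "</Document></kml>"
    )


# Hypothetical sample standing in for the CSV published from the spreadsheet.
sample = "name,lat,lon\nLondon,51.5,-0.12\n"
kml = csv_to_kml(sample)
```

In practice the CSV text would come from the spreadsheet's published CSV URL rather than a hard-coded string.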


Fantastic. A much cleaner method than copy-pasting table data from web pages into Excel to visualize and manipulate it, which I've done countless times while doing market research.


My BSc dissertation was on a simplified English dialect (language) which can be translated into triplets (or data items) while still being readable as English. However, I never really finished the editor it required, which hooked up to a parser to show you the triplets as you typed.

Also, I'm bad at getting traction on my ideas.


Or you could just use DBpedia: http://dbpedia.org/About


The purpose isn't to show how to get data from Wikipedia. This can be done with any HTML table.
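
To make the "any HTML table" point concrete, here is a rough Python sketch of the same idea using only the standard library: pull the Nth table out of a page as a list of rows. The `TableExtractor` class and `import_html_table` function are names invented for this example, the 1-based index is chosen to mirror how importHTML counts tables, and nested tables are not handled.

```python
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect every <table> in a page as a list of rows of cell text."""

    def __init__(self):
        super().__init__()
        self.tables = []   # one list of rows per table found
        self._row = None   # row currently being filled, if any
        self._cell = None  # text fragments of the current cell, if any

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.tables[-1].append(self._row)
            self._row = None


def import_html_table(html, index):
    """Return table number `index` (1-based) from an HTML string."""
    parser = TableExtractor()
    parser.feed(html)
    return parser.tables[index - 1]


# Hypothetical page fragment; any page with a <table> would do.
page = (
    "<table><tr><th>City</th><th>Rank</th></tr>"
    "<tr><td>Leeds</td><td>1</td></tr></table>"
)
rows = import_html_table(page, 1)
```

This only sees tables present in the HTML as served, so it would not help with tables filled in later by JavaScript.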


Bravo! One of the most useful submissions to HN in a while. This is the kind of thing that gives my faith in tech a little boost - "throwing sheep at your friends on facebook" or cute productivity apps that reduce your productivity are not.


Does anyone know of a way to use Google Spreadsheets with AJAX-rendered table data?



