The UK government publishes lots of spending data now. Let's do cool stuff!
ScraperWiki is one of these new-fangled cloud services, hosting code that scrapes websites. You can throw some python (or ruby, or php) together to download all the CSV files for a department.
Google Refine is like a spreadsheet on crack, with features ideal for cleaning up messy data sets. I saw it for the first time at OpenTech 2011 a few weeks ago in London. You can take the government data, clean up the worst typos, and integrate it into the scraperwiki scraper.
For bonus marks, throw in a bit of jQuery and Google Charts, and create a dynamically generated pie chart, or maybe a word cloud. There must be more imaginative ways to visualize this... email me if you have any ideas!
They interviewed me last week - I must say, that's the first time that's happened, but I'm very flattered. None of the above would exist without the work done by @DataMinerUK et. al., so thank you, everyone.
Posted: 14 Jun 2011 23:02 |
| < | June 2011 | > | ||||
| Su | Mo | Tu | We | Th | Fr | Sa |
| 1 | 2 | 3 | 4 | |||
| 5 | 6 | 7 | 8 | 9 | 10 | 11 |
| 12 | 13 | 14 | 15 | 16 | 17 | 18 |
| 19 | 20 | 21 | 22 | 23 | 24 | 25 |
| 26 | 27 | 28 | 29 | 30 | ||
Tim Retout [email protected]
JabberID: [email protected]
I'm afraid I have turned off comments for this blog, because of all the spam. Let's face it, I didn't read them anyway. Feel free to email me.