Free data: utility, risks, opportunities

Written by

Some random thoughts after The possibilities of real-time data event at the City Hall.

Free your location: you’re already being photographed
I was not surprised to hear the typical objection (or rant, if you don’t mind) of institutions’ representative when requested to release data: “We must comply with the Data Protection Act!“. Although this is technically true, I’d like to remind these bureaucrats that in the UK being portraited by a photographer in a public place is legal. In other words, if I’m in Piccadilly Circus and someone wants to take a portrait of me, and possibly use it for profit, he is legally allowed to do so without my authorization.
Hence, if we’re talking about releasing Oyster data, I can’t really see bigger problems than those related to photographs: where Oyster data makes it public where you are and, possibly, when, a photograph might give insight to where you are and what you are doing. I think that where+what is intrinsically more dangerous (and misleading, in most cases) than where+when, so what’s the fuss about?

Free our data: you will benefit from it!
Bryan Sivak, Chief Technology Officer of Washington DC (yes, they have a CTO!), has clearly shown it with an impressive talk: freeing public data improves service level and saves public money. This is a powerful concept: if an institution releases data, developers and business will start creating enterprises and applications over it. But more importantly, the institution itself will benefit from better accessibility, data standards, and fresh policies. That’s why the OCTO has released data and facilitated competition by offering money prizes to developers: the government gets expertise and new ways of looking at data in return for technological free speech. It’s something the UK (local) government should seriously consider.

Free your comments: the case for partnerships between companies and users
Jonathan Raper, our Twitter’s @MadProf, is sure that partnerships between companies and users will become more and more popular. Companies, in his view, will let the cloud generate and manage a flow of information about their services and possibly integrate it in their reputation management strategy.
I wouldn’t be too optimistic, though. Albeit it’s true that many longsighted companies have started engaging with the cloud and welcome autonomous, independently run, twitter service updates, most of them will try to dismiss any reference to bad service. There are also issues with data covered by licenses (see the case of FootyTweets).
I don’t know why I keep thinking about trains as an example, but would you really think that, say, Thameslink would welcome the cloud twitting about constant delays on their Luton services? Not to mention the fact that NationalRail forced a developer to stop offering a free iPhone application with train schedules – to start selling their own, non free (yes, charging £4.99 for data you can get from their own mobile web-site for free, with the same ease of use, is indeed a stupid commercial strategy).

Ain’t it beautiful, that thing?
We’ve seen many fascinating visualization of free data, both real-time and not. Some of these require a lot of work to develop. But are they useful? What I wonder is not just if they carry any commercial utility, but if they can actually be useful to people, by improving their life experience. I have no doubt, for example, that itoworld‘s visualization of transport data, and especially those about Congestion Charging, are a great tool to let people understand policies and authorities make better planning. But I’m not sure that MIT SenseLab’s graphs of phone calls during the World Cup Final, despite being beautiful to see, funny to think about, and technically accurate, may bring any improvement to user experience. (Well, this may be the general difference between commercial and academic initiative – but I believe this applies more generally, in the area of data visualization).

Unorthodox uses of locative technologies
MIT Senselab‘s Carlo Ratti used gsm cell association data to approximate people density in streets. This is an interesting use of technology. Nonetheless, unorthodox uses of technologies, especially locative technologies, must be taken carefully. Think about using the same technique to calculate road traffic density: you would have to consider single and multiple occupancy vehicles, where this can have different meanings on city roads and motorways. Using technology in unusual ways is fascinating and potentially useful, but the association of the appropriate technique to the right problem must be carefully gauged.

Risks of not-so-deep research
This is generally true in research, but I would say it’s getting more evident in location-based services research and commercial activities: targeting marginally interesting areas of knowledge and enterprise. Ratti’s words: “One PhD student is currently looking at the correlations between Britons and parties in Barcelona… no results yet“. Of course, this was told as a half-joke. But in many contexts, it’s still a half-truth.

geo gla gov gov 2.0 opendata real time

Comments

3 responses to “Free data: utility, risks, opportunities”

20 April 2010

dan

ciao beppe, you point out that what carlo has shown has focused on visualization, and that is worrying if his lab stops there. in the same vein, jeremy faludi wrote that this way of doing research is simply “info-porn”, because there hasn’t been follow through on the potential.
http://www.worldchanging.com/archives/009595.html

however, in the last 1 and half year, one team of his lab (to which i belonged and i’m still working with) has focused on in-depth analysis of city data. so you’ll start to see papers in tier-one conferences and journals. having this new approach, we hope to go beyond short-term applications/visualizations. so your point has been taken on board 😉

as for the comment on the oyster representative, i don’t entirely agree with you. going from where+when to where+what is quite easy, and i’ll show that in a paper i’m submitting this week (i’ll blog about it in a couple of months). so the the oyster people are rightly cautious but are also open to change their current approach (after all, they were there,right?)

happy to see you are keeping on blogging. we need more people who are interested in science and willing to share their knowledge 😉 thanks

Reply
20 April 2010

Beppe

Hi Daniele,
I was actually not critical of SenseLab’s or Carlo’s own work (which I find fascinating), I was rather wondering how this “beauty” can turn into something that makes people’s lives better. As you suggest I’m happy to see that the lab is going onto get this done 🙂 I’m actually in the process of coming back to reading academic stuff, and SenseLab’s list of publications seems to be a good one to begin with.

As for oysters… well, I’m more than worried by a “Big Brother” scenario, and you are right when you say that the step from where+when to where+what is a short one. Somewhat I agree with the slow approach to this, and I’m not advocating any speed up in this process. Nonetheless, I think that you are actually pointing the real issue: the way data are used, not the data itself. I’m not against rules in this context, and I believe we should rather concentrate on how to detect, avoid, and punish unacceptable behaviours (e.g.: using data to create advertisement without the user’s authorization, insurances discriminating users that work in Brixton, etc…). I’d be very interested in reading your paper about this!

Yep, I’ll keep on blogging, not at a very high frequency as usual, but when I have something I’d like to discuss I’ll put it on here 🙂

Reply
20 April 2010

dan

when you start populating your reading list, please drop me a line. i’ll give you few recommendations 😉

Reply

Free data: utility, risks, opportunities

Comments

3 responses to “Free data: utility, risks, opportunities”

Leave a Reply Cancel reply

More posts

Hello world!

Vacuum tubes can be art

Christmas Time

The several issues of geo development: a chronicle of October's GeoMob