linked data: All content tagged as linked data in NoSQL databases and polyglot persistence
Having written quite a bit lately about Big Data marketplaces, I thought it would be worth mentioning Tim Berners-Lee's 5-star deployment scheme for Linked Open Data:
- make your stuff available on the Web (whatever format) under an open license
- make it available as structured data (e.g., Excel instead of image scan of a table)
- use non-proprietary formats (e.g., CSV instead of Excel)
- use URIs to identify things, so that people can point at your stuff
- link your data to other data to provide context
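To make the last three stars a bit more concrete, here is a minimal Python sketch that takes a tiny CSV table (star 3), gives each row a URI (star 4), and links it to a record in another dataset (star 5), emitting N-Triples-style lines. The `example.org` and `example.net` namespaces, the place identifiers, and the `to_ntriples` helper are all assumptions made up for illustration; they are not part of the scheme itself.

```python
import csv
import io

# Star 3: structured data in a non-proprietary format (placeholder rows).
CSV_DATA = "id,name\ntown-a,Town A\ntown-b,Town B\n"

# Hypothetical namespaces, used only for this illustration.
BASE = "http://example.org/place/"          # our own identifiers
EXTERNAL = "http://example.net/gazetteer/"  # some other published dataset

RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def to_ntriples(csv_text):
    """Turn CSV rows into N-Triples-style lines (no literal escaping is done)."""
    lines = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Star 4: a URI identifies the thing, so others can point at it.
        subject = f"<{BASE}{row['id']}>"
        name = row["name"]
        lines.append(f'{subject} <{RDFS_LABEL}> "{name}" .')
        # Star 5: link to another dataset's record to provide context.
        lines.append(f"{subject} <{OWL_SAME_AS}> <{EXTERNAL}{row['id']}> .")
    return lines


triples = to_ntriples(CSV_DATA)
for line in triples:
    print(line)
```

A real deployment would use an RDF library and properly escaped literals; the point here is only how little separates a plain CSV file from linkable data.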
See Tim Berners-Lee talking about the star scheme at the Gov 2.0 Expo.
ReadWriteWeb has published a very interesting story about a project, presented at last week’s Strata conference, that aims to reconstruct linked data from public data sources like Flickr and OpenStreetMap using a somewhat classical “fuzzy matching” approach. The project sets out to:
build a detailed database of information about places in Afghanistan, using only public sources on the Web. The goal is to describe in detail the towns and cities including everything from names, locations and populations, as well as lists and coordinates for schools, mosques, banks and hotels.
My gut feeling is that mixing in a graph database would not necessarily make this problem easier to address, but it would offer a different angle from which to tackle it. Fuzzy matching is a search-based approach with an inductive flavor, while a graph database could bring in a deductive approach.
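The contrast between the two flavors can be sketched in a few lines of Python. This is not the project's actual method: the fuzzy step uses the standard library's `difflib` as a stand-in for whatever matching the project applies, the "graph" is a plain dictionary of `located_in` edges rather than a real graph database, and the `school-1` record and the helper names are hypothetical.

```python
import difflib


def fuzzy_match(name, known_names, cutoff=0.8):
    """Inductive step: guess that a noisy name refers to the most similar known name."""
    matches = difflib.get_close_matches(name.lower(), known_names, n=1, cutoff=cutoff)
    return matches[0] if matches else None


def containing_regions(place, located_in):
    """Deductive step: follow explicit located_in edges; every result follows from stored facts."""
    regions = []
    seen = {place}  # guards against cycles in the edge data
    while place in located_in and located_in[place] not in seen:
        place = located_in[place]
        seen.add(place)
        regions.append(place)
    return regions


KNOWN_TOWNS = ["kandahar", "herat"]

# Toy edge list standing in for a graph database; "school-1" is a made-up record.
LOCATED_IN = {
    "school-1": "kandahar",
    "kandahar": "kandahar-province",
    "kandahar-province": "afghanistan",
}

town = fuzzy_match("Qandahar", KNOWN_TOWNS)             # similarity-based guess
regions = containing_regions("school-1", LOCATED_IN)    # derived by traversal
print(town, regions)
```

The fuzzy step can be wrong, and its quality depends on the cutoff chosen; the traversal step only ever returns what the stored edges imply. Combining them (fuzzy matching to propose edges, traversal to reason over the accepted ones) is one way the two angles could complement each other.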