FuzzyTable is a large-scale, low-latency, parallel fuzzy-matching database built on Hadoop
Interesting solution built on top of Hadoop:
FuzzyTable is a large-scale, low-latency, parallel fuzzy-matching database built over Hadoop. […] Our work involved developing two major components – a clustering process that we use to reduce the total search space for each query, and a client-server system for performing fuzzy matching on demand and in parallel across a Hadoop instance.
Even if they are not the first using this architecture, probably others could have gone a different route, trying to use Solr or ElasticSearch alone.
Original title and link: FuzzyTable is a large-scale, low-latency, parallel fuzzy-matching database built on Hadoop (NoSQL databases © myNoSQL)
via: http://www.cloudera.com/blog/2010/11/tackling-large-scale-data-in-government/