accumulo: All content tagged as accumulo in NoSQL databases and polyglot persistence
Monday, 3 September 2012
Accumulo: Why the World Needs Another NoSQL Database
Why Accumulo? Except the quick review I’ve posted last year and recent Senate discussions about it, I haven’t heard much about Accumulo.
Jeff Kelly’s post over SiliconAngle looks into some of Accumulo’s prominent features and also introduces Sqrrl a company created to commercialize Accumulo for the enterprise world1.
The combination of consistency at scale, fast read/writes and cell-level security makes Accumulo an intriguing entrant to the NoSQL community.
Original title and link: Accumulo: Why the World Needs Another NoSQL Database (©myNoSQL)
via: http://siliconangle.com/blog/2012/08/20/accumulo-why-the-world-needs-another-nosql-database/
Tuesday, 7 August 2012
Accumulo, HBase, Cassandra and Some Unanswered Questions
Cade Metz for Wired:
The bill bars the DoD from using the database unless the department can show that the software is sufficiently different from other databases that mimic BigTable. But at the same time, the bill orders the director of the NSA to work with outside organizations to merge the Accumulo security tools with alternative databases, specifically naming HBase and Cassandra.
Is this good for HBase and Cassandra? Is this good for encouraging innovation? Is this good for supporting businesses? Just a few questions I couldn’t answer myself after reading this article about the investigation initiated by the Senate into NSA’s open sourced Accumulo.
Original title and link: Accumulo, HBase, Cassandra and Some Unanswered Questions (©myNoSQL)
via: http://www.wired.com/wiredenterprise/2012/07/nsa-accumulo-google-bigtable/
Monday, 5 September 2011
Accumulo: A New BigTable Inspired Distributed Key/Value by NSA
The National Security Agency has submitted to Apache Incubator a proposal to open source Accumulo, a BigTable inspired key-value store that they were building since 2008. The project proposal page provides more details about Accumulo history, building blocks, and how it compares to the other BigTable open source implementation HBase:
-
Access Labels: Accumulo has an additional portion of its key that sorts after the column qualifier and before the timestamp. It is called column visibility and enables expressive cell-level access control. Authorizations are passed with each query to control what data is returned to the user.
-
Iterators: Accumulo has a novel server-side programming mechanism that can modify the data written to disk or returned to the user. This mechanism can be configured for any of the scopes where data is read from or written to disk. It can be used to perform joins on data within a single tablet.
-
Flexibility: Accumulo places no restrictions on the column families. Also, each column family in HBase is stored separately on disk. Accumulo allows column families to be grouped together on disk, as does BigTable.
-
Logging: HBase uses a write-ahead log on the Hadoop Distributed File System. Accumulo has its own logging service that does not depend on communication with the HDFS NameNode.
-
Storage: Accumulo has a relative key file format that improves compression.
You can read more about Accumulo here and check the Hacker News and Reddit discussions.
Michael Stack has commented on the HBase mailing list:
The cell based ‘access labels’ seem like a matter of adding an extra field to KV and their Iterators seem like a specialization on Coprocessors. The ability to add column families on the fly seems too minor a difference to call out especially if online schema edits are now (soon) supported. They talk of locality group like functionality too — that could be a significant difference. We would have to see the code but at first blush, differences look small.
Original title and link: Accumulo: A New BigTable Inspired Distributed Key/Value by NSA (©myNoSQL)