graphdb: All content tagged as graphdb in NoSQL databases and polyglot persistence
Wednesday, 29 May 2013
Titan: Data Loading and Transactional Benchmark
The Aurelius team describing an advanced benchmark of Titan, a massive scale property graph allowing real-time traversals and updates, sponsored by Pearson, developed and run over 5 months:
The 10 terabyte, 121 billion edge graph was loaded into the cluster in 1.48 days at a rate of approximately 1.2 million edges a second with 0 failed transactions. These numbers were possible due to new developments in Titan 0.3.0 whereby graph partitioning is achieved using a domain-basedbyte order partitioner.
✚ The answer to why Titan is built on Cassandra can be found in this interview between Aurelius CTO Matthias Broecheler and DataStax co-founder Matt Pfeil:
[…] we don’t have to worry about things like replication, backup, and snap shots because all of that stuff is handled by Cassandra. We really just focus on: “How do you distribute a graph?”, “How do you represent a graph efficiently in a big table model?”, “How do you do things like etched compression and other things that are very graph specific in order to make the database fast? And, lastly, “How do to build intelligence index structures so that the graphs traversals, which are the core of any graph database, so that those are as fast as possible?”
Original title and link: Titan: Data Loading and Transactional Benchmark (©myNoSQL)
via: http://www.planetcassandra.org/blog/post/educating-the-planet-with-pearson
Thursday, 23 May 2013
Neo4j 1.9 General Availability - Auto-clustering, Cypher, and Some comments
The 1.9 release adds primarily three things:
- Auto-Clustering, which makes Neo4j Enterprise clustering more robust & easier to administer, with fewer moving parts
- Cypher language improvements make the language more functionally powerful and more performant, and
- New welcome pages make learning easier for new users
- The first is for the enterprise customers and brings in the features that were initially supported through ZooKeeper
- Cypher is Neo4j’s fast evolving query language
- The site is brilliant.
- The release post is terrible with no links to dive into the newly announced features.
Original title and link: Neo4j 1.9 General Availability - Auto-clustering, Cypher, and Some comments (©myNoSQL)
via: http://blog.neo4j.org/2013/05/neo4j-19-general-availability.html
Monday, 20 May 2013
Neo4j Blog: Reloading my Beergraph - using an in-graph-alcohol-percentage-index
Rik Van Bruggen about data modeling in Neo4j:
One of the things that spurred the discussion was - probably not coincidentally - the AlcoholPercentage. Many people were expecting that to be a property of the Beerbrand - but instead in my beergraph, I had “pulled it out”. The main reason at the time was more coincidence than anything else, but when you think of it - it’s actually a fantastic thing to “pull things out” and normalise the data model much further than you probably would in a relational model. By making the alcoholpercentage a node of its own, it allowed me to do more interesting queries and pathfinding operations - which led to interesting beer recommendations. Which is what this is all about, right?
I can see where this is going, but I’m not sure I agree it’s the right approach. Basically in this case it works because the domain of the field is both discrete and small. Ideally, though, what you’d actually want is an index that could give you nodes that are “close-to-some value” (e.g.: “give me the beers in the 6.9-7.1 range”)
Original title and link: Neo4j Blog: Reloading my Beergraph - using an in-graph-alcohol-percentage-index (©myNoSQL)
via: http://blog.neo4j.org/2013/05/reloading-my-beergraph-using-in-graph.html
Monday, 13 May 2013
Bootstrapping Neo4j With Spring-Data...without XML
The emphasis is on without XML:
With the maturing of Spring-Data I started porting all my personal projects to use Spring Data for bootstrapping.
Quite a bit of annotations needs, but I’d go with that instead of XML.
Original title and link: Bootstrapping Neo4j With Spring-Data…without XML (©myNoSQL)
via: http://codepitbull.wordpress.com/2013/05/12/bootstrapping-neo4j-with-spring-data-without-xml/
Wednesday, 3 April 2013
A Quick Guide to Testing Spring Data Neo4j With NoSQLUnit
Alex Soto:
Spring Data Neo4j is the project within Spring Data project which provides an extension to the Spring programming model for writing applications that uses Neo4j as graph database. To write tests using NoSQLUnit for Spring Data Neo4j applications, you do need nothing special apart from considering that Spring Data Neo4j uses a special property called type in graph nodes and relationships which stores the fully qualified classname of that entity.
Is there a BigDataUnit framework? My only requirement is to use XML. Heavily.
Original title and link: A Quick Guide to Testing Spring Data Neo4j With NoSQLUnit (©myNoSQL)
via: http://www.javacodegeeks.com/2013/03/testing-spring-data-neo4j-applications-with-nosqlunit.html
Monday, 18 March 2013
Neo4j-Based Bitcoin Block Chain Visualizer
Pretty interesting usage of Neo4j for visualizing Bitcoin block chain:
Source code available on GitHub.
Original title and link: Neo4j-Based Bitcoin Block Chain Visualizer (©myNoSQL)
Friday, 8 March 2013
Adding Value Through Graph Analysis Using Titan and Faunus
Interesting slidedeck by Matthias Broecheler introducing 3 graph-related tools developed by Vadas Gintautas, Marko Rodriguez, Stephen Mallette and Daniel LaRocque:
- Titan: a massive scale property graph allowing real-time traversals and updates
- Faunus: for batch processing of large graphs using Hadoop
- Fulgora: for global running graph algorithms on large, compressed, in-memory graphs
The first couple of slides are also showing some possible use cases where these tools would prove their usefulness:
Original title and link: Adding Value Through Graph Analysis Using Titan and Faunus (©myNoSQL)
Tuesday, 12 February 2013
A Human-Readable Jackrabbit Persistence Manager Prototype for Orientdb
Jackrabbit still has a very special place in my heart. I’ve fought it many times, sometimes losing, most of the time winning. But for over 7 years now, it is still the main storage engine serving the content of InfoQ. So this OrientDB engine for Jackrabbit by Thomas Kratz caught my attention:
This has some limitations, as jackrabbit will still access only one node at a time, being able to traverse the graph at the storage level is simply not intended by the whole api. But it works, it’s readable, can be modified at the db level easily.
Original title and link: A Human-Readable Jackrabbit Persistence Manager Prototype for Orientdb (©myNoSQL)
via: http://thomaskratz.blogspot.de/2013/01/a-human-readable-jackrabbit-persistence.html
Tuesday, 5 February 2013
Neo4j Interviews: The Vision, the Business and Enterprise Talk, and the Tech
Over the weekend I’ve watched two interviews with people working on Neo4j. Each of them covers it from a different angle: Ian Robison’s interview is the technical one, while Emil Eifrem is giving more of the vision, business, enterprise interview. Pick the type of topic you like and watch it. Both are great though.
If you like going back in time, I’ve found a couple of old presentations from and interviews with Emil Eifrem:
Original title and link: Neo4j Interviews: The Vision, the Business and Enterprise Talk, and the Tech (©myNoSQL)
Friday, 1 February 2013
Using Treetop and Neo4j Cypher to Simulate Facebook Graph Search
Interesting as an exercise considering Max de Marzi shared all the code on GitHub, but completely unrelated to the breadth and depth of the Facebook Graph Search.
Original title and link: Using Treetop and Neo4j Cypher to Simulate Facebook Graph Search (©myNoSQL)
via: http://maxdemarzi.com/2013/01/28/facebook-graph-search-with-cypher-and-neo4j/
Wednesday, 23 January 2013
Neo Technology Is H… Wait, It’s Building Neo4j-As-A-Service
Neo Technology’s hiring announcement is clear about their intention:
“[…] you will be resonsible for building, managing, and maintaining a 24x7 NOSQL Databases-as-a-Service operation […]”
In the graph databases space, OrientDB is offering a hosting solution NuvolaBase, but I have no numbers about their business so far.
Original title and link: Neo Technology Is H… Wait, It’s Building Neo4j-As-A-Service (©myNoSQL)
Linkurious: Visualize Graph Data Easily
Nice tool for visualizing and exploring graph databases:
Currently it supports only Neo4j, but it can be extended to other graph databases.
Original title and link: Linkurious: Visualize Graph Data Easily (©myNoSQL)
Most Popular Articles
- Translate SQL to MongoDB MapReduce
- Tutorial: Getting Started With Cassandra
- CouchDB vs MongoDB: An attempt for a More Informed Comparison
- Cassandra @ Twitter: An Interview with Ryan King
- A Couple of Nice GUI Tools for MongoDB
- NoSQL benchmarks and performance evaluations
- Ehcache: Distributed Cache or NoSQL Store?
- Document Databases Compared: CouchDB, MongoDB, RavenDB
- Quick Review of Existing Graph Databases
- NoSQL Data Modeling

