The 310 billion triple result that Franz is announcing today was achieved in only two weeks of access (actual loading time of just over 78 hours) to an 8-socket Intel Xeon E7-8870 processor-based server system configured with 2 terabytes of physical memory and 22 terabytes of physical disk.
“We’re confident that with additional time, another terabyte of memory, and a bit more storage capacity, the previously unreachable goal of 1 trillion triples can be achieved. Even double that is not out of the question,” stated Dr. Jans Aasman, CEO of Franz Inc.
I’m afraid to ask how much would this cost. But we already know that scaling graph databases is still an open question.
This next answer shows why different data and processing models are needed for different scenarios:
Dr. Aasman said, “Some people have asked, ‘Why not do this on a distributed cloud system with Hadoop?’ The quick answer: NoSQL databases like Hadoop and Cassandra fail on joins. Big Enterprise, big web companies and big government intelligence organizations are all looking into big data to work with massive amounts of semi-unstructured data. They are finding that NoSQL databases are wonderful if one needs access to a single object in an ocean of billions of objects, however, they also find that the current NoSQL databases fall short if you need to run graph database operations that require many complicated joins. A typical example would be performing a social network analysis query on a large telecom call detail record database.”
Original title and link: Franz’s AllegroGraph Sets New Triple Store Record (NoSQL databases © myNoSQL)