ALL COVERED TOPICS

NoSQL Benchmarks NoSQL use cases NoSQL Videos NoSQL Hybrid Solutions NoSQL Presentations Big Data Hadoop MapReduce Pig Hive Flume Oozie Sqoop HDFS ZooKeeper Cascading Cascalog BigTable Cassandra HBase Hypertable Couchbase CouchDB MongoDB OrientDB RavenDB Jackrabbit Terrastore Amazon DynamoDB Redis Riak Project Voldemort Tokyo Cabinet Kyoto Cabinet memcached Amazon SimpleDB Datomic MemcacheDB M/DB GT.M Amazon Dynamo Dynomite Mnesia Yahoo! PNUTS/Sherpa Neo4j InfoGrid Sones GraphDB InfiniteGraph AllegroGraph MarkLogic Clustrix CouchDB Case Studies MongoDB Case Studies NoSQL at Adobe NoSQL at Facebook NoSQL at Twitter

NAVIGATE MAIN CATEGORIES

Close

Oracle: All content tagged as Oracle in NoSQL databases and polyglot persistence

The Oracle NoSQL Database and Big Data Appliance

There’s been a lot of speculation about the announcements coming from Oracle’s OpenWorld event. A first part was revealed during the keynote in the form of an in-memory analytics appliance called Exalytics [2]. But there’s talk about a Big Data Appliance and an Oracle NoSQL database.

Here’re my predictions[1]

  1. Oracle became very aggressive in selling products based on hardware, software, and services. So they’ll announce a Hadoop appliance integrated with an existing Oracle product. It could be either the Oracle Exadata or even the newly announced Exalytics.

    This appliance will place Oracle in competition with all other Hadoop appliance sellers: EMC, NetApp, IBM. Also these days most of the analytics databases try to integrate with Hadoop.

  2. Oracle already has a couple of non-relational solutions in their portfolio: BerkleyDB, TimesTen, Coherence. And they’ve already started to test the NoSQL market by announcing the MySQL and MySQL Cluster NoSQL hybrid systems.

    I don’t expect Oracle NoSQL database to be a new product. Just a rebranding or repackaging of one of the above mentioned ones. Probably the TimesTen.

  3. Oracle will invest more into integrating its line of products with Hadoop. Having both a Hadoop and an in-memory analytics appliance will make them very competitive in this space.

  4. Oracle will extend the support for NoSQLish interfaces (memcached) to its other database products.

What are your predictions?


  1. or speculations  

  2. I’m currently gathering more details about Exalytics.  

Original title and link: The Oracle NoSQL Database and Big Data Appliance (NoSQL database©myNoSQL)


Will Oracle Win the NoSQL Competition

I agree this title is misleading but problem is clear: today Oracle does not provide any product can compete with new cloud computing needs and with the NoSQL movement. It is not possibile to think that actually the RAC technology of oracle can be used in a cloud environment and also a cloud service cannot be deployed over an Exadata.

The real question though is if Oracle is really interested by the market currently served by NoSQL databases and/or hybrid solutions. And judging by the latest versions of MySQL and MySQL Cluster[1] it looks like they are testing the waters.


  1. Latest versions of MySQL and MySQL Cluster are adding support for using the Memcached protocol. See NoSQL to MySQL with Memcached  

Original title and link: Will Oracle Win the NoSQL Competition (NoSQL database©myNoSQL)

via: http://www.stefanocislaghi.eu/2011/09/will-oracle-win-the-nosql-competition/


Enterprise Big Data Stack vs Open Source Big Data Stack

Goldmacher estimated that YouTube consumption—user uploads of 48 hours of video a minute and 3 billion videos a day along with roughly 45 petabytes of viewed videos a day—would require at least 9 full-rack Exadata machines at $1.5 million each. There would be at least 18 Exadata machines to handle spikes. Those machines would add up to 14 Exalogic devices to serve data at $1.1 million per system. The software stack under Oracle would include WebLogic middleware, Oracle databases, Exadata optimized storage and Oracle as operating system. The open source comparison included JBoss middleware, MySQL, Hadoop and Red Hat Enterprise Linux as the OS.

Big Data Enterprise Stack

Big Data Open Source Stack

Credit Peter Goldmacher (Cowen & Co. analyst)

Two comments (the only I have):

  1. what advantages would the enterprise stack offer to justify a 5x cost?
  2. in case all numbers are completely wrong, what’s the advantage of the enterprise stack?

Original title and link: Enterprise Big Data Stack vs Open Source Big Data Stack (NoSQL database©myNoSQL)

via: http://www.zdnet.com/blog/btl/big-data-vs-traditional-databases-can-you-reproduce-youtube-on-oracles-exadata/52053


Oracle and IBM May Not Know Big Data, but Neither Does Ballmer

The echo chamber is reacting:

Specifically, for a data processing and analytics project to qualify as Big Data, it must encompass not just internal corporate data, but also third-party data that resides outside the firewall, according to Ballmer. He said IBM and Oracle limit their Big Data approaches to internal data, thus they are not in fact Big Data by his definition.

[…]

IBM, Oracle and now Microsoft are jockeying to position each of their approaches to Big Data as the industry standard, and Ballmer is clearly trying to steer the Big Data conversation towards Microsoft’s strengths and away from its weaknesses. That means talking up Microsoft’s ability to integrate third-party data with relatively large volumes of corporate data inside Microsoft’s SQL Server R2 Parallel Data Warehouse and away from its lack of petabyte-scale data processing power.

I guess there will be no end to the Oracle-IBM-Microsoft triangle love, so I’ll stop here until real facts are added to the story.

Original title and link: Oracle and IBM May Not Know Big Data, but Neither Does Ballmer (NoSQL database©myNoSQL)

via: http://wikibon.org/blog/oracle-and-ibm-may-not-know-big-data-but-neither-does-ballmer/


The Disruptive Value of Distributed Key-Value Stores

Martin Schneider (Basho):

Organizations with specific needs best met by a platform like Riak could save a company:

  • Millions of dollars in oracle license/maintenance
  • Hundreds of thousands a year in BI system license/maintenance
  • Up to hundreds of thousands in sys-admin salary/overhead

This sounds correct in theory. But the last couple of Oracle databases I’ve seen were:

  1. serving multiple applications
  2. sharing data between applications
  3. used for generating tens/hundreds of reports

So, the online storage/OLTP costs equation to beat is:

licenses + operational costs + “data integration costs” + etl + reporting « licenses + operational costs

Original title and link: The Disruptive Value of Distributed Key-Value Stores (NoSQL databases © myNoSQL)

via: http://blog.basho.com/2011/04/22/The-Disruptive-Value-of-Distributed-Key-Value-Store/?utm_source=feedburner&utm_medium=feed&utm_campaign=Feed%3A+basho%2FknDR+%28The+Basho+Blog%29&utm_content=Google+Reader


Types of Big Data Work

Mike Minelli: Working with big data can be classified into three basic categories […] One is information management, a second is business intelligence, and the third is advanced analytics

Information management captures and stores the information, BI analyzes data to see what has happened in the past, and advanced analytics is predictive, looking at what the data indicates for the future.

There’s also a list of tools for BigData: AsterData (acquired by Teradata), Datameer, Paraccel, IBM Netezza, Oracle Exadata, EMC Greenplum.

Original title and link: Types of Big Data Work (NoSQL databases © myNoSQL)

via: http://www.linuxinsider.com/story/71945.html


Oracle and MySQL Future

Curt Monash:

We’ll know they’re even more serious if they buy MySQL enhancements such as Infobright, dbShards, or Schooner MySQL

Why?

Original title and link: Oracle and MySQL Future (NoSQL databases © myNoSQL)

via: http://www.dbms2.com/2011/03/15/mysql-soundbites/?utm_source=feedburner&utm_medium=feed&utm_campaign=Feed%3A+dbms2%2Ffeed+%28DBMS2+--+DataBase+Management+System+Services%29&utm_content=Google+Reader


Paper: Netflix’s Transition to High-Availability Storage Systems

A while ago, Sid Anand[1] has written a series of posts on challenges of a hybrid solution: Oracle - Amazon SimpleDB. This has become now a paper which offers a much better organized and detailed view on Netflix’s transition to using a hybrid Oracle - Amazon Web Services (SimpleDB, S3) architecture.

Go read the ☞ paper if one of these applies:

  • interested in Amazon SimpleDB and SimpleDB best practices
  • interested in running an on-premise and cloud hybrid architecture
  • interested in architecting a multi data source system

  1. Siddharth “Sid” Anand, Netflix cloud engineer, @r39132  ()

Original title and link: Paper: Netflix’s Transition to High-Availability Storage Systems (NoSQL databases © myNoSQL)


Oracle Drops InnoDB from MySQL Classical Edition, But Not From Community Edition

I have heard many mentioning that Oracle removed InnoDB from the MySQL classical edition version. Now, I don’t know too much about the various versions and licenses of MySQL — it looks like there are at least 5: enterprise, classical, standard, cluster carrier grade, and community — but InnoDB doesn’t seem to have been dropped from the community edition too. So, I’m not really sure this is such a big deal.[1]

What are your thoughts on this story?

Update: Basho, creator of Riak that offers a pluggable storage engine based on InnoDB, ☞ clarifies the status of InnoDB:

InnoDB is available under the GPL. Innostore, as a derivative work of Embedded InnoDB, is also available under the GPL. Neither Oracle nor Basho can take that away from you.


  1. If everyone would actually be forced to go back at using MyISAM, that would be a bit more interesting as it would mean MySQL will be less durable and consistent.  ()

Original title and link: Oracle Drops InnoDB from MySQL Classical Edition, But Not From Community Edition (NoSQL databases © myNoSQL)


Oracle impact on the Open Source Relational Databases

Cheap:

Oracle has shut down servers Sun Microsystems was contributing to the build farm for open source database software, PostgreSQL, forcing enthusiasts to scramble to find new hosts to test updates to their software on the Solaris operating system.

Keep in mind that these were 3 (three) servers. Not 300, not even 30.

The fact that I cover NoSQL databases doesn’t mean that I don’t care about relational databases or that we will not need them. Having a healthy open source relational database ecosystem is essential. And please don’t say or even think that this will help in any ways the NoSQL community!

via: http://www.itnews.com.au/News/221051,oracle-shuts-down-open-source-test-servers.aspx


Oracle and Hadoop Getting Closer

Back in January, I was writing about some solutions to integrate Oracle databases with Hadoop and also speculated that most probably more and more tools will look into providing MapReduce support.

Last week, Cloudera and Quest Software have ☞ announced a new connector allowing Hadoop to pull data from Oracle databases.

For those database administrators who’ve suppressed their jealousy, while their programming and scripting colleagues talk gleefully about the death of relational databases, this week’s news may cause them to regain some pride.

[…] The new connector is currently known as Ora-Oop.


Hadoop and Oracle Parallel Processing

At the end of last year, in a “reconciliation” attempt, I was writing that even if not all, more RDBMS are looking into integrating MapReduce in their set of tools.

Oracle seems to be in the first lines of this initiative as it looks like ☞ its database becomes more and more aware of systems like Hadoop. The linked article presents two ways in which Oracle can pull data out of HDFS by either accessing it directly through the FUSE driver or by triggering Hadoop to push data into Oracle queues which are further accessible from table functions. A commenter on the post has suggested a 3rd option that sounds even more interesting: using the Oracle Java support for accessing the Hadoop API.

Diagram from Oracle Blogs

While the presented solutions are only about pulling data from Hadoop and processing them in parallel using Oracle parallel processing support, I do think that sooner than later we will see solutions that will use Hadoop for processing data made accessible directly by Oracle.

Here is just a thought on how this would work:

  • use some special Oracle functions to pull data from tables and push it into Hadoop accessible queues
  • Hadoop (with streaming support) would pull out data from queues and process them internally
  • when processing is done, Hadoop can push back data into Oracle accessible queues (as per the above solutions).

Isn’t that an interesting future?

Update: in the light of the newly granted MapReduce patent (Google), I guess it will be a bit more difficult to blame anyone for not incorporating or integrating more closely with Hadoop. What do you think?