EC2: All content tagged as EC2 in NoSQL databases and polyglot persistence
Friday, 8 February 2013
Deploying Riak on EC2 - What to Pick?
Deepak Bala sharing his recommendations for running Riak on EC2 based on his own experience:
There are a couple of problems to field when deploying Riak.
The EC2 instances that are provisioned by default change the following on restart.
- Private IP address
- Public IP address
- Private DNS
- Public DNS
- EBS instances provide stable durable storage while Ephemeral storage provides for better predictable performance at the cost of losing data on restarts.
Performance.
Original title and link: Deploying Riak on EC2 - What to Pick? (©myNoSQL)
Wednesday, 31 October 2012
Thursday, 25 October 2012
YCSB Benchmark Results for Cassandra, HBase, MongoDB, MySQL Cluster, and Riak
Put together by the team at Altoros Systems Inc., this time run in the Amazon EC2 and including Cassandra, HBase, MongoDB, MySQL Cluster, sharded MySQL and Riak:
After some of the results had been presented to the public, some observers said MongoDB should not be compared to other NoSQL databases because it is more targeted at working with memory directly. We certainly understand this, but the aim of this investigation is to determine the best use cases for different NoSQL products. Therefore, the databases were tested under the same conditions, regardless of their specifics.
Teaser: HBase got the best results in most of the benchmarks (with flush turned off though). And I’m not sure the setup included the latest HBase read improvements from Facebook.
Original title and link: YCSB Benchmark Results for Cassandra, HBase, MongoDB, MySQL Cluster, and Riak (©myNoSQL)
Tuesday, 31 July 2012
I/O Intensive Apps and Amazon Cloud Improvements: EBS Provisioned IOPS & Optimized Instance Types
James Hamilton puts in perspective the last two new I/O related features coming from Amazon: the high performance I/O EC2 instances and EBS provisioned IOPS together with EBS-optimized EC2 instances:
With the announcement today, EC2 customers now have access to two very high performance storage solutions. The first solution is the EC2 High I/O Instance type announced last week which delivers a direct attached, SSD-powered 100k IOIPS for $3.10/hour. In today’s announcement this direct attached storage solution is joined by a high-performance virtual storage solution. This new type of EBS storage allows the creation of striped storage volumes that can reliably delivery 10,000 to 20,000 IOPS across a dedicated virtual storage network.
I’ve already said it, but this confirms it once again that Amazon is addressing most of the complains of running I/O intensive applications on EC2 and EBS.
Original title and link: I/O Intensive Apps and Amazon Cloud Improvements: EBS Provisioned IOPS & Optimized Instance Types (©myNoSQL)
via: http://perspectives.mvdirona.com/2012/08/01/EBSProvisionedIOPSOptimizedInstanceTypes.aspx
Thursday, 19 July 2012
Amazon Introduces High I/O SSD-backed EC2 Instances
Jeff Barr:
In order to meet this need, we are introducing a new family of EC2 instances1 that are designed to run low-latency, I/O-intensive applications, and are an exceptionally good host for NoSQL databases such as Cassandra and MongoDB.
Many complains about running databases on EC2 instances were about the I/O. I guess Amazon has been hearing this loud and clear.
-
Specs of the new EC2 instace: ↩
- 8 virtual cores (35 ECU)
- HVM and PVM virtualization.
- 60.5 GB of RAM.
- 10 Gigabit Ethernet connectivity with support for cluster placement groups.
- 2 TB of local SSD-backed storage, visible as a pair of 1 TB volumes.
Original title and link: Amazon Introduces High I/O SSD-backed EC2 Instances (©myNoSQL)
via: http://aws.typepad.com/aws/2012/07/new-high-io-ec2-instance-type-hi14xlarge.html
Wednesday, 18 July 2012
Benchmarking High Performance I/O With SSD for Cassandra on AWS
Adrian Cockcroft:
The SSD based system running the same workload had plenty of IOPS left over and could also run compaction operations under full load without affecting response times. The overall throughput of the 12-instance SSD based system was CPU limited to about 20% less than the existing system, but with much lower mean and 99th percentile latency. This sizing exercise indicated that we could replace the 48 m2.4xlarge and 36 m2.xlarge with 15 hi1.4xlarge to get the same throughput, but with much lower latency.
Tons of details and data about the benchmarks Netflix ran against the new high I/O SSD-backed EC2 instances. Results are even more impressive than the IOPS numbers in Werner Vogel’s High performance I/O instances for EC2.
Original title and link: Benchmarking High Performance I/O With SSD for Cassandra on AWS (©myNoSQL)
via: http://techblog.netflix.com/2012/07/benchmarking-high-performance-io-with.html
Monday, 2 April 2012
The Total Cost of (Non) Ownership of a NoSQL Database Service
The Amazon team released a whitepaper comparing the total cost of ownership for 3 scenarios:
- on-premise NoSQL database
- NoSQL database deployed on Amazon EC2 and Amazon EBS
- Amazon DynamoDB

As you can imagine DynamoDB comes out as the most cost-effective solution (79% more effective than on-premise NoSQL database and 61% more cost-effective than AWS hosted NoSQL database). Read or download the paper after the break.
Thursday, 22 March 2012
Wordnik: Migrating From a Monolythic Platform to Micro Services
The story of how Wordnik changed a monolithic platform to one based on Micro Services and the implications at the data layer (MongoDB):
To address this, we made a significant architectural shift. We have split our application stack into something called Micro Services — a term that I first heard from the folks at Netflix. […] This translates to the data tier as well. We have low cost servers, and they work extremely well when they stay relatively small. Make them too big and things can go sour, quickly. So from the data tier, each service gets its own data cluster. This keeps services extremely focused, compact, and fast — there’s almost no fear that some other consumer of a shared data tier is going to perform some ridiculously slow operation which craters the runtime performance. Have you ever seen what happens when a BI tool is pointed at the runtime database? This is no different.
Original title and link: Wordnik: Migrating From a Monolythic Platform to Micro Services (©myNoSQL)
via: http://blog.wordnik.com/with-software-small-is-the-new-big
Monday, 30 January 2012
NoSQL Tutorial: Setting Up a Hadoop Cluster with MongoDB Support on EC2
A complete and detailed guide for setting up a Hadoop cluster using MongoDB by Arten Yankov. It uses the MongoDB Hadoop adapter mongo-hadoop , which provides input and output adapters, support for InputSplits, and write-only Pig.
What is covered in the tutorial:
- Creating an AMI with the custom settings (installed hadoop and mongo-hadoop)
- Launching a hadoop cluster on EC2
- Adding more nodes to the cluster
- Running some sample jobs
Original title and link: NoSQL Tutorial: Setting Up a Hadoop Cluster with MongoDB Support on EC2 (©myNoSQL)
via: http://artemyankov.com/post/16717104998/how-to-set-up-a-hadoop-cluster-with-mongo-support-on
Monday, 22 August 2011
Reliable, Scalable, and Kinda Sorta Cheap: A Cloud Hosting Architecture for MongoDB
Using MongoDB replicate sets:
At Famigo, we house all of our valuable data in MongoDB and we also serve all requests from Amazon EC2 instances. We’ve devoted many mental CPU cycles to finding the right architecture for our data in the cloud, focusing on 3 main factors: cost, reliability, and performance.
Original title and link: Reliable, Scalable, and Kinda Sorta Cheap: A Cloud Hosting Architecture for MongoDB (©myNoSQL)
via: http://www.codypowell.com/taods/2011/08/a-cloud-hosting-architecture-for-mongodb.html
Friday, 12 August 2011
MongoDB and Amazon: Why EBS?
After linking to the MongoDB in the Amazon cloud, MongoDB and EC2 and the older MongoDB on Amazon EC2 with EBS volumes , Arnout Kazemier commented:
The only thing I dislike about that EC2 guide is that it’s suggesting to use EBS instead of the regular EC2 instance storage
This is an apt question in the light of the prolongued Amazon outage, Reddit’s experience with EBS, the unpredictable EBS performance, and Netflix’s Adrian Cockcroft explanation of multi-tenancy impact on the Amazon EBS performance. Maybe someone could answer it.
Original title and link: MongoDB and Amazon: Why EBS? (©myNoSQL)
Tuesday, 9 August 2011
Setting Up MongoDB Replica Sets on Amazon EC2
Zachary Witte:
When you have the instance basically set, go back into the AWS control panel, right click the instance and choose Create Image. You can start up any number of these for the replica set, but you need to change the /etc/hostname and /etc/hosts file to reflect the individual IP address and hostname of the bot (db1, db2, db3, etc.)
Before you set up MongoDB on EC2 make sure you understand the various aspects of running MongoDB in the Amazon cloud:
Original title and link: Setting Up MongoDB Replica Sets on Amazon EC2 (©myNoSQL)
via: http://www.zacwitte.com/how-to-set-up-ubuntu-w-mongodb-replica-sets-on-amazon-ec2
Most Popular Articles
- Translate SQL to MongoDB MapReduce
- Tutorial: Getting Started With Cassandra
- CouchDB vs MongoDB: An attempt for a More Informed Comparison
- Cassandra @ Twitter: An Interview with Ryan King
- A Couple of Nice GUI Tools for MongoDB
- NoSQL benchmarks and performance evaluations
- Ehcache: Distributed Cache or NoSQL Store?
- Document Databases Compared: CouchDB, MongoDB, RavenDB
- Quick Review of Existing Graph Databases
- NoSQL Data Modeling
