To begin with, let me tell you that the point of choosing Hive was not to use it as a JDBC equivalent. It was more to understand how to use Hive as a powerful data warehouse analytics engine.
Here's a half-hour conversation that BIG had with VoltDB's Dr. Mike Stonebraker.
An important part of doing test-driven development right is finding a good ratio of what kind of code is to be covered…
Here is a lengthy list of MongoDB-related links for devs interested in using the popular NoSQL solution with Java, Scala, Groovy, and more.
Most Pig tutorials you will find assume that you are working with data where you know all the column names ahead of time, and that the column names themselves are . . .
With Couchbase’s append-only storage design, it’s impossible to corrupt data and index files as updates go only to the end of the file.
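To make the append-only idea concrete, here is a minimal Python sketch. It is a toy illustration, not Couchbase's actual file format or API: the `AppendOnlyStore` class, its JSON-lines layout, and the in-memory offset index are all assumptions made for this example. Updates are only ever written at the end of the file, so earlier records are never overwritten in place.

```python
import json
import os
import tempfile


class AppendOnlyStore:
    """Toy key-value store (hypothetical) that only appends to its file."""

    def __init__(self, path):
        self.path = path
        self.index = {}  # key -> byte offset of the latest record
        open(self.path, "ab").close()  # create the file if it is missing
        self._rebuild_index()

    def _rebuild_index(self):
        # Scan the log once; later records for a key shadow earlier ones.
        with open(self.path, "rb") as f:
            offset = 0
            for line in f:
                record = json.loads(line)
                self.index[record["key"]] = offset
                offset += len(line)

    def put(self, key, value):
        line = (json.dumps({"key": key, "value": value}) + "\n").encode("utf-8")
        with open(self.path, "ab") as f:
            offset = f.seek(0, os.SEEK_END)  # new record starts at end of file
            f.write(line)
        self.index[key] = offset

    def get(self, key):
        with open(self.path, "rb") as f:
            f.seek(self.index[key])
            return json.loads(f.readline())["value"]


path = os.path.join(tempfile.mkdtemp(), "store.log")
store = AppendOnlyStore(path)
store.put("a", 1)
store.put("a", 2)  # appended; the older record stays untouched on disk
print(store.get("a"))
```

Because old records are left in place, a real system built this way would also need some form of compaction to reclaim space, which this sketch leaves out.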
One of the Cypher queries that I wanted to write recently was one to find all the players that have started matches for Arsenal this season and the number of matches that they’ve played in.
Camel is the best-of-breed integration framework, and in this post I'm going to show you how to make it even more powerful by leveraging another great project: Redis.
For any complex system (especially a database), sizing is never an easy task. The variables are numerous and the considerations and decisions . . .
Here are steps to follow for achieving this task, as well as the appropriate code snippets and useful screenshots.
One of the requirements we have for one of our Mule ESB implementations is to log the messages that went through the ESB (especially handy during development and testing).
Dependency inversion is the idea that code should depend on abstractions, not on specifics. According to Wikipedia, the principle states . . .
Given the rise in popularity of NoSQL solutions, typically set apart by their explicit forfeiture of traditional RDBMS features, you might think that age-old concepts like ACID transactions and foreign key constraints are simply antiquated.
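As a rough illustration of depending on abstractions rather than specifics, here is a small Python sketch. The names `MessageStore`, `InMemoryStore`, and `Notifier` are hypothetical and chosen only for this example. The high-level `Notifier` knows only the abstract `MessageStore`, so any concrete store can be swapped in without changing it.

```python
from abc import ABC, abstractmethod


class MessageStore(ABC):
    """Abstraction that the high-level code depends on."""

    @abstractmethod
    def save(self, message: str) -> None:
        ...


class InMemoryStore(MessageStore):
    """One concrete detail; a database-backed store could replace it."""

    def __init__(self):
        self.messages = []

    def save(self, message: str) -> None:
        self.messages.append(message)


class Notifier:
    """High-level policy: receives the abstraction, never a concrete class."""

    def __init__(self, store: MessageStore):
        self.store = store

    def notify(self, text: str) -> None:
        self.store.save(text.upper())


store = InMemoryStore()
Notifier(store).notify("hello")
print(store.messages)
```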
This is all explained in more detail in the documentation but I thought it’d be interesting to write about it from the perspective of someone more used to writing SQL and trying to work out how to achieve the same thing in Cypher.
Here's a presentation on Apache Cassandra from Big Data Spain 2012.
Let’s say I have a collection of documents that all kind of look like this . . .
With this tutorial, you should be able to reach your SmartFox game server on the DNS name given to you by Amazon EC2 in the control panel.
Here's a webinar presented by Oren Eini from February 2 that covers SQL Replication, Periodic Backup, Replication/Sharding, and more . . .
Warning: This is another super-technical post. If you don’t know what the Semantic Web and RDF are, this will be incomprehensible.
Once upon a time there was a Munich pizza baker who developed a technique to beam pizza out of bright sunshine. He can produce more than a thousand pizzas per second . . .
Over at the 10gen blog, Emrah Ozcelebi, CEO of SPP42, has written a guest post detailing an interesting case study of how MongoDB is being used by Nokta . . .
The Spring Data repository abstraction reduces the boilerplate code needed to write the data access layer of an application . . .
Here's a video from Enterprise Data World 2012 that explains how graph databases work and how they can be used for risk management.
Neo4j performance tuning deserves its own blog post, but at least now you have a great way of testing your performance as you tweak JVM, cache, hardware, load balancing, and other parameters.
This edition of the NoSQL Week in Review covers our top 5 NoSQL links for the last week. These include a couple of Couchbase updates, as well as an article on polyglot persistence with Gremlin.