A recent paper written by Kiri Wagstaff claims that that machine learning might be in danger of losing its impact because the community as a whole has become quite self-referential.
The NoSQL hype is omnipresent. And many startups are tempted to go for Cassandra/MongoDB/HBase/Redis/… . Here I’ll argue why they should rather stick to a SQL solution – MySQL or PostgreSQL.
I got word about a port of Nerd Dinner to RavenDB, Dinner Party (source, live demo), and I just had to check the code.
An interesting discussion in response to a recent blogger claiming that the model is the single source of truth.
Brendan W. McAdams presents the following presentation (approx. 1 hour) on bringing together MongoDB, a popular NoSQL database, with the Java Virtual Machine.
Cassandra has native indexing capabilities, but those capabilities only work if the values stored in your columns are the values you want indexed.
An attempt to use Neo4j to figure out which people within the graph have worked with the most co-workers
I got a couple of interesting questions about RavenDB implementation, and I thought it would make a good blog post.
This session introduces the basic components of high availability before going into a deep dive on MongoDB replication.
After googling a problem with boolean type fields, this team realized that the solution involved creating a custom user type.
An illustrated discussion of how Neo4j handles optional relationships with code included.
From Dr. Mike Stonebraker over at VoltDB comes a discussion of the future of flash memory.
A lot of the caution that needs to be taken while working in a shared memory system also applis to a situation where redis is operating in a concurrent/distributed environment.
The problem: I’m working on a project where I’m taking a csv file that contains a little more than 100 columns of data by 10,000 rows. . . but when I turn-on processing for the other 9,999 rows, all I get stored into mongo is the first row of data.
It's Friday, so why not take 2 minutes out of your day for this fun song, with the added advantage of learning about seven of the most popular databases available. Be warned though, it's pretty catchy!
I am going to use Redis' atomic and blocking facilities to build a multi-step RSS feed processor. Along the way, some of the topics I hope to touch upon are: queue priorization, synchronization between processes, using redis to gracefully . . .
Continuing a previous post on geospatial indexing with RavenDB, Ayende shows how to upload the MaxMind data into RavenDB.
New features include: complete pagination for row view of explorer and filtering non-displayable data.
The code in this post allows us to query things efficiently and easily. But before we can get there, we need to actually load the data in . . .
This article weighs the pros and cons of NoSQL adoption by the likes of Facebook, Twitter, and Digg.
Rickard Öberg did a lab project last December developing a first prototype of a JDBC driver that connects to the Neo4j Server Cypher endpoint.
Kristina Chodorow presents another great post on MongoDB; this time with a Princess Bride theme.
When we are looking at an unregistered user (somebody just browsing the site) using the Item based recommendation we already built is all we have to go on.
A great intro to GridFS, a convention and API (not something native to your server), and how you can use it from Python
If you fail to indicate the primary key to sqoop, the following exception will be thrown; a similar issue arises without properly starting Cassandra.