HTML5 Canvas
Written by: Simon Sarris
Featured Refcardz: Top Refcardz:
  1. Apache Hadoop
  2. Web Driver
  3. MVVM
  4. REST
  5. ADO.NET
  1. HTML5
  2. Ajax
  3. jQuery Selectors
  4. CSS Part 1
  5. Git

Link Details

Link 377885 thumbnail
User 346073 avatar

By ray@pageonepr.com
via cloudera.com
Published: Mar 11 2010 / 00:29

nugg.ad operates Europe’s largest targeting platform. The company’s core business is to derive targeting recommendations from clicks and surveys. They measure these, store them in log files and later analyze them. Here's the story of how they arrived at using Hadoop for this – and how jobs that required five days to process now require one hour.
  • 21
  • 1
  • 3052
  • 0

Comments

Add your comment
User 264285 avatar

mafr replied ago:

1 votes Vote down Vote up Reply

nugg.ad is a relatively small player on the German market. There are several other companies that operate larger systems for exactly this use case.

One company I know of processes terabytes of log data per day. That's not impressive compared to, say, Google, but it's at least an order of magnitude more than what nugg.ad does.

User 725035 avatar

richhutton replied ago:

0 votes Vote down Vote up Reply

It is true at nugg.ad that we don’t store massive amounts of data per day, if you compare us with competitors. But we also do not need to store much data per single click, and that is a deliberate design decision. Call this a less-is-more kind of philosophy, if you want.

We can simply log and process more if need be, by adding a new machine to our Hadoop cluster. That requires no other changes to the system.

Add your comment


Html tags not supported. Reply is editable for 5 minutes. Use [code lang="java|ruby|sql|css|xml"][/code] to post code snippets.