Link Details

By praskrishna
via krishnasblog.com
Published: Nov 19 2012 / 07:47

Hadoop is built around a distributed file system (HDFS) that lets you store terabytes or petabytes of data on low-end commodity computers. This is similar to how companies like Yahoo and Google store their page feeds. Hive and HBase are used for query processing, and Spring Batch is used to tie all of these together. This article covers some of these tools.