By praskrishna
via krishnasblog.com
Published: Nov 19 2012 / 07:47
Hadoop is primarily a distributed file system (HDFS) that lets you store terabytes or petabytes of data on low-end commodity computers. This is similar to how companies like Yahoo and Google store their page feeds. Hive and HBase are used for query processing, and Spring Batch is used to tie all of these together. This article covers some of these tools.
Tags: frameworks, java, methodology, open source