Link Details

Link 1177501 thumbnail
User 1156439 avatar

By Datakey
via quora.com
Submitted: Jul 07 2014 / 21:56

If there is a text file, sales.txt, with ten million sales records, first it to segment the text file with Cursor for parallel computing, the detailed coding illustration is available. When the data size of a text file amount to several TB, it is required to use multiple node based on cluster to make parallel computation.
  • 2
  • 0
  • 80
  • 26

Add your comment


Html tags not supported. Reply is editable for 5 minutes. Use [code lang="java|ruby|sql|css|xml"][/code] to post code snippets.

Voters For This Link (2)



Voters Against This Link (0)



    Apache Hadoop
    Written by: Piotr Krewski
    Featured Refcardz: Top Refcardz:
    1. Play
    2. Akka
    3. Design Patterns
    4. OO JS
    5. Cont. Delivery
    1. Play
    2. Java Performance
    3. Akka
    4. REST
    5. Java