Link Details


By Datakey
Submitted: Jul 07 2014 / 21:56

Suppose there is a text file, sales.txt, with ten million sales records. The first step is to segment the text file with a cursor so that the segments can be computed in parallel; a detailed coding illustration is available. When the size of a text file amounts to several TB, the parallel computation needs to run on multiple nodes in a cluster.
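The segmentation idea can be sketched in plain Python. This is not the cursor tool the submission refers to, only a minimal illustration of the same technique under stated assumptions: the file is split into byte ranges that end on line boundaries, and each range is summed by a separate worker. The record layout (tab-separated fields, amount in column 1, no header row) and the function names segment_offsets, sum_segment and parallel_total are hypothetical, since the original does not describe the format of sales.txt.

```python
import os
from concurrent.futures import ProcessPoolExecutor


def segment_offsets(path, n_segments):
    """Return up to n_segments (start, end) byte ranges that end on line boundaries."""
    size = os.path.getsize(path)
    bounds = [0]
    with open(path, "rb") as f:
        for i in range(1, n_segments):
            target = size * i // n_segments
            if target <= bounds[-1]:
                continue
            # Finish the line that contains byte target-1, so the next
            # segment always starts at the beginning of a record.
            f.seek(target - 1)
            f.readline()
            pos = f.tell()
            if pos < size:
                bounds.append(pos)
    bounds.append(size)
    return list(zip(bounds[:-1], bounds[1:]))


def sum_segment(task):
    """Sum the amount column over one byte range of the file."""
    path, start, end, column = task
    total = 0.0
    with open(path, "rb") as f:
        f.seek(start)
        while f.tell() < end:
            line = f.readline()
            if not line:
                break
            text = line.decode("utf-8").rstrip("\r\n")
            if not text:
                continue  # skip blank lines
            # Assumed layout: tab-separated fields, numeric amount in `column`.
            total += float(text.split("\t")[column])
    return total


def parallel_total(path, n_segments=4, column=1, executor_cls=ProcessPoolExecutor):
    """Sum the amount column by processing the segments in parallel workers."""
    tasks = [(path, s, e, column) for s, e in segment_offsets(path, n_segments)]
    with executor_cls(max_workers=n_segments) as pool:
        return sum(pool.map(sum_segment, tasks))
```

A call such as parallel_total("sales.txt", n_segments=8) would run on a single machine; when run as a script with the default process pool, the call belongs under an `if __name__ == "__main__":` guard. For files of several TB, the same split-then-combine pattern would have to be distributed across cluster nodes, which this sketch does not attempt.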

