By Datakey
Published: Aug 26 2014 / 14:16

R and Python are often used to group and summarize data from big files, where the source data is large but the computed result is small. Because such a file is difficult to load into memory in full, the practical solution is to import the data in batches, compute a partial result for each batch, and merge the partial results. Here are examples that illustrate how R, esProc and Python group and summarize data from big text files. I would welcome any advice on them.
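The batch-import, compute and merge approach described above can be sketched in Python roughly as follows. This is a minimal illustration using only the standard library, not the code from the linked examples. The function name `summarize_in_batches`, the tab delimiter, the header row, and the column names passed in are all assumptions made for the sketch.

```python
import csv
from collections import defaultdict
from itertools import islice


def summarize_in_batches(path, key_col, value_col, batch_size=100_000, delimiter="\t"):
    """Sum value_col per key_col without loading the whole file into memory.

    Assumes a delimited text file whose first line is a header row
    (hypothetical layout, chosen for illustration).
    """
    totals = defaultdict(float)  # merged result; stays small
    with open(path, newline="") as f:
        reader = csv.DictReader(f, delimiter=delimiter)
        while True:
            # Batch import: read at most batch_size rows.
            batch = list(islice(reader, batch_size))
            if not batch:
                break
            # Compute: group and sum within this batch only.
            partial = defaultdict(float)
            for row in batch:
                partial[row[key_col]] += float(row[value_col])
            # Merge: fold the batch result into the running totals.
            for key, subtotal in partial.items():
                totals[key] += subtotal
    return dict(totals)
```

For example, `summarize_in_batches("sales.txt", "state", "amount")` would return a dictionary of per-state totals, where the file name and both column names are hypothetical. Only one batch of rows and the small merged result are held in memory at any time. Merging by addition works for sums and counts; an average would need to carry a sum and a count per group and divide at the end.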