Core Spring Data
Written by: Oliver Gierke
Featured Refcardz: Top Refcardz:
  1. Git
  2. Prep for CD
  3. DNS
  4. Data Mining
  5. Spring Data
  1. Data Mining
  2. Spring Data
  3. DNS
  4. Machine Learning
  5. MongoDB

Link Details

Link 908545 thumbnail
User 1059807 avatar

By acoin
via java.dzone.com
Published: Jan 18 2013 / 10:47

Last week I wrote a little script in node.js. Its goal? GET ALL THE DATA! The plan was to scrape a massive dataset off Github and do some analysis of programmers’ working habits. The scraping job took the bigger part of a week. On Saturday morning I had a mongo database with a list of 513,900 repositories on a small EC2 instance. They were not guaranteed to be unique.
  • 3
  • 0
  • 43
  • 219

Add your comment


Html tags not supported. Reply is editable for 5 minutes. Use [code lang="java|ruby|sql|css|xml"][/code] to post code snippets.

Voters For This Link (3)



Voters Against This Link (0)