Link Details

Link 39584 thumbnail
User 188435 avatar

By rob_dimarco
via innovationontherun.com
Submitted: Sep 05 2007 / 18:01

Scraping static web sites to verify functionality or to access data has been around as long as there has been a web (example of scraping of a static web page with Ruby). But with the advent of AJAX and other techniques that use JavaScript to dynamically insert HTML into a web page, scraping has gotten more challenging. With the 1.12 release of HtmlUnit, this headless web browser can now support parsing and executing JavaScript and when combined with JRuby, is a great technology for easily construction of a script that parses a dynamic site.
  • 3
  • 0
  • 918
  • 91

Add your comment


Html tags not supported. Reply is editable for 5 minutes. Use [code lang="java|ruby|sql|css|xml"][/code] to post code snippets.

Voters For This Link (3)



Voters Against This Link (0)