Link Details

Link 70230 thumbnail
User 261096 avatar

By pumba
via xml.lt
Published: Mar 13 2008 / 05:52

Regular expressions is probably the most widely–used technique for HTML scraping. However, there are several issues in the regexp implementation. Another approach to scraping is using Document Object Model (DOM).
  • 6
  • 3
  • 3678
  • 2

Add your comment


Html tags not supported. Reply is editable for 5 minutes. Use [code lang="java|ruby|sql|css|xml"][/code] to post code snippets.

Voters For This Link (6)



Voters Against This Link (3)



Play Framework
Written by: Ryan Knight
Featured Refcardz: Top Refcardz:
  1. Akka
  2. Design Patterns
  3. OO JS
  4. Cont. Delivery
  5. HTML5 Mobile
  1. Akka
  2. JUnit/EasyMock
  3. Java Performance
  4. REST
  5. Java