Link Details

Link 724029 thumbnail
User 478055 avatar

By mitchp
via blogs.perl.org
Published: Jan 09 2012 / 13:29

This is the first of a series of posts that will detail a Marpa-based "Ruby Slippers" approach to parsing liberal and defective HTML. As an example, let's look at a few lines taken more or less at random from the middle of the perl.org landing page. That page is exactly 400 lines long. Here is line 200 and some lines lines to either side of it.
  • 7
  • 0
  • 1025
  • 782

Comments

Add your comment
User 205784 avatar

cbegin replied ago:

1 votes Vote down Vote up Reply

It's sad that we're taking a step backwards, because HTML5 is no longer required to be well formed XML -- or call it whatever you want... even though people don't particularly love XML, having well formed HTML that mandates closing tags should have been a priority. Like really... what does a / cost at the end of a tag? I don't mind naked attributes, as that's easy to work into a parser (especially if they're required to be at the end of the element).

Anyway.. kind of sad. Reminds me of 90s web development.

Add your comment


Html tags not supported. Reply is editable for 5 minutes. Use [code lang="java|ruby|sql|css|xml"][/code] to post code snippets.

Voters For This Link (7)



Voters Against This Link (0)



    Spring Integration
    Written by: Soby Chacko
    Featured Refcardz: Top Refcardz:
    1. Search Patterns
    2. Python
    3. C++
    4. Design Patterns
    5. OO JS
    1. PhoneGap
    2. Spring Integration
    3. Regex
    4. Git
    5. Java