DZone Snippets is a public source code repository. Easily build up your personal collection of code snippets, categorize them with tags / keywords, and share them with the world
Extract The Body Of An HTML Document
For example, print out just the body of Google's home page:
use LWP::UserAgent;
use HTML::TreeBuilder;
$ua = LWP::UserAgent->new;
my $req = HTTP::Request->new(GET => 'http://www.google.com/');
my $res = $ua->request($req);
if ($res->is_success) {
my $tree = HTML::TreeBuilder->new_from_content($res->content);
$tree->elementify();
my $body = $tree->find('body');
foreach $e ($body->content_list())
{
print $e->as_HTML();
}
}






Comments
Brian Tetreault replied on Wed, 2007/02/07 - 4:06pm