Hello everyone,
This question is in regards to the HTML::TreeBuilder module in perl. I am using it to extract meta tags from website contents. It was working fine until i tried to extract meta tags from http://www.audicanada.ca/audi/ca/language_selector.html
This site contains all the meta tags that i need, author, keywords, description. But the writer forgot to use newline once in a while and has the whole data in the Head tag in one single line.
Now I am unable to get the tag by using:
for my $tag ($htmlData->look_down(_tag => "meta")) {
$document{lc($tag->attr("name"))} = $tag->attr("content");
}
Now I have a feeling that i missed something in the loop, but haven't been able to figure it out, so think a fresh pair of eyes might find what i must be missing.
Also, 'document' is a hash variable.
Thanks in advance!
Sid