Hey guys,
I want to know how you guys parse html. I'm writing a program to compare products on a website. I already have a system that scrapes the websites.
I looked at HTMLParser, I'm a little confused on how to use it, I was thinking about writing my own script that searches the site for example:
The pricing code looks like this:
<dl class="pricing">
<dt>
Regular Price:
</dt>
<dd>
$899.99
</dd>
</dl>
I would like to extract the Regular Price:
and the 899.99
Should I use a parsing api like HTMLParser or write my own?
PO