Hi,

This is my first post here, as I am new to Python. I've been experimenting with it a bit and I think I have the basics down now.

I've run into a problem with HTMLParser: I use it to collect the URLs contained in <a></a> tags, which works fine, but now I also want to extract every single word that a web browser would display from an HTML file. So not the <br />, <!-- -->, ... tags, but just the text that any visitor can see.

Can I use the HTMLParser for this?

Many thanks in advance!
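
For what it's worth, here is a minimal sketch of one way this could be done, assuming Python 3, where the class lives in `html.parser`. The names `VisibleTextParser`, `SKIPPED_TAGS`, and `visible_words` are made up for illustration, and the set of skipped tags is an assumption about which elements a browser does not render as page text. It does not account for text hidden through CSS.

```python
from html.parser import HTMLParser


class VisibleTextParser(HTMLParser):
    """Collects the words found in text nodes, skipping non-rendered elements."""

    # Assumption: text inside these elements is not shown in the page body.
    SKIPPED_TAGS = {"script", "style", "title"}

    def __init__(self):
        # convert_charrefs=True turns entities such as &amp; into plain text.
        super().__init__(convert_charrefs=True)
        self.words = []
        self._skip_depth = 0  # > 0 while inside a skipped element

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Only text nodes arrive here; tags and comments use other handlers,
        # and handle_comment is not overridden, so comments are dropped.
        if self._skip_depth == 0:
            self.words.extend(data.split())


def visible_words(html_text):
    """Return the whitespace-separated words from the text nodes of html_text."""
    parser = VisibleTextParser()
    parser.feed(html_text)
    parser.close()
    return parser.words


if __name__ == "__main__":
    sample = (
        "<html><head><title>Demo</title></head><body>"
        "<!-- hidden note --><p>Hello<br />big <a href='x.html'>world</a></p>"
        "<script>var a = 1;</script></body></html>"
    )
    print(visible_words(sample))
```

The same subclass could also keep collecting the `href` values in `handle_starttag`, so one pass over the file would give both the links and the words.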
