I'm trying to get the content of a web page that is written in pdf format.
The following code worked very well for me when I tried to read a regular web page, but it prints all kinds of weird letters when I try it on a pdf page like this one:
url='http://fetac.ie/MODULES/D20120.pdf'
content=urllib2.urlopen(url).read()
print content
Any suggestion? (a brief code example will be great, thanks!)