find() and rfind() questions

Question

betatype 0 Newbie Poster

16 Years Ago

First post, so thanks in advance for any help. I am looking to pull the anchor text from a series of links in some HTML. I am doing this with find() and rfind():

linkend=users.find("</a>:")
linkstart=users.rfind(">",0,linkend)

My question is: once I have found the first link, how do I move on from that point? If I ran this in a for loop 50 times, it would just give me the first link 50 times rather than finding all 50 links on the page, which is what I'm after. Thanks for your help, everybody.

python

Edited 12 Years Ago by Reverend Jim because: Fixed formatting

All 3 Replies

bvdet 75 Junior Poster

16 Years Ago

Supply a starting index for str.find() and update it each iteration. Example:

users = '''Users
<a>User 1</a>
<a>User 2</a>
<a>User 3</a>
<a>User 4</a>
<a>User 5</a>
'''

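userList = []
idx = 0  # position to resume searching from
while True:
    linkstart = users.find("<a>", idx)
    # Look for the closing tag only after the opening tag just found
    linkend = users.find("</a>", linkstart)
    if -1 in (linkstart, linkend):
        break  # no more complete <a>...</a> pairs
    # Skip the 3 characters of "<a>" to keep only the anchor text
    userList.append(users[linkstart + 3:linkend])
    # Resume just past the 4 characters of "</a>"
    idx = linkend + 4

print(userList)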

Output:

>>> ['User 1', 'User 2', 'User 3', 'User 4', 'User 5']
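
The same advancing-index idea can be applied to the find()/rfind() pairing from the original question, where the opening tags carry attributes and so cannot be matched with a fixed "<a>". The following is only a sketch under assumptions: the function name `anchor_texts` and the sample string are made up for illustration, and it assumes the anchor text itself contains no nested tags.

```python
def anchor_texts(html, close_tag="</a>"):
    """Collect the text in front of each closing tag using find() and rfind()."""
    texts = []
    idx = 0  # position to resume searching from
    while True:
        linkend = html.find(close_tag, idx)
        if linkend == -1:
            break  # no more closing tags
        # Search backwards, but no further than idx, for the ">" that
        # ends the opening tag of this link
        linkstart = html.rfind(">", idx, linkend)
        if linkstart != -1:
            texts.append(html[linkstart + 1:linkend])
        # Move past this closing tag so the next pass finds the next link
        idx = linkend + len(close_tag)
    return texts


# Hypothetical sample input, not taken from the original page
sample = '<a href="/u/1">User 1</a>: hi <a href="/u/2">User 2</a>: bye'
print(anchor_texts(sample))
```

Passing `idx` as the lower bound to rfind() keeps the backwards search from landing inside a link that was already processed.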

betatype 0 Newbie Poster · Answer 1 · 2009-01-05T03:29:23+00:00

Genius! Thanks so much.

mn_kthompson 3 Junior Poster · Answer 2 · 2009-01-05T03:33:02+00:00

Is this for a homework assignment? I just want to know if you're allowed to use any Python tool you want, or if you're limited to what you've covered in class.

I'm sure there is a way to do it with find(), but why not check out this tutorial on HTMLParser? It includes an example of how to do what you're trying to do.
http://cis.poly.edu/cs912/parsing.txt
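
For reference, here is a rough sketch of the HTMLParser approach using the Python 3 standard library (`html.parser`). This is not the example from the linked tutorial; the class name `AnchorTextParser` and the sample markup are assumptions chosen for illustration.

```python
from html.parser import HTMLParser


class AnchorTextParser(HTMLParser):
    """Collects the text found between <a ...> and </a> tags."""

    def __init__(self):
        super().__init__()
        self.in_anchor = False
        self.texts = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.in_anchor = True
            self.texts.append("")  # start a new anchor text

    def handle_endtag(self, tag):
        if tag == "a":
            self.in_anchor = False

    def handle_data(self, data):
        # Text inside nested tags such as <b> is still collected
        if self.in_anchor:
            self.texts[-1] += data


# Hypothetical sample input
parser = AnchorTextParser()
parser.feed('<p><a href="/u/1">User 1</a>: hi <a href="/u/2">User <b>2</b></a></p>')
parser.close()
print(parser.texts)
```

Unlike the find()-based loop, this does not depend on the exact spelling of the opening tag, so attributes and nested markup inside the link are handled by the parser.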
