MultipleSeqAlignment object in biopython

Question

weblover 0 Junior Poster

13 Years Ago

Hi,

I have an alignment file generated by ClustalX that contains 3 species:

AAAACGT Alpha
AAA-CGT Beta
AAAAGGT Gamma

I already sliced the alignment using Biopython's built-in indexing, align[:,:4],

but now when I print the result I get:

AAAA Alpha
AAA- Beta
AAAA Gamma

The question is: how can I get only the sub-alignment without the species names (like a multiple alignment object without species names), e.g. something like this:

AAAA
AAA-
AAAA

I tried align[:,:4].seq but it did not work.

any help would be appreciated

thank you

python

Edited 13 Years Ago by weblover


TrustyTony 888 ex-Moderator

13 Years Ago

From the docs at http://biopython.org/DIST/docs/api/Bio.Align.MultipleSeqAlignment-class.html
it looks like you should just slice the part you want:

print(align[:, :1])

I cannot test this without any data, even with Biopython installed.

Edited 13 Years Ago by TrustyTony

TrustyTony 888 ex-Moderator

13 Years Ago

Doesn't

print(align[:, 3:11])

work?


weblover 0 Junior Poster · Answer 1 · 2012-05-09T09:42:29+00:00

Thank you.

I already found this, but how can I take, for example, the columns between 4 and 10 without getting the species names?

weblover 0 Junior Poster · Answer 2 · 2012-05-10T08:15:28+00:00

Mmmm, no, it prints the needed columns but with the species names ...

TrustyTony 888 ex-Moderator Team Colleague Featured Poster · Answer 3 · 2012-05-10T17:18:05+00:00

Maybe you can use line.split(None, 1)[0] to keep only the first word of each line and drop the name, but there must be a proper way too.

data = """
AAAACGT Alpha
AAA-CGT Beta
AAAAGGT Gamma""".splitlines()

print('\n'.join(line.split(None, 1)[0] for line in data if ' ' in line))
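For what it's worth, a proper way probably does exist: as far as I can tell from the Biopython docs, iterating over a MultipleSeqAlignment yields one SeqRecord per row, and each record carries its letters in .seq and its name in .id. That would also explain why align[:,:4].seq fails, since .seq belongs to the rows and not to the alignment. A minimal sketch, assuming a recent Biopython where Seq needs no alphabet argument; the alignment is built by hand here only to mirror the three rows from the question (with a real file it would be loaded with AlignIO.read instead):

```python
from Bio.Align import MultipleSeqAlignment
from Bio.Seq import Seq
from Bio.SeqRecord import SeqRecord

# Hand-built stand-in for the ClustalX alignment from the question.
align = MultipleSeqAlignment([
    SeqRecord(Seq("AAAACGT"), id="Alpha"),
    SeqRecord(Seq("AAA-CGT"), id="Beta"),
    SeqRecord(Seq("AAAAGGT"), id="Gamma"),
])

# Slice the first four columns; the result is still an alignment.
sub = align[:, :4]

# Each row is a SeqRecord, so take only its sequence and ignore the id.
rows = [str(record.seq) for record in sub]

print("\n".join(rows))
```

The result is a plain list of strings rather than an alignment object; the names stay available as record.id if they are needed later.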

weblover 0 Junior Poster · Answer 4 · 2012-05-11T09:50:31+00:00

You cannot do this, because the result of align[:,3:11] contains something like this:

SingleLetterAlphabet() alignment with 3 rows and 7 columns
AAAACGT Alpha
AAA-CGT Beta
AAAAGGT Gamma

so the split will not work.
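If only the printed text is available, the header is the first line, so one possible workaround is to drop it before splitting. A minimal sketch, assuming the printed form looks like the output quoted above (the variable names are made up for illustration):

```python
# Text as it would appear when printing the alignment (copied from above).
printed = """SingleLetterAlphabet() alignment with 3 rows and 7 columns
AAAACGT Alpha
AAA-CGT Beta
AAAAGGT Gamma"""

# Skip the header line, then keep only the first word of every row.
body = printed.splitlines()[1:]
seqs = [line.split(None, 1)[0] for line in body if line.strip()]

print("\n".join(seqs))
```

Parsing printed output is fragile, though: as far as I know Biopython abbreviates long rows when printing, so reading the sequence from each row object is the safer route.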