Parsing a FASTA file

2017-07-23  南山欧巴
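sequences = {}   # maps accession -> full sequence
ac = None        # accession of the record currently being read
chunks = []      # sequence lines of the current record
with open("swissprot.fasta") as handle:
    for line in handle:
        line = line.strip()
        if line.startswith(">"):
            # A new header closes the previous record, if there was one
            if ac is not None:
                sequences[ac] = "".join(chunks)
            # UniProt-style header: >sp|P62258|1433E_HUMAN ...
            ac = line.split("|")[1]
            chunks = []
        elif line:
            chunks.append(line)
# Store the last record, which has no header after it
if ac is not None:
    sequences[ac] = "".join(chunks)
print(list(sequences.keys()))
print(sequences["P62258"])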
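
The same loop can be wrapped in a function so it can be reused and tried out without the real file. This is only a minimal sketch and not part of the original post: the name `parse_fasta` and the two sample records are made up for illustration, and it assumes UniProt-style headers of the form `>db|ACCESSION|NAME ...`.

```python
import io


def parse_fasta(lines):
    """Return a dict mapping accession -> sequence.

    Hypothetical helper; assumes headers look like '>db|ACCESSION|NAME ...'.
    `lines` can be any iterable of strings, such as an open file.
    """
    sequences = {}
    ac = None      # accession of the record being read
    chunks = []    # sequence lines collected for that record
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            # A new header closes the previous record, if any
            if ac is not None:
                sequences[ac] = "".join(chunks)
            ac = line.split("|")[1]
            chunks = []
        elif line:
            chunks.append(line)
    # The last record has no header after it, so store it here
    if ac is not None:
        sequences[ac] = "".join(chunks)
    return sequences


# Made-up sample data standing in for a real FASTA file
sample = io.StringIO(
    ">sp|X00001|DEMO1_TEST made-up record\n"
    "MDDRE\n"
    "DLVYQ\n"
    ">sp|X00002|DEMO2_TEST made-up record\n"
    "AKLAE\n"
)
records = parse_fasta(sample)
print(records)
```

With a real file, the call would look like `with open("swissprot.fasta") as handle: records = parse_fasta(handle)`. Collecting the lines in a list and joining them once per record avoids repeated string concatenation on long sequences.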
