Py100Skills

[Py014] 剔除含N序列

2018-10-27  本文已影响16人  安哥生个信

剔除所有含有N碱基的序列

from Bio import SeqIO

no_N_records = [record for record in SeqIO.parse('test.fa','fasta') if 'N' not in record.seq]

SeqIO.write(no_N_records, 'out.fa','fasta')

剔除全是N碱基的序列

from Bio import SeqIO

not_only_N_records = [record for record in SeqIO.parse('test.fa','fasta') if ''.join(set(record.seq))!='N']

SeqIO.write(not_only_N_records, 'out.fa','fasta')
上一篇 下一篇

猜你喜欢

热点阅读