计算基因组大小(除去N)

2021-09-04  本文已影响0人  余绕
#coding=utf-8
import sys
aList=[]
fa_file = sys.argv[1]
with open(fa_file,'r') as f:
    for line in f:
        line = line.strip()
        line = line.upper()
        if not line.startswith(">"):
            baseA = line.count("A")
            baseT = line.count("T")
            baseC = line.count("C")
            baseG = line.count("G")
            aList.extend([baseA, baseT, baseC, baseG])
            # print(aList)
    print("effective_genome_size =", sum(aList))

运行脚本:

python  genomeSize.py m38

参考:https://www.jianshu.com/p/bcd70b8c6bce

上一篇 下一篇

猜你喜欢

热点阅读