fasta.py 文件源码

python
阅读 31 收藏 0 点赞 0 评论 0

项目:RSeQC 作者: MonashBioinformaticsPlatform 项目源码 文件源码
def cal_entropy(self,l=3):
        '''calculate entropy for each sequence'''
        for (id,seq) in self.seqs.items():
            entropy = 0
            dna_chars_uniq = FrameKmer.all_possible_kmer(l)
            dna_len = len(seq)

            for c in dna_chars_uniq:
                if 'N' in c:
                    continue
                prop = seq.count(c)/(1.0*dna_len)
                if prop ==0:
                    continue
                information = numpy.log2(1.0/prop)
                entropy += prop * information
            yield(id, entropy)
评论列表
文章目录


问题


面经


文章

微信
公众号

扫码关注公众号