10X genomics bam文件的格式
2020-11-19 本文已影响0人
生信编程日常
samtools view -@ 8 possorted_genome_bam.bam |head
| Line | Tag | Description | Example |
|---|---|---|---|
| 1 | NA | Query name | A00682:284:HYVYMDSXX:4:1219:23954:12164 |
| 2 | NA | Flag | 0 |
| 3 | NA | Reference name | 1 |
| 4 | NA | Position | 12434 |
| 5 | NA | Mapping quality | 255 |
| 6 | NA | Cigar string | 130M286409N20M |
| 7 | NA | Reference name of mate | * |
| 8 | NA | Position of mate | 0 |
| 9 | NA | Template length | 0 |
| 10 | NA | Sequence | CTCCAAAGA....ATCAAAAAAAAAAAAAA |
| 11 | NA | Sequence quality | FFFFFFFFFFFFFFFFFF.... |
| 12 | NH | Number of reported alignments for query | NH:i:1 |
| 13 | HI | Query hit index | HI:i:1 |
| 14 | AS | Alignment score | AS:i:135 |
| 15 | nM | Number of mismatches per pair | nM:i:5 |
| 16 | RE | Region type (E = exonic, N = intronic, I = intergenic) | RE:A:I |
| 17 | BC | Sample index read | BC:Z:TCGCCAGC |
| 18 | QT | Sample index read quality | QT:Z:FFFFFFFF |
| 19 | CR | Cell barcode | CR:Z:GCGTGCACATAGAAAC |
| 20 | CY | Cell barcode read quality | CY:Z:FFFF:FFFFFF,FFFF |
| 21 | CB | Cell barcode that is error-corrected and confirmed against a list of known-good barcode sequences | CB:Z:GCGTGCACATAGAAAC-1 |
| 22 | UR | Unique Molecular Identifier (UMI) | UR:Z:CGCTCACGCATA |
| 23 | UY | UMI read quality | UY:Z::FFFFFFFFFFF |
| 24 | UB | UMI that is error-corrected among other molecular barcodes with the same cellular barcode and gene alignment | UB:Z:CGCTCACGCATA |
| 25 | RG | Read group | RG:Z:Day1:0:1:HWJ5VDSXX:4 |
欢迎关注~
参考:
https://davetang.org/muse/2018/06/06/10x-single-cell-bam-files/
https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/output/bam