生物信息数据科学

8. 《Bioinformatics Data Skills》之

2021-05-26  本文已影响0人  DataScience

工程的笔记有很多选择,作者推荐使用纯文本的方式记录。优点在于可以使用命令行进行读写与检索,以及方便移植与长期保存。
纯文本中Markdown是一种非常出色的记录方式。历史悠久,在互联网出现前就已经应用于邮件的格式渲染。Markdown结构清晰,易读易写,还可以转化为html,PDF, word等格式,下面介绍一下markdown的基本用法与渲染方式。

markdown基本格式

建议参考简书官方提供的Markdown教程,清楚明白够用,这里就不啰嗦了。下面补充2点

  1. 除了6级标题外,还可以以下面这种方式生成2级标题
Header level 1
===========
Header level 2
-------------------

效果:

Header level 1

Header level 2

  1. 在列表下面生成代码需要增加缩进(4个空格键或者1个tab键),同样适用于代码块。
    举例来说:
- 测试
    ```
    hello world
    ```

效果为:

使用Pandoc进行markdown渲染

Pandoc是非常流行的markdown格式渲染工具,可以将markdown转换为HTML等格式。

效果展示:
输入:

# *Zea Mays* SNP Calling
We sequenced three lines of *zea mays*, using paired-end
sequencing. This sequencing was done by our sequencing core and we
received the data on 2013-05-10. Each variety should have **two**
sequences files, with suffixes `_R1.fastq` and `_R2.fastq`, indicating
which member of the pair it is.
## Sequencing Files
All raw FASTQ sequences are in `data/seqs/`:
$ find data/seqs -name "*.fastq"
data/seqs/zmaysA_R1.fastq
data/seqs/zmaysA_R2.fastq
data/seqs/zmaysB_R1.fastq
data/seqs/zmaysB_R2.fastq
data/seqs/zmaysC_R1.fastq
data/seqs/zmaysC_R2.fastq
## Quality Control Steps
After the sequencing data was received, our first stage of analysis
was to ensure the sequences were high quality. We ran each of the
three lines' two paired-end FASTQ files through a quality diagnostic
and control pipeline. Our planned pipeline is:
1. Create base quality diagnostic graphs.
2. Check reads for adapter sequences.
3. Trim adapter sequences.
4. Trim poor quality bases.
Recommended trimming programs:
- Trimmomatic
- Scythe

HTML输出:


Snipaste_2021-05-26_11-22-33.png
上一篇 下一篇

猜你喜欢

热点阅读