Omics Academy

Experience of using Mageck-vispr

2018-10-20  本文已影响0人  OmicsAcademy

Installation

conda create -n mageck-vispr mageck-vispr

After this step is done. You will get some information like this:

#
# To activate this environment, use:
# > source activate mageck-vispr
#
# To deactivate an active environment, use:
# > source deactivate
#

Initiate

mageck-vispr init . --reads ./test_megeck/esc-testdata/reads/ERR37*
# General configuration:

# Path to library design file (csv or tab-separated txt format, columns: id, sequence, gene)
library: yusa_library.csv
# Species to use for linkouts in VISPR (e.g. mus_musculus, homo_sapiens, ...)
species: mus_musculus
# Genome assembly to use for linkouts in VISPR (e.g. hg19, hg38, mm9, mm10, ...)
assembly: mm10

# Configuration of knockout target display in VISPR
targets:
    # if screening genes, set this to true for proper linkouts to GeneMANIA and Ensembl in VISPR
    genes: true
    # file with genes to hide per default in VISPR (optional, one gene per line)
    #controls: ribosomal_genes.txt

# Configuration of sgRNAs
sgrnas:
    # estimate sgRNA knockout efficiency during EM-procedure of MAGeCK-MLE
    update-efficiency: false
    # trim the 5 prime end to get rid of barcode sequences in the reads
    # if a number (instead of AUTO) is specified, use quotes; for example:
    # trim-5: "0"
    trim-5: "23"
    # specify the length of the sgRNAs (without PAM sequence)
    len: "19"
    # sequencing adapter that shall be removed from reads before processing with MAGeCK (optional)
    #adapter: ACGGCTAGCTGA

# ATTENTION: You should and only should choose one type of input file between "samples" and "counts".
# Configuration of samples (Cannot set "counts" as valid at the same time!)
samples:
    # The following sample information was inferred from the given FASTQ files.
    # Adjust it according to your needs (e.g. providing descriptive sample names and grouping replicates together).
    esc1:
        - reads/ERR376998.subsample.fastq
    plasmid:
        - reads/ERR376999.subsample.fastq
    esc2:
        - reads/ERR377000.subsample.fastq

# Provide your own normalized count matrix (Cannot set "samples" as valid at the same time!).
# Support QC from count matrix
# counts: rawcount/rawcount.txt

# Provide mageck count --day0-label (optional). Multiple labels should be seperated with comma.
#day0label: plasmid

# Provide normalization methods and a list of negative control sgrnas. These parameters will be applied to mageck_count, mageck_rra and mageck_mle modules.
# norm_method: control
# control_sgrna: lib/hg19_library_1.aavs1.txt

# Run mle with multi-thread. Default thread number is 1.
# When this parameter is set (e.g., threads: 4), make sure to specify the cores in running snakemake (snakemake --cores 4)
# threads: 1

# Provide a batch matrix if the samples need to be batch corrected (optional).
# The format should be as follows (tab-separated):
# sample          batch   covariate 1 ...
# Sample 1        1       0           ...
# Sample 2        2       0           ...
# Sample 3        2       1           ...
#
# The first column must refer to the samples defined above.
# batchmatrix: path/to/batchmatrix.txt

# Additional parameters to run MLE or RRA
# additional_mle_rra_parameter: --remove-zero both

# Configuration of experiments.
# An experiment defines the comparison that shall be analyzed with MAGeCK.
# You can define as many experiments as you want.
# You can define either MAGeCK-RRA or MAGeCK-MLE experiments, but cannot define both in a single configuration file.
experiments:
    # provide a descriptive name for your experiment (it will show up in VISPR)
    "esc_mle":
        # This is a MAGeCK-MLE experiment.
        # Here, users can either specify a day0label, or a design matrix file (see http://mageck.sourceforge.net for details).
        # if day0label is specified, use an empty file (like /dev/null) as a design matrix file.
        # Sample names in the design matrix must refer to the samples defined above.
        designmatrix: designmatrix.txt
    #"myexperiment2":
        # This is a MAGeCK-RRA experiment.
        # You must specify treatment and control samples.
        # The sample names must refer to the samples defined above in the
        # samples section.
        #treatment:
        #    - A
        #control:
        #    - B
        #    - C
~

上一篇下一篇

猜你喜欢

热点阅读