基因组生信工具

RepeatMasker

2016-04-23  本文已影响902人  6有才

What

RepeatMasker is a program that screens DNA sequences for interspersed repeats and low complexity DNA sequences. The output of the program is a detailed annotation of the repeats that are present in the query sequence as well as a modified version of the query sequence in which all the annotated repeats have been masked (default: replaced by Ns). Currently over 56% of human genomic sequence is identified and masked by the program. Sequence comparisons in RepeatMasker are performed by one of several popular search engines including nhmmer, cross_match, ABBlast/WUBlast, RMBlast and Decypher. RepeatMasker makes use of curated libraries of repeats and currently supports Dfam ( profile HMM library derived from Repbase sequences ) and Repbase, a service of the Genetic Information Research Institute.

在线服务

本地安装RepeatMasker

本地安装RepeatMasker,除了需要RepeatMasker主程序外,还需要TRF(Tandem Repeats Finder)、序列搜索引擎(以RMBlast为例)以及Repbase数据库。

wget http://tandem.bu.edu/trf/downloads/trf407b.linux
sudo mv trf407b.linux /usr/local/bin/trf # 记住这个地址1
sudo /usr/local/bin/trf
wget ftp://ftp.ncbi.nlm.nih.gov/blast/executables/rmblast/2.2.28/ncbi-rmblastn-2.2.28-src.tar.gz
tar -zvcf ncbi-rmblastn-2.2.28-src.tar.gz
cd ncbi-rmblastn-2.2.28-src/c++
./configure --with-mt --prefix=/usr/local/rmblast --without-debug
make
sudo make install
# 记住安装RMBlast的地址2, */ncbi-rmblastn-2.2.28-src/c++/GCC480-ReleaseMT64/bin
wget http://www.repeatmasker.org/RepeatMasker-open-4-0-6.tar.gz
cd RepeatMasker
perl configure
<PRESS ENTER TO CONTINUE> # 回车继续
Enter path [ ]: # 输入perl程序路径
Enter path [ ]: # 输入RepeatMasker要安装的路径
Enter path [ ]: # 输入TRF路径(地址1)

Add a Search Engine: # 选择一个搜索引擎(需要事先安装好),并输入引擎路径(地址2)
1. CrossMatch: [ Un-configured ]
2. RMBlast - NCBI Blast with RepeatMasker extensions: [ Un-configured ]
3. WUBlast/ABBlast (required by DupMasker): [ Un-configured ]
4. HMMER3.1 & DFAM: [ Un-configured ]
5. Done
Do you want RMBlast to be your default # 设置默认搜索引擎
search engine for Repeatmasker? (Y/N)  [ Y ]: 
# 可以安装多个引擎,完成后按5
Congratulations!  RepeatMasker is now ready to use. # 提示已经安装完成
# RepeatMasker已经安装完成,下一步将之前下载解压的Repbase文件COPY到RepeatMasker安装路径下的Libraries文件夹中即可
RepeatMasker -species human test.fa

更多详细内容请期待后续更新!

上一篇下一篇

猜你喜欢

热点阅读