生物信息学生信工具走进转录组

misa和primer3结合快速设计SSR引物

2018-06-08  本文已影响84人  灵动的小猪

文件下载

下载misa,同时将misa.ini放在misa的同一个文件夹下,然后下载三个perl脚本get_set_trimmer.plp3_in.plp3_out.pl,建议下载到同一个文件夹下。

介绍

misa.ini :
p3_in.pl :输入 misa.pl 的输出结果(file.fasta.misa),将引物设计的参数文件(模板,产物长度,目标区域等)导入到一个以“p3in”为后缀的文件中。
Get_est_trimmer.pl,针对EST序列,可以除去EST序列中短的序列和两端不明确的碱基。
p3_out.pl,对primer3产生的文件进行提取合,得到最后的结果文件 filename.result。

########################

perl misa.pl  Zea_mays.AGPv4.dna.chromosome.1.fa

生成的文件说明

Zea_mays.AGPv4.dna.chromosome.1.fa.misa:以表格的形式列出微卫星的类型和位点;
Zea_mays.AGPv4.dna.chromosome.1.fa.statistics:统计微卫星的类型和频数。

因为如果直接使用p3_in.pl进行转换生成的文件会比较大,所以下面多了几步#提取misa文件中的染色体编号和开始,结束的位置,两边各延伸150bp,生成一个bed文件。

cat Zea_mays.AGPv4.dna.chromosome.1.fa.misa |tr "_" "\t" |awk 'NR>1 {print $1"\t"$9-150"\t"\10+150}' Zea_mays.AGPv4.dna.chromosome.1_ssr.bed  #使用bedtools工具提取重复序列
bedtools getfasta -name+ -fi Zea_mays.AGPv4.dna.chromosome.1.fa -bed Zea_mays.AGPv4.dna.chromosome.1_ssr.bed >Zea_mays.AGPv4.dna.chromosome.1_ssr.fa

再进行一次misa查找一次

perl misa.pl Zea_mays.AGPv4.dna.chromosome.1_ssr.fa
print OUT "PRIMER_SEQUENCE_ID=$id"."_$ssr_nr\nSEQUENCE=$seq\n";

改为

print OUT "SEQUENCE_ID=$id"."_$ssr_nr\nSEQUENCE_TEMPLATE=$seq\n";

调用p3_in.pl

perl p3_in.pl Zea_mays.AGPv4.dna.chromosome.1_ssr.fa #然后使用primer3进行设计引物
~/software/primer3-2.4.0/src/primer3_core -default_version=1 - output=Zea_mays.AGPv4.dna.chromosome.1_ssr.fa.p3put Zea_mays.AGPv4.dna.chromosome.1_ssr.fa.p3in
print OUT "ID\tSSR nr.\tSSR type\tSSR\tsize\tstart\tend\t";
print OUT "FORWARD PRIMER1 (5'-3')\tTm(癈)\tsize\tREVERSE PRIMER1 (5'-3')\tTm(癈)\tsize\tPRODUCT1 size (bp)\tstart (bp)\tend (bp)\t";
print OUT "FORWARD PRIMER2 (5'-3')\tTm(癈)\tsize\tREVERSE PRIMER2 (5'-3')\tTm(癈)\tsize\tPRODUCT2 size (bp)\tstart (bp)\tend (bp)\t"; 
print OUT "FORWARD PRIMER3 (5'-3')\tTm(癈)\tsize\tREVERSE PRIMER3 (5'-3')\tTm(癈)\tsize\tPRODUCT3 size (bp)\tstart (bp)\tend (bp)\n";

改为

print OUT "ID\tSSR nr.\tSSR type\tSSR\tsize\tstart\tend\t";
print OUT "FORWARD PRIMER0 (5'-3')\tTm(癈)\tsize\tREVERSE PRIMER0 (5'-3')\tTm(癈)\tsize\tPRODUCT0 size (bp)\tstart (bp)\tend (bp)\t";
print OUT "FORWARD PRIMER1 (5'-3')\tTm(癈)\tsize\tREVERSE PRIMER1 (5'-3')\tTm(癈)\tsize\tPRODUCT1 size (bp)\tstart (bp)\tend (bp)\t";
print OUT "FORWARD PRIMER2 (5'-3')\tTm(癈)\tsize\tREVERSE PRIMER2 (5'-3')\tTm(癈)\tsize\tPRODUCT2 size (bp)\tstart (bp)\tend (bp)\t";
print OUT "FORWARD PRIMER3 (5'-3')\tTm(癈)\tsize\tREVERSE PRIMER3 (5'-3')\tTm(癈)\tsize\tPRODUCT3 size (bp)\tstart (bp)\tend (bp)\t";
print OUT "FORWARD PRIMER4 (5'-3')\tTm(癈)\tsize\tREVERSE PRIMER4 (5'-3')\tTm(癈)\tsize\tPRODUCT4 size (bp)\tstart (bp)\tend (bp)\t";
/PRIMER_LEFT_SEQUENCE=(.*)/ || do {$count_failed++;print OUT "$misa\n"; next};  my $info = "$1\t";    
/PRIMER_LEFT_TM=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT=\d+,(\d+)/; $info .= "$1\t";   
/PRIMER_RIGHT_SEQUENCE=(.*)/;  $info .= "$1\t";  
/PRIMER_RIGHT_TM=(.*)/; $info .= "$1\t";  
/PRIMER_RIGHT=\d+,(\d+)/; $info .= "$1\t";    
/PRIMER_PRODUCT_SIZE=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT=(\d+),\d+/; $info .= "$1\t"; 
/PRIMER_RIGHT=(\d+),\d+/; $info .= "$1\t";      
/PRIMER_LEFT_1_SEQUENCE=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_1_TM=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_1=\d+,(\d+)/; $info .= "$1\t";      
/PRIMER_RIGHT_1_SEQUENCE=(.*)/;  $info .= "$1\t";  
/PRIMER_RIGHT_1_TM=(.*)/; $info .= "$1\t";  
/PRIMER_RIGHT_1=\d+,(\d+)/; $info .= "$1\t";    
/PRIMER_PRODUCT_SIZE_1=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_1=(\d+),\d+/; $info .= "$1\t";  
/PRIMER_RIGHT_1=(\d+),\d+/; $info .= "$1\t";      
/PRIMER_LEFT_2_SEQUENCE=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_2_TM=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_2=\d+,(\d+)/; $info .= "$1\t";      
/PRIMER_RIGHT_2_SEQUENCE=(.*)/;  $info .= "$1\t";  
/PRIMER_RIGHT_2_TM=(.*)/; $info .= "$1\t";  
/PRIMER_RIGHT_2=\d+,(\d+)/; $info .= "$1\t";    
/PRIMER_PRODUCT_SIZE_2=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_2=(\d+),\d+/; $info .= "$1\t";  
/PRIMER_RIGHT_2=(\d+),\d+/; $info .= "$1";

改为

/PRIMER_LEFT_0_SEQUENCE=(.*)/ || do {$count_failed++;print OUT "$misa\n"; next};  my $info = "$1\t";    
/PRIMER_LEFT_0_TM=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_0=\d+,(\d+)/; $info .= "$1\t";    
/PRIMER_RIGHT_0_SEQUENCE=(.*)/;  $info .= "$1\t";  
/PRIMER_RIGHT_0_TM=(.*)/; $info .= "$1\t";  
/PRIMER_RIGHT_0=\d+,(\d+)/; $info .= "$1\t";    
/PRIMER_PRODUCT_SIZE_0=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_0=(\d+),\d+/; $info .= "$1\t";  
/PRIMER_RIGHT_0=(\d+),\d+/; $info .= "$1\t";      
/PRIMER_LEFT_1_SEQUENCE=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_1_TM=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_1=\d+,(\d+)/; $info .= "$1\t";      
/PRIMER_RIGHT_1_SEQUENCE=(.*)/;  $info .= "$1\t";  
/PRIMER_RIGHT_1_TM=(.*)/; $info .= "$1\t";  
/PRIMER_RIGHT_1=\d+,(\d+)/; $info .= "$1\t";    
/PRIMER_PRODUCT_SIZE_1=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_1=(\d+),\d+/; $info .= "$1\t";  
/PRIMER_RIGHT_1=(\d+),\d+/; $info .= "$1\t";      
/PRIMER_LEFT_2_SEQUENCE=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_2_TM=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_2=\d+,(\d+)/; $info .= "$1\t";      
/PRIMER_RIGHT_2_SEQUENCE=(.*)/;  $info .= "$1\t";  
/PRIMER_RIGHT_2_TM=(.*)/; $info .= "$1\t";  
/PRIMER_RIGHT_2=\d+,(\d+)/; $info .= "$1\t";    
/PRIMER_PRODUCT_SIZE_2=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_2=(\d+),\d+/; $info .= "$1\t";  
/PRIMER_RIGHT_2=(\d+),\d+/; $info .= "$1";    
/PRIMER_LEFT_3_SEQUENCE=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_3_TM=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_3=\d+,(\d+)/; $info .= "$1\t";  
/PRIMER_RIGHT_3_SEQUENCE=(.*)/;  $info .= "$1\t";  
/PRIMER_RIGHT_3_TM=(.*)/; $info .= "$1\t";  
/PRIMER_RIGHT_3=\d+,(\d+)/; $info .= "$1\t";  
/PRIMER_PRODUCT_SIZE_3=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_3=(\d+),\d+/; $info .= "$1\t";  
/PRIMER_RIGHT_3=(\d+),\d+/; $info .= "$1";  
/PRIMER_LEFT_4_SEQUENCE=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_4_TM=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_4=\d+,(\d+)/; $info .= "$1\t";  
/PRIMER_RIGHT_4_SEQUENCE=(.*)/;  $info .= "$1\t";  
/PRIMER_RIGHT_4_TM=(.*)/; $info .= "$1\t";  
/PRIMER_RIGHT_4=\d+,(\d+)/; $info .= "$1\t";  
/PRIMER_PRODUCT_SIZE_4=(.*)/; $info .= "$1\t";  
/PRIMER_LEFT_4=(\d+),\d+/; $info .= "$1\t";`  
/PRIMER_RIGHT_4=(\d+),\d+/; $info .= "$1";
perl p3_out.pl Zea_mays.AGPv4.dna.chromosome.1_ssr.fa.p3out Zea_mays.AGPv4.dna.chromosome.1_ssr.fa.misa
上一篇 下一篇

猜你喜欢

热点阅读