TwoSampleMR 遇到的坑

2020-06-09 本文已影响0人 MaiyuAmmy

第一次尝试医学统计常用的MR，我选择了比较多用的R包，TwoSampleMR, 这个包还有一个webApp -- MRbase.所有流程基本和网上可以搜到的一样，这里推荐这个网址：https://www.jianshu.com/p/253309a571aa。

我基本每一步的坑都遇到了，很感谢作者还有另外微信公众号的作者-陈澜澜一直帮我处理报错。记录一下，以防还有小伙伴遇到。

1.读取包中文件报错：

Error in data.table::fread(filename, header = TRUE, sep = sep) :

Input is empty or only contains BOM or terminal control characters

The exact path to the file will be different on everyone's computer, but it can be located like this:

```{r eval=TRUE}

bmi_file<-system.file("extdata","bmi.txt",package="TwoSampleMR")

```

You can read the data in like this:

```{r eval=TRUE}

bmi_exp_dat<-read_exposure_data(bmi_file)

head(bmi_exp_dat)

```

解决方法：推断exposure中的SNP 没有做clump处理，导致全部读入数据过大。

解决方法：

[if !supportLists]（1） [endif]分染色体

[if !supportLists]（2） [endif]或者使用ieugwasr包的ld_clump_local()这个函数

[if !supportLists]（3） [endif]用ieugwasr的那个函数需要你在电脑上下载好reference panal和plink软件的

解决方法：Google账号登录，国内可能被墙了。

解决方法：需要下载对应的包，plink bin和参考基因组版本号。

命令行

c<-ld_clump_local(b,clump_kb=10000, clump_r2=0.001, clump_p=1,bfile="/Projects/reference_genome/data_maf0.01_rs_ref.tgz",plink_bin="software/plink")

解决方法：注意这个文件b文件应该只包含两列，并且列名分别是rsid和pval，ld_clump_local和clump_data不兼容

这个专题我没有继续更新，因为我看目前网上的资料已经很丰富了。这里推荐两个公众号

--bio生物信息

--生信与临床

其中关于MR部分很详细，还有GWAS文件中beta，zscore等换算。

顺利的话只有三行，不顺利的话，每行都有错。一路踩坑，才爬过来。希望大家可以借鉴。