pandas.dataframe/ xgb.DMatrix设置h
2018-04-24 本文已影响878人
Ten_Minutes
1.dataframe经常由csv得到,可以在用pd.read_csv()时通过参数names设置df的名称,比如:Cov = pd.read_csv("path/to/file.txt", sep='\t' ,names = ["Sequence", "Start", "End", "Coverage"])
2.也可以把header=None,然后再设置columns属性,比如:
Cov = pd.read_csv("path/to/file.txt", sep='\t', header=None)
Cov.columns = ["Sequence", "Start", "End", "Coverage"]
3.pandas.DataFrame(data=None, index=None, columns=None, dtype=None, copy=False)
4.xgb.Dmatrix()也经常需要设置header,特别是在调用predict方法之前,如果test集里没有header,则会报错,此时可以通过xgb.Dmatrix()参数中的feature_names属性设置header,比如:
columns=['fast_text_socre','testsim_score1','testsim_score2','testsim_score3','tradeid_minus'] test_X_new = xgb.DMatrix(test_X.values,feature_names=columns)