2021-04-27 Stop Using All Your F

2021-04-27  本文已影响0人  春生阁

A real-world dataset contains a lot of relevant and redundant features. It is true that the presence of more data in terms of the number of instances or rows leads to training a better machine learning model.

Before proceeding, we must know why it’s not recommended to use all sets of features. To train a robust machine learning model, the data must be free of redundant features. There are various reasons why feature selection is important:

So it’s essential to remove the irrelevant features from the dataset. A data scientist should be selective in terms of features he/she is using for model training. Selecting all combinations of features, then picking the best set of features is a polynomial solution. There are various techniques to select the best set of features, you need to know some feature selection techniques.

上一篇 下一篇

猜你喜欢

热点阅读