散文简友广场想法

数据挖掘:基于实例的分类器-最近邻KNN

2022-04-13  本文已影响0人  Cache_wood

@[toc]

Instance Based Classifiers

Examples:

Nearest Neighbor Classifiers

Requires three things

To classify an unknown record

Definition of Nearest Neighbor

K-nearest neighbors of a record x are data points that have the k smallest distance to x.

Nearest Neighbor Classification

Compute distance between two points:

Euclidean distance :d(p,q) = \sqrt{\sum_i (p_i-q_i)^2}

Determine the class from nearest neighbor list

Choosing the value of k:

Scaling issues

Problem with Euclidean measure:

k-NN classifiers are lazy learners

Example: PEBLS

PEBLS: Parallel Examplar-Based Learning System(Cost & Salzberg)

Distance between record X and record Y:
\Delta (X,Y) = w_Xw_Y\sum_{i=1}^d d(X_i,Y_i)^2
where: w_X = Number of times X is used for prediction/(Number of times X predicts correctly)

w_X if X makes accurate prediction most of the time.

w_X >1 if X is not reliable for making predicti

上一篇 下一篇

猜你喜欢

热点阅读