《机器学习》- chp2.

2013-06-29 本文已影响0人九叔

摘记：

概念学习 是指从有关某个布尔函数的输入输出训练样例中推断出该布尔函数；
从特殊的训练样例中归纳出 一般函数 是机器学习的中心问题；
机器学习的目标是 寻找一个假设 h ，使对于 X 中的所有 x，h(x) = c(x) ；
如果没有更多信息，我们只能假定，对于未知实例 最好的假设 就是与训练数据 最佳拟合 的假设；
Hypothesis h is a conjunction of constraints on attributes: 以 实例的各属性约束的组合 来表示假设

如果某些实例x满足假设h的所有约束，那么h将x分类为正例(h(x)=1)
实例集第i个属性的可取值个数为 $n_{i}$ , 该属性可取值为 $\{x_{i1}, x_{i2}, ..., x_{in_{i}}\}$ ，分类标注（类别）数为 $n_{c}$ ，则：

实例总数（distinct instances）： $\prod n_{i}$
;

概念总数（distinct concepts）： $2^{\prod n_{i}}$ ;
语法不同的假设（syntactically distinct hypotheses）：<img src=http://latex.codecogs.com/gif.latex?\prod%20(n_{i}+2) />;

语义不同的假设（semantically distinct hypotheses）：<img src=http://latex.codecogs.com/gif.latex?1+\prod%20(n_{i}+1) />;

其中，属性 $x_{i}$ 的约束的全部组合为 $\left \langle ?, \varnothing, x_{i1}, x_{i2}, ..., x_{in_{i}} \right \rangle$ ，共 $n_{i}+2$ ，因而语法不同的假设共<img src=http://latex.codecogs.com/gif.latex?\prod%20(n_{i}+2) />，而实际上，只要假设中存在一个∅，则该实例一定是反例，因而去除掉含有∅的重复语法假设，剩余语义不同的假设为<img src=http://latex.codecogs.com/gif.latex?\prod%20(n_{i}+1)+1) />。

假设的一般到特殊序：

《机器学习》- chp2.

摘记：

猜你喜欢

热点阅读