心理想法简友广场

数据挖掘:集成方法Bagging和Boosting

2022-04-12  本文已影响0人  Cache_wood

@[toc]

Rule-Based Classifier

Classify records by using a collection of “if…then…” rules
Rule: (Condition) → y

A rule r covers an instance x if the attributes of the instance satisfy the condition of the rule.

Rule Coverage and Accuracy

Coverage of a rule: Fraction of records that satisfy the antecedent of a rule.

Accuracy of a rule: Fraction of records that satisfy both the antecedent and consequent of a rule.

Characteristics of Rule-Based Classifier

Mutually exclusive rules

Exhaustive rules

Rules are mutually exclusive and exhaustive
Rule set contains as much information as the tree

Effect of Rule Simplification

Rules are no longer mutually exclusive

Rules are no longer exhaustive

Ordered Rule Set

Rules are rank ordered according to their priority

When a test record is presented to the classifier

Rule Ordering Schemes

Rule-based ordering

Class-based ordering

Building Classification Rules

Direct Method

Indirect Method

Direct Method: Sequential Covering

  1. Start from an empty rule
  2. Grow a rule using the Learn-One-Rule function
  3. Remove training records covered by the rule
  4. Repeat Step (2) and (3) until stopping criterion is met

Aspects of Sequential Covering

Rule Growing (Examples)

CN2 Algorithm:

RIPPER Algorithm:

Instance Elimination

Why do we need to eliminate instances?

Why do we remove positive instances?

Why do we remove negative instances?

Stopping Criterion and Rule Pruning

Stopping criterion

Rule Pruning

Summary of Direct Method

Indirect Methods C4.5 rules

Instead of ordering the rules, order subsets of rules (class ordering)

Advantages of Rule-Based Classifiers

上一篇 下一篇

猜你喜欢

热点阅读