Big Data Integration
2016-11-04
Shuailong
Presenter: Zou Yanyan
Challenges: the 4 Vs
- Volume
- Velocity
- Variety
- Veracity
Integration steps:
- Schema Mapping
- Record Linkage: blocking -> pairwise matching -> clustering
- Data Fusion: voting -> source quality -> copy detection
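
The record linkage and data fusion pipelines above can be sketched on toy data. This is only a rough illustration, not the method from the talk. The records, the last-name blocking key, the 0.85 similarity threshold, and the per-source accuracy weights are all assumptions made up for the example. Copy detection is not covered here.

```python
from collections import Counter, defaultdict
from difflib import SequenceMatcher
from itertools import combinations

# Hypothetical toy records: (record id, source, name, city).
RECORDS = [
    (0, "A", "Jon Smith", "Boston"),
    (1, "B", "John Smith", "Boston"),
    (2, "C", "John Smith", "Cambridge"),
    (3, "A", "Mary Jones", "Austin"),
    (4, "B", "Mary Jones", "Austin"),
]


def block(records):
    """Blocking: only records sharing a cheap key are ever compared."""
    blocks = defaultdict(list)
    for rec in records:
        key = rec[2].split()[-1].lower()  # assumed key: last token of the name
        blocks[key].append(rec)
    return blocks


def match_pairs(blocks, threshold=0.85):
    """Pairwise matching: string similarity on names within each block."""
    matches = []
    for recs in blocks.values():
        for a, b in combinations(recs, 2):
            score = SequenceMatcher(None, a[2].lower(), b[2].lower()).ratio()
            if score >= threshold:
                matches.append((a[0], b[0]))
    return matches


def cluster(records, matches):
    """Clustering: transitive closure of the matched pairs via union-find."""
    parent = {rec[0]: rec[0] for rec in records}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in matches:
        parent[find(a)] = find(b)

    groups = defaultdict(list)
    for rec in records:
        groups[find(rec[0])].append(rec)
    return sorted(groups.values(), key=lambda g: g[0][0])


def fuse_city(group, accuracy=None):
    """Data fusion: vote on the city, optionally weighting each source."""
    votes = Counter()
    for _, source, _, city in group:
        votes[city] += 1.0 if accuracy is None else accuracy[source]
    return votes.most_common(1)[0][0]


clusters = cluster(RECORDS, match_pairs(block(RECORDS)))
for group in clusters:
    print([rec[0] for rec in group], fuse_city(group))
```

With plain voting, the first cluster resolves to "Boston" (two sources against one). Passing assumed weights such as `{"A": 0.3, "B": 0.3, "C": 0.9}` to `fuse_city` flips the result to "Cambridge", which is the point of moving from voting to source quality: a single reliable source can outweigh several unreliable ones.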