论文阅读“Multi-view clustering via a

2022-04-28  本文已影响0人  掉了西红柿皮_Kee

Li Y, Liao H. Multi-view clustering via adversarial view embedding and adaptive view fusion[J]. Applied Intelligence, 2021, 51(3): 1201-1212.

摘要逻辑简记

当前任务背景介绍:

Multi-view clustering, which explores complementarity and consistency among multiple distinct feature sets to boost clustering performance, is becoming more and more useful in many real-world applications.

首先介绍了传统多视图方法的步骤:

Traditional approaches usually map multiple views to a unified embedding, in which some weighted mechanisms are utilized to measure the importance of each view. The embedding, serving as a clustering-friendly representation, is then sent to extra clustering algorithms.

接着指出其缺陷:

(1)However, a unified embedding cannot cover both complementarity and consistency among views and the weighted scheme measuring the importance of each view as a whole ignores the differences of features in each view. (忽略了各视图的特有信息)
(2) Moreover, because of lacking in proper grouping structure constraint imposed on the unified embedding, it will lead to just multi-view representation learned, which is not clustering friendly. (学到的信息并不是聚类友好的)

提出本文的方法:

In this paper, we propose a novel multi-view clustering method to alleviate the above problems.

阐述该方法的具体做法:

By dividing the embedding of a view into unified and view-specific vectors explicitly, complementarity and consistency can be reflected.
Besides, an adversarial learning process is developed to force the above embeddings to be non-trivial.
Then a fusion strategy is automatically learned, which will adaptively adjust weights for all the features in each view.
Finally, a Kullback-Liebler (KL) divergence based objective is developed to constrain the fused embedding for clustering friendly representation learning and to conduct clustering.

关键词

多视图聚类
对抗视图嵌入
自适应视图融合
聚类友好的表示学习

模型浅析

该模型分为5个部分:
view-specific encoder networks,
view reconstruction decoder networks,
view classification network,
adaptive multi-view fusion network,
a KL divergence based clustering modular

给定多视图数据X=\{X^v\},v=1,\cdots,l,并使用X^v=\{x_k^v\}_{k=1}^n \in R^{d_v \times n}表示数据中的第v个视图。


整体上来说,主要的创新包括两点:首先是使用一个编码器生成两个嵌入向量,区分了view-common和view-specific两种表示,其次此基础上引入了一个视图分类器,形成了对抗的视图学习过程。思想上的创新大于技术上的创新。所谓自适应的权重向量学习,讲了一大堆,最后融合成了一个。且e的主体部分还是基于view-specific的拼接。从这个角度来讲,我觉得创新不如将其迁移到VAE极其变种模型中,使得\mu充当view-specific,而大局的log~\sigma作为数据分布学习view-common特征。这样的话,可以更加适合对抗生成过程的解释和迁移。以上为拙见。

上一篇 下一篇

猜你喜欢

热点阅读