Visual Tactile Fusion Object Clustering | |
Zhang T(张涛)1,2![]() ![]() ![]() | |
Department | 机器人学研究室 |
Conference Name | Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20) |
Conference Date | February 7-12, 2020 |
Conference Place | New York |
Source Publication | Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20) |
Publisher | AAAI Press, |
Publication Place | Palo Alto, California USA |
2020 | |
Pages | 10426-10433 |
Contribution Rank | 1 |
ISSN | 2159-5399 |
ISBN | 978-1-57735-835-0 |
Abstract | Object clustering, aiming at grouping similar objects into one cluster with an unsupervised strategy, has been extensivelystudied among various data-driven applications. However, most existing state-of-the-art object clustering methods (e.g., single-view or multi-view clustering methods) only explore visual information, while ignoring one of most important sensing modalities, i.e., tactile information which can help capture different object properties and further boost the performance of object clustering task. To effectively benefit both visual and tactile modalities for object clustering, in this paper, we propose a deep Auto-Encoder-like Non-negative Matrix Factorization framework for visual-tactile fusion clustering. Specifically, deep matrix factorization constrained by an under-complete Auto-Encoder-like architecture is employed to jointly learn hierarchical expression of visual-tactile fusion data, and preserve the local structure of data generating distribution of visual and tactile modalities. Meanwhile, a graph regularizer is introduced to capture the intrinsic relations of data samples within each modality. Furthermore, we propose a modality-level consensus regularizer to effectively align the visual and tactile data in a common subspace in which the gap between visual and tactile data is mitigated. For the model optimization, we present an efficient alternating minimization strategy to solve our proposed model. Finally, we conduct extensive experiments on public datasets to verify the effectiveness of our framework. |
Language | 英语 |
Document Type | 会议论文 |
Identifier | http://ir.sia.cn/handle/173321/27997 |
Collection | 机器人学研究室 |
Corresponding Author | Ding ZM(丁正明) |
Affiliation | 1.State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences 2.University of Chinese Academy of Sciences 3.Xidian University 4.Indiana University-Purdue University Indianapolis, USA |
Recommended Citation GB/T 7714 | Zhang T,Cong Y,Sun G,et al. Visual Tactile Fusion Object Clustering[C]. Palo Alto, California USA:AAAI Press,,2020:10426-10433. |
Files in This Item: | ||||||
File Name/Size | DocType | Version | Access | License | ||
Visual Tactile Fusio(1443KB) | 会议论文 | 开放获取 | CC BY-NC-SA | View Application Full Text |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment