Cross-Domain Knowledge Distillation for Low-Resolution Human Pose Estimation

Gu, Zejun; Zhao, Zhong-Qiu; Ding, Henghui; Shen, Hao; Zhang, Zhao; Huang, De-Shuang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2405.11448 (cs)

[Submitted on 19 May 2024]

Title:Cross-Domain Knowledge Distillation for Low-Resolution Human Pose Estimation

Authors:Zejun Gu, Zhong-Qiu Zhao, Henghui Ding, Hao Shen, Zhao Zhang, De-Shuang Huang

View PDF HTML (experimental)

Abstract:In practical applications of human pose estimation, low-resolution inputs frequently occur, and existing state-of-the-art models perform poorly with low-resolution images. This work focuses on boosting the performance of low-resolution models by distilling knowledge from a high-resolution model. However, we face the challenge of feature size mismatch and class number mismatch when applying knowledge distillation to networks with different input resolutions. To address this issue, we propose a novel cross-domain knowledge distillation (CDKD) framework. In this framework, we construct a scale-adaptive projector ensemble (SAPE) module to spatially align feature maps between models of varying input resolutions. It adopts a projector ensemble to map low-resolution features into multiple common spaces and adaptively merges them based on multi-scale information to match high-resolution features. Additionally, we construct a cross-class alignment (CCA) module to solve the problem of the mismatch of class numbers. By combining an easy-to-hard training (ETHT) strategy, the CCA module further enhances the distillation performance. The effectiveness and efficiency of our approach are demonstrated by extensive experiments on two common benchmark datasets: MPII and COCO. The code is made available in supplementary material.

Comments:	11 pages, 5 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2405.11448 [cs.CV]
	(or arXiv:2405.11448v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2405.11448

Submission history

From: Zejun Gu [view email]
[v1] Sun, 19 May 2024 04:57:17 UTC (1,166 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Cross-Domain Knowledge Distillation for Low-Resolution Human Pose Estimation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Cross-Domain Knowledge Distillation for Low-Resolution Human Pose Estimation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators