Current Issue Cover
结合密集连接的轻量级高分辨率人体姿态估计

高坤, 李汪根, 束阳, 葛英奎, 王志格(安徽师范大学计算机与信息学院)

摘 要
为了更好地实现轻量化的人体姿态估计,在轻量级模型极为有限的资源下实现更高的检测性能,基于高分辨率网络(HRNet)提出了结合密集连接网络的轻量级高分辨率人体姿态估计网络(LDHNet)。通过重新设计HRNet中的阶段分支结构以及提出新的轻量级特征提取模块,构建了轻量高效的特征提取单元,与此同时对多分支之间特征融合部分进行了轻量化改进,进一步的降低模型的复杂度,最终大幅降低了模型的参数量与计算量,实现了轻量化的设计目标,并且保证了模型的性能。通过实验表明,在MPII测试集上相比于最轻量的自顶向下的轻量级人体姿态估计模型LiteHRNet,LDHNet仅通过增加了少量参数量与计算量,平均预测准确度即提升了1.5个百分点,与LiteHRNet的改进型DiteHRNet相比也提升了0.9个百分点,在COCO验证集上的结果表明,与LiteHRNet相比,LDHNet的平均检测准确度提升了3.4个百分点,与DiteHRNet相比也提升了2.3个百分点,与融合Transformer的HRFormer相比,LDHNet在参数量更低并且计算量相当的条件下有着近似的检测性能,在面对实际场景时LDHNet也有着稳定的表现,在同样的环境下LDHNet的推理速度要高于基线HRNet以及LiteHRNet等,以上实验结果表明该模型有效实现了轻量化并保证了预测性能。
关键词
Lightweight high-resolution human pose estimation combined with densely connected networks

gaokun, liwanggen, shuyang, geyingkui, wangzhige(School of Computer and Information,Anhui Normal University)

Abstract
Human pose estimation is a technology that can be widely used in life. In recent years, many excellent high-precision methods have been proposed, but they are often accompanied by very large model scale, which will encounter the problem of computing power bottleneck in application. Whether for model training or deployment, large models require a lot of computing power as the basis. Most of them have low computing power. Similarly, for the scenes in daily life, the equipment needs more applicability and detection speed of the model, which is difficult to achieve by large models. Because of such requirements, lightweight human pose estimation has become a hot research field. The main problem is how to achieve higher detection accuracy and faster detection speed under the extremely limited number of resources. Lightweight models will inevitably fall into a disadvantage in detection accuracy compared with large models, but fortunately, from many studies in recent years, the lightweight model can also achieve higher detection accuracy. A good balance can be reached between them.
Keywords

订阅号|日报