复杂场景下视觉先验信息的地图恢复SLAM
Visual prior-information-based map recovery SLAM in complex scenes
- 2020年25卷第1期 页码:158-170
收稿:2019-02-16,
修回:2019-6-29,
录用:2019-7-6,
纸质出版:2020-01-16
DOI: 10.11834/jig.190041
移动端阅览

浏览全部资源
扫码关注微信
收稿:2019-02-16,
修回:2019-6-29,
录用:2019-7-6,
纸质出版:2020-01-16
移动端阅览
目的
2
目前已有的单目视觉SLAM(simultaneous localization and mapping)系统每次开始运行时都将初始帧而不是绝对位置设置为参考帧,不能在一个固定的坐标系中获得位姿,导致无法重用已有的建图信息,而且在复杂场景中相机容易跟踪失败,需要当前帧与已有的关键帧非常相似时才能重定位并继续建图。针对这个问题,提出一种具有重新初始化、地图重用与地图恢复能力的视觉SLAM系统。
方法
2
首先,加载先验地图,通过ORB(oriented brief)特征匹配SLAM系统当前帧与先验地图关键帧,并结合重定位方法完成SLAM系统的初始化。接着,为了避免丢失地图,建立一种应对SLAM系统跟踪失败的地图保存机制,保存跟踪成功地图,并提出一种自适应快速重新初始化算法,引入灭点检测,自动选择最佳重新初始化策略,保证SLAM系统继续跟踪与建图,建立的地图称为恢复地图。最后,对于跟踪成功地图与恢复地图,采用改进的回环方法获得它们之间的转换关系,并提出一种地图恢复法,减少跟踪成功地图与恢复地图尺度不一带来的误差,确保得到的全局一致地图更加准确。
结果
2
在经过加噪处理的KITTI数据集上进行地图恢复融合的测试,实验结果表明,在KITTI00、KITTI02、KITTI05数据集下,本文提出的SLAM系统比ORB-SLAM2系统分别可以多获得39.25%、47.75%、32.46%的地图信息。在EuRoC数据集上的运行结果表明,本文提出的单目视觉SLAM系统不仅在建图精度方面与ORB-SLAM2效果相当,还在跟踪稳定性方面有显著提升。
结论
2
本文提出的SLAM系统可以在跟踪失败的情况下有效恢复地图;此外,还可以高效重用SLAM系统已有的建图结果,固定SLAM地图坐标系,提升SLAM系统运行稳定性。
Objective
2
Simultaneous localization and mapping (SLAM) has been an important research topic in the last two decades in the computer vision and robotics communities. SLAM aims to build a map of an unknown environment and localize the sensor in the map with real-time operation. Among the existing research results
many state-of-the-art monocular visual SLAM schemes are used. In most existing traditional approaches
the system would set the initial frame instead of the absolute position as the reference frame when it begins running
and it could not acquire the pose in a fixed coordinate system
resulting in the failure of re-using the existing mapping information. Additionally
the present monocular visual SLAM schemes are prone to tracking failure in case of complicated scenes such as cluttered background
motion blur
and defocus
and the existing schemes process it by place recognition
which is a key module of a SLAM system to close loops and relocalize the camera. However
the main drawback of the solution is that it requires the current frame to be highly similar to the existing key frame to ensure the success of relocalization
generally causing inconvenience in certain real application scenarios. For example
an advanced mobile robot is required to return to the place where the location information is lost after tracking failure so that the system could continue tracking and mapping. However
the map information could not be recovered from the failure of system tracking to the success of relocalization
thereby resulting in considerable map information loss. Therefore
to address the limitations
such as loss of map information and requirement for high similarity between a current frame and an existing frame while relocalizing
we propose a prior-information-based visual SLAM system with the capacity of re-initialization
map re-using
and map recovery in complex scenes.
Method
2
In this study
the first step is loading the prior map
matching the current frame with the key frame of the prior map in the SLAM system by applying ORB features to obtain matched frames
and finishing the initialization of SLAM system in combination of relocalization
which is conducive to ensuring the consistency of the SLAM coordinate system. Second
a map-saving mechanism is built to save the map of successful tracking before tracing failure occurs to avoid losing the map information. The traditional SLAM always conducts relocalization after tracking failure
and the map could not be generated before the relocalization succeeds accordingly
further resulting in the loss of map information. Considering the aforementioned issues
we address tracking failure by re-initializing rather than relocalizing to establish a new map called the recovery map. To improve the probability of successful initialization and the ability to recover map information
we investigate a self-adaptation fast re-initialization algorithm with the introduction of vanishing point detection. Initially
the scene vanishing points are detected by M-estimator sample consensus (MSAC) method. If vanishing points are present
the fast initialization method would be used. Otherwise
a simple initialization method would be applied; in other words
the proposed algorithm reduces the initialization requirements of the traditional SLAM system. The optimal re-initialization strategy could be selected automatically to ensure that the SLAM system continues tracking and mapping. Finally
for the successful tracking map and the recovery map
the improved loop method is used to obtain the transformation relationship between them. Once the overlapping maps are detected
the relative position and pose relationships between overlapping maps is calculated. Furthermore
the recovery map is transformed into a coordinate system that is consistent with the coordinate system of the matched map in memory. Second
scale information is used to fuse overlapping map points
and a map recovery method is proposed to reduce the errors caused by different scales between the successful tracking map and the recovered map
thereby solving the discrepancy between two maps caused by different scales to obtain accurate global consistency.
Result
2
We have compared the proposed system with the state-of-the-art system ORB-SLAM2 on two public datasets
namely
KITTI and EuroC
and a dataset is recorded in complex scenarios. The evaluation metrics contain the number of key frames
integrity rate of key frames
and recovery rate of key frames; moreover
we provide several point cloud maps and trajectory maps of the two systems for comparison. The experimental results show that the proposed system not only performs comparably in accuracy with ORB-SLAM2
but also significantly outperforms state-of-the-art existing methods in various real-world settings in tracking and mapping robustness. The comparative experiment consists of three parts
and the final experiment results are used to verify the effectiveness of the proposed system. In the first part
the KITTI dataset is added with noise
causing the traditional SLAM system to fail in tracking. The map restoration ability of the proposed system on KITTI00
KITTI02
and KITTI05 increased by 39.25%
47.75%
and 32.46%
respectively. The experimental results show that the advantages of the proposed system are used to address tracking failure effectively and build a complete map
and the proposed systems are better than the state-of-the-art system ORB-SLAM2. In the second part
compared with the results in EuRoC
the proposed system has the same mapping accuracy on V1_01_easy dataset and V1_02_medium dataset as that of ORB-SLAM2. ORB-SLAM2 could not run steadily on the V1_03_difficult dataset
whereas the proposed system could
thereby also indicating that tracking stability has been significantly improved. The third part is the test in the real application scenario. The running dataset is obtained during walking more than one closed track in the underground garage with a camera. The underground garage has a small unlit room. The experimental results show that the proposed system still performs well in real application scenarios with environmental and motion complexity
and they are more complete than the map obtained by ORB-SLAM2.
Conclusion
2
Experimental results indicate that the proposed system can effectively recover maps in case of tracking failure. Furthermore
the proposed system can efficiently reuse the existing mapping results of the SLAM system
fix the map coordinate system
and significantly improve the robustness of the system.
Burri M, Nikolic J, Gohl P, Schneider T, Rehder J, Omari S, Achtelik M W and Siegwart R. 2016. The EuRoC micro aerial vehicle datasets. International Journal of Robotics Research, 35(10):1157-1163[DOI:10.1177/0278364915620033]
Engel J, Koltun V and Cremers D. 2018. Direct sparse odometry. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(3):611-625[DOI:10.1109/TPAMI.2017.2658577]
Forster C, Pizzoli M and Scaramuzza D. 2014. SVO: fast semi-direct monocular visual odometry//Proceedings of 2014 IEEE International Conference on Robotics and Automation (ICRA). Hong Kong, China: IEEE: 15-22[ DOI: 10.1109/ICRA.2014.6906584 http://dx.doi.org/10.1109/ICRA.2014.6906584 ]
Galvez-López D and Tardos J D. 2012. Bags of binary words for fast place recognition in image sequences. IEEE Transactions on Robotics, 28(5):1188-1197[DOI:10.1109/TRO.2012.2197158]
Geiger A, Lenz P, Stiller C and Urtasun R. 2013. Vision meets robotics:the KITTI dataset. The International Journal of Robotics Research, 32(11):1231-1237[DOI:10.1177/0278364913491297]
Horn B K P. 1987. Closed-form solution of absolute orientation using unit quaternions. Journal of the Optical Society of America A, 4(4):629-642[DOI:10.1364/JOSAA.4.000629]
Klein G and Murray D. 2007. Parallel tracking and mapping for small AR workspaces//Proceedings of the 6th IEEE and ACM International Symposium on Mixed and Augmented Reality. Nara, Japan: IEEE: 225-234[ DOI: 10.1109/ISMAR.2007.4538852 http://dx.doi.org/10.1109/ISMAR.2007.4538852 ]
Kasyanov A, Engelmann F, Stückler J and Leibe B. 2017. Keyframe-based visual-inertial online SLAM with relocalization//Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. Vancouver, BC, Canada: IEEE: 6662-6669[ DOI: 10.1109/IROS.2017.8206581 http://dx.doi.org/10.1109/IROS.2017.8206581 ]
Moreno-Noguer F, Lepetit V and Fua P. 2007. Accurate non-iterative o ( n ) solution to the P n P problem//Proceedings of the 11th IEEE International Conference on Computer Vision. Rio de Janeiro, Brazil: IEEE: 1-8[ DOI: 10.1109/ICCV.2007.4409116 http://dx.doi.org/10.1109/ICCV.2007.4409116 ].
Mur-Artal R and Tardós J D. 2017. ORB-SLAM2:an open-source SLAM system for monocular, stereo, and RGB-D cameras. IEEE Transactions on Robotics, 33(5):1255-1262[DOI:10.1109/TRO.2017.2705103]
Nieto M and Salgado L. 2010. Non-linear optimization for robust estimation of vanishing points//Proceedings of 2010 IEEE International Conference on Image Processing. Hong Kong, China: IEEE: 1885-1888[ DOI: 10.1109/ICIP.2010.5652381 http://dx.doi.org/10.1109/ICIP.2010.5652381 ]
Rublee E, Rabaud V, Konolige K and Bradski G. 2011. ORB: an efficient alternative to SIFT or SURF//Proceedings of 2011 International Conference on Computer Vision. Barcelona, Spain: IEEE: 2564-2571[ DOI: 10.1109/ICCV.2011.6126544 http://dx.doi.org/10.1109/ICCV.2011.6126544 ]
Strasdat H. 2012. Local Accuracy and Global Consistency for Efficient SLAM. London:Imperial College
Trinh H H and Jo K H. 2006. Image-based structural analysis of building using line segments and their geometrical vanishing points//Proceedings of 2006 SICE-ICASE International Joint Conference. Busan, South Korea: IEEE: 566-571[ DOI: 10.1109/SICE.2006.315548 http://dx.doi.org/10.1109/SICE.2006.315548 ]
Zhang S B, Liu S, Zhang J H, Wang Z H and Wang X Y. 2017. Fast initialization for feature-based monocular slam//Proceedings of 2017 IEEE International Conference on Image Processing. Beijing, China: IEEE: 2119-2123[ DOI: 10.1109/ICIP.2017.8296656 http://dx.doi.org/10.1109/ICIP.2017.8296656 ]
Zhang J H, Wang Y Y, Wang Z Y, Chen S Y and Guan Q. 2018. Recovery of lost map for monocular simultaneous localization and mapping. Journal of Image and Graphics, 23(3):372-383
张剑华, 王燕燕, 王曾媛, 陈胜勇, 管秋. 2018.单目同时定位与建图中的地图恢复融合技术.中国图象图形学报, 23(3):372-383)[DOI:10.11834/jig.170505]
相关文章
相关作者
相关机构
京公网安备11010802024621