Underwater Target Recognition and Path Planning Using Otsu-KMeans Segmentation and EM-PF-SLAM with Enhanced A-JPS Algorithm
Abstract
Autonomous Underwater Vehicles (AUVs) face significant challenges in obstacle recognition and path planning due to sonar image noise and uncertain underwater environments. This study proposes a robust target recognition and obstacle avoidance framework integrating forward-looking sonar image processing, probabilistic SLAM, and a fast global path planner. Sonar images are first segmented using Otsu thresholding and refined through K-means clustering to extract obstacle features. These features are then fused with odometry and inertial data using an Expectation-Maximization (EM) based data association module and Particle Filter (PF) for posterior state estimation, enabling accurate SLAM under sonar uncertainty. For navigation, an improved A*-JPS algorithm is applied to achieve time-efficient path planning. Experiments were conducted on the publicly available SeaNet Dataset, which contains diverse acoustic scenes from Northwest Pacific environments. Tests were run on a platform with an Intel i5-10210U CPU and 16 GB RAM, using evaluation metrics including recognition accuraacy, RMSE, absolute trajectory error (ATE), loop closure recall, and path smoothness. Results show that the proposed Otsu-K-means sonar segmentation achieves 99.3% recognition accuracy, outperforming standalone methods by over 25%. The EM-PF-SLAM system achieves an ATE of 0.42 m and a loop closure recall of 95.3%, reducing localization uncertainty by over 40% compared to PF-only baselines. The hybrid A*-JPS planner reduces path planning time to 0.12 s and achieves a smoothness score of 0.85. These findings highlight the method’s suitability for real-time, high-precision AUV operations in complex acoustic environments.References
Sun Q , Wang K .Underwater single-channel acoustic signal multitarget recognition using convolutional neural networks[J].The Journal of the Acoustical Society of America, 2022, 151(3):2245-2254.
Feng S , Ma S , Zhu X ,et al.Artificial Intelligence-Based Underwater Acoustic Target Recognition: A Survey[J].Remote Sensing, 2024, 16(17):144-152.
Yang J, Yan S, Zeng D,et al.Self-supervised learning minimax entropy domain adaptation for the underwater target recognition[J].Applied acoustics, 2024, 216(1):109725.1-109725.10.
Zhou X, Yang K, Yan Y,et al.Underwater Noise Target Recognition Based on Sparse Adversarial Co-Training Model with Vertical Line Array[J].Journal of Ocean University of China, 2023, 22(5):1201-1215.
Shen Q, Jia J, Cai L .Underwater Incomplete Target Recognition Network via Generating Feature Module[J].International Journal of Distributed Sensor Networks, 2023, 22(32):41-54.
Guan Z, Hou C, Zhou S,et al.Research on Underwater Target Recognition Technology Based on Neural Network[J].Wireless Communications & Mobile Computing, 2022, 14(8):211-223.
Yang S, Jin A, Zeng X,et al.Underwater acoustic target recognition based on knowledge distillation under working conditions mismatching[J].Multimedia Systems, 2024, 30(1):12.1-12.14.
Chen D, Sun S, Lei Z, Shao H, Wang Y.Ship Target Detection Algorithm Based on Improved YOLOv3 for Maritime Image. Journal of Advanced Transportation, 2021, 21(10):212-223.
Hu G X, Hu B L, Yang Z, Huang L, Li P. Pavement Crack Detection Method Based on Deep Learning Models. Wireless Communications and Mobile Computing, 2021, 32(1):1-13.
Kikuchi T, Fukuda T, Yabuki N. Diminished reality using semantic segmentation and generative adversarial network for landscape assessment: evaluation of image inpainting according to colour vision. Journal of Computational Design and Engineering, 2022, 9(5): 1633-1649.
Li G, Ji Z, Qu X, Zhou R, Cao D. Cross-domain object detection for autonomous driving: A stepwise domain adaptative YOLO approach. IEEE Transactions on Intelligent Vehicles, 2022, 7(3): 603-615.
Lee J, Hwang K. YOLO with adaptive frame control for real-time object detection applications. Multimedia Tools and Applications, 2022, 81(25): 36375-36396.
Liang S, Wu H, Zhen L, Hua Q, Garg S, Kaddoum G. Edge YOLO: Real-time intelligent object detection system based on edge-cloud cooperation in autonomous vehicles. IEEE Transactions on Intelligent Transportation Systems, 2022, 23(12): 25345-25360.
Huang L, Ye L, Li R, Zhang S, Qu C, Li S. Dynamic human retinal pigment epithelium (RPE) and choroid architecture based on single-cell transcriptomic landscape analysis. Genes & Diseases, 2023, 10(6): 2540-2556.
Wang X, Sun X, Wang Z. Construction of visual evaluation system for building block night scene lighting based on multi-target recognition and data processing. IET Circuits, Devices & Systems, 2023, 17(3): 149-159.
Hui, Peng, Yifan, Zhang, Sen, Yang, Bin, Song. Battlefield Image Situational Awareness Application Based on Deep Learning. IEEE Intelligent Systems, 2019, 35(1):36-43.
Laroca R, Zanlorensi L A, Gonalves G R, Todt E, Menotti D. An efficient and layout﹊ndependent automatic license plate recognition system based on the YOLO detector. IET Intelligent Transport Systems, 2021, 15(4):483-503.
Feng H, Jie S, Hang M, Wang R, Fang F, Zhang G. A novel framework on intelligent detection for module defects of PV plant combining the visible and infrared images. Solar Energy, 2022, 236(4):406-416.
DOI:
https://doi.org/10.31449/inf.v50i10.12725Downloads
Published
How to Cite
Issue
Section
License
Authors retain copyright in their work. By submitting to and publishing with Informatica, authors grant the publisher (Slovene Society Informatika) the non-exclusive right to publish, reproduce, and distribute the article and to identify itself as the original publisher.
All articles are published under the Creative Commons Attribution license CC BY 3.0. Under this license, others may share and adapt the work for any purpose, provided appropriate credit is given and changes (if any) are indicated.
Authors may deposit and share the submitted version, accepted manuscript, and published version, provided the original publication in Informatica is properly cited.







