Underwater Target Recognition and Path Planning Using Otsu-KMeans Segmentation and EM-PF-SLAM with Enhanced A-JPS Algorithm

Abstract

Autonomous Underwater Vehicles (AUVs) face significant challenges in obstacle recognition and path planning due to sonar image noise and uncertain underwater environments. This study proposes a robust target recognition and obstacle avoidance framework integrating forward-looking sonar image processing, probabilistic SLAM, and a fast global path planner. Sonar images are first segmented using Otsu thresholding and refined through K-means clustering to extract obstacle features. These features are then fused with odometry and inertial data using an Expectation-Maximization (EM) based data association module and Particle Filter (PF) for posterior state estimation, enabling accurate SLAM under sonar uncertainty. For navigation, an improved A*-JPS algorithm is applied to achieve time-efficient path planning. Experiments were conducted on the publicly available SeaNet Dataset, which contains diverse acoustic scenes from Northwest Pacific environments. Tests were run on a platform with an Intel i5-10210U CPU and 16 GB RAM, using evaluation metrics including recognition accuraacy, RMSE, absolute trajectory error (ATE), loop closure recall, and path smoothness. Results show that the proposed Otsu-K-means sonar segmentation achieves 99.3% recognition accuracy, outperforming standalone methods by over 25%. The EM-PF-SLAM system achieves an ATE of 0.42 m and a loop closure recall of 95.3%, reducing localization uncertainty by over 40% compared to PF-only baselines. The hybrid A*-JPS planner reduces path planning time to 0.12 s and achieves a smoothness score of 0.85. These findings highlight the method’s suitability for real-time, high-precision AUV operations in complex acoustic environments.

References

Sun Q , Wang K .Underwater single-channel acoustic signal multitarget recognition using convolutional neural networks[J].The Journal of the Acoustical Society of America, 2022, 151(3):2245-2254.

Feng S , Ma S , Zhu X ,et al.Artificial Intelligence-Based Underwater Acoustic Target Recognition: A Survey[J].Remote Sensing, 2024, 16(17):144-152.

Yang J, Yan S, Zeng D,et al.Self-supervised learning minimax entropy domain adaptation for the underwater target recognition[J].Applied acoustics, 2024, 216(1):109725.1-109725.10.

Zhou X, Yang K, Yan Y,et al.Underwater Noise Target Recognition Based on Sparse Adversarial Co-Training Model with Vertical Line Array[J].Journal of Ocean University of China, 2023, 22(5):1201-1215.

Shen Q, Jia J, Cai L .Underwater Incomplete Target Recognition Network via Generating Feature Module[J].International Journal of Distributed Sensor Networks, 2023, 22(32):41-54.

Guan Z, Hou C, Zhou S,et al.Research on Underwater Target Recognition Technology Based on Neural Network[J].Wireless Communications & Mobile Computing, 2022, 14(8):211-223.

Yang S, Jin A, Zeng X,et al.Underwater acoustic target recognition based on knowledge distillation under working conditions mismatching[J].Multimedia Systems, 2024, 30(1):12.1-12.14.

Chen D, Sun S, Lei Z, Shao H, Wang Y.Ship Target Detection Algorithm Based on Improved YOLOv3 for Maritime Image. Journal of Advanced Transportation, 2021, 21(10):212-223.

Hu G X, Hu B L, Yang Z, Huang L, Li P. Pavement Crack Detection Method Based on Deep Learning Models. Wireless Communications and Mobile Computing, 2021, 32(1):1-13.

Kikuchi T, Fukuda T, Yabuki N. Diminished reality using semantic segmentation and generative adversarial network for landscape assessment: evaluation of image inpainting according to colour vision. Journal of Computational Design and Engineering, 2022, 9(5): 1633-1649.

Li G, Ji Z, Qu X, Zhou R, Cao D. Cross-domain object detection for autonomous driving: A stepwise domain adaptative YOLO approach. IEEE Transactions on Intelligent Vehicles, 2022, 7(3): 603-615.

Lee J, Hwang K. YOLO with adaptive frame control for real-time object detection applications. Multimedia Tools and Applications, 2022, 81(25): 36375-36396.

Liang S, Wu H, Zhen L, Hua Q, Garg S, Kaddoum G. Edge YOLO: Real-time intelligent object detection system based on edge-cloud cooperation in autonomous vehicles. IEEE Transactions on Intelligent Transportation Systems, 2022, 23(12): 25345-25360.

Huang L, Ye L, Li R, Zhang S, Qu C, Li S. Dynamic human retinal pigment epithelium (RPE) and choroid architecture based on single-cell transcriptomic landscape analysis. Genes & Diseases, 2023, 10(6): 2540-2556.

Wang X, Sun X, Wang Z. Construction of visual evaluation system for building block night scene lighting based on multi-target recognition and data processing. IET Circuits, Devices & Systems, 2023, 17(3): 149-159.

Hui, Peng, Yifan, Zhang, Sen, Yang, Bin, Song. Battlefield Image Situational Awareness Application Based on Deep Learning. IEEE Intelligent Systems, 2019, 35(1):36-43.

Laroca R, Zanlorensi L A, Gonalves G R, Todt E, Menotti D. An efficient and layout﹊ndependent automatic license plate recognition system based on the YOLO detector. IET Intelligent Transport Systems, 2021, 15(4):483-503.

Feng H, Jie S, Hang M, Wang R, Fang F, Zhang G. A novel framework on intelligent detection for module defects of PV plant combining the visible and infrared images. Solar Energy, 2022, 236(4):406-416.

Authors

  • Gang Ji School of Information Engineering, Xiamen Ocean Vocational College, Xiamen, FuJian, 361101, China

DOI:

https://doi.org/10.31449/inf.v50i10.12725

Downloads

Published

03/18/2026

How to Cite

Ji, G. (2026). Underwater Target Recognition and Path Planning Using Otsu-KMeans Segmentation and EM-PF-SLAM with Enhanced A-JPS Algorithm. Informatica, 50(10). https://doi.org/10.31449/inf.v50i10.12725