Skip to content

Language

Currency

Efficient Autonomous Navigation for Air-Ground Robots in Complex Dynamic Environments

by AMOVLAB 14 Apr 2025 0 Comments

As the application scope of air-land amphibious robots (AGR) continues to expand in fields such as emergency rescue and urban inspection, the challenge of achieving autonomous navigation in complex dynamic environments has become increasingly prominent. In this regard, Wang Junming of the University of Hong Kong is based on AMOVLABP600UAVThe platform independently built a set of amphibious amphibious robots, using the Prometheus open-source framework to complete algorithm simulation verification and real-machine deployment. Following the research results of AGRNav [ICRA’24] and HE-Nav [RA-L’24], Wang Junming’s recently published papers OMEGA [RA-L 2025] and OccRWKV [ICRA 2025] provide innovative ideas for solving autonomous navigation problems in complex dynamic environments. These studies not only expand the application scenarios of air-ground amphibious robots, but also lay a theoretical foundation for improving their adaptability in uncertain environments.

Air-ground amphibious robot series content

OMEGA navigation system

Traditional air-ground robot navigation systems rely on the 3D semantic occupancy network to predict occlusion areas and perform path planning through ESDF (Euclidean signed distance field). However, in dynamic scenes (such as crowded areas), this solution will suffer from problems such as perception delay and inefficient planning.OMEGA is the first specially designed for air-ground amphibious robots to ensure efficient autonomous navigation in highly obscured and rapidly changing environments.

Technical Highlights

The OMEGA system achieves end-to-end optimization through the OccMamba perception network and the AGR-Planner planner.

OccMamba Network:

  • A three-branch network structure is innovatively proposed to decouple semantic and geometric prediction into different parts, and efficiently perform 3D semantic occupancy prediction by integrating Sem-Mamba and Geo-Mamba modules.

  • The Mamba state space model is introduced to capture long-distance dependencies using linear complexity.

  • Features are projected to BEV space fusion, and the computing load is reduced by 37% (22.1 FPS real-time inference).

AGR-Planner Planner:

  • ESDF-Free path search: Combining the Kinodynamic A* algorithm and gradient optimization, the planning time is shortened from 6.5s to 0.8s.

  • The energy consumption constraint is introduced to increase the decision-making efficiency of air-to-ground mode switching by 3 times.

Real machine verification

  • On the SemanticKITTI data set, OccMamba’s mIoU reaches 25.0 (SOTA)

  • The measured dynamic environment planning takes only 0.8 seconds, with a success rate of 98%

Video placeholder: The original Chinese article includes a video here. AMOVLAB will manually connect the corresponding YouTube video.

open-source link

https://jmwang0117.github.io/OMEGA/

OccRWKV 3D Semantic Occupancy Network

Computational redundancy is common in existing 3D semantic occupancy networks, and OccRWKV is the first 3D semantic occupancy network designed based on the RWKV architecture, which can achieve global feature modeling with linear complexity.

technological breakthrough

  • Dual branch decoupling design: Separate semantic prediction (identifying object categories) and geometric prediction (judging occupied space) into independent branches to avoid feature interference.

  • RWKV attention mechanism: Through a recurrent network structure similar to human brain memory, in a three-dimensional map containing 256×256×32 three-dimensional units, cross-regional feature association is achieved with linearly increasing calculations.

  • BEV space projection: Three-dimensional features are compressed into a bird's-eye view space and fused, reducing the amount of calculation by 78.5%.

Experimental verification

  • The deployment speed is 22.2 FPS (Jetson Xavier NX) to meet the real-time needs of robots.

  • The zero-sample migration experiment shows that the movement time in the unknown environment navigation task is reduced by 16.5%.

Video placeholder: The original Chinese article includes a video here. AMOVLAB will manually connect the corresponding YouTube video.

open-source link

https://jmwang0117.github.io/OccRWKV/

Leave a comment

All blog comments are checked prior to publishing

Thanks for subscribing!

This email has been registered!

Shop the look

Choose Options

Recently Viewed

Edit Option
Back In Stock Notification
Terms & Conditions
What is Lorem Ipsum? Lorem Ipsum is simply dummy text of the printing and typesetting industry. Lorem Ipsum has been the industry's standard dummy text ever since the 1500s, when an unknown printer took a galley of type and scrambled it to make a type specimen book. It has survived not only five centuries, but also the leap into electronic typesetting, remaining essentially unchanged. It was popularised in the 1960s with the release of Letraset sheets containing Lorem Ipsum passages, and more recently with desktop publishing software like Aldus PageMaker including versions of Lorem Ipsum. Why do we use it? It is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. The point of using Lorem Ipsum is that it has a more-or-less normal distribution of letters, as opposed to using 'Content here, content here', making it look like readable English. Many desktop publishing packages and web page editors now use Lorem Ipsum as their default model text, and a search for 'lorem ipsum' will uncover many web sites still in their infancy. Various versions have evolved over the years, sometimes by accident, sometimes on purpose (injected humour and the like).
this is just a warning
Login
Shopping Cart
0 items