跳到内容

语言

Currency

Can UAVs Race Without an IMU? University of Zurich Team Demo

by AMOVLAB 06 May 2025 0 条评论

In March 2025, at the WorldMinds conference at the Kaufleuten Theater in Zurich, more than 500 live audiences witnessed two autonomous racing UAVs that relied entirely on vision shuttled between stages. The large background screen visualizes the observations and decisions of each UAVAI algorithm in real time, and the entire system no mapNo inertial measurement unit (IMU), no traditional SLAM, and can still fly stably under dim lights and strict safety restrictions.

Video placeholder: The original Chinese article includes a video here. AMOVLAB will manually connect the corresponding YouTube video.

This performance embodies the many years of AIUAV research by the Robotics and Perception Research Group (RPG) at the University of Zurich.A series of key technical achievements, marking a major milestone for them in the field of purely visual autonomous flight. This article will help you sort out these research threads, analyze key technologies, and look forward to future application prospects.

01

Swift: AI beats world champion

When the algorithm calculates every dive and roll more accurately and faster than humans, the champion pilot can only look at the "machine" and sigh. In 2023, the RPG team of the University of Zurich and the Intel team designed an autonomous UAV system - Swift, and defeated three world championship-level human UAV racing pilots in the official competition, with a total record of 15 wins and 10 losses, and broke the fastest UAV racing record. This blockbuster research result was also published as a cover article in the current issue of Nature magazine.

Video placeholder: The original Chinese article includes a video here. AMOVLAB will manually connect the corresponding YouTube video.

Technical Highlights

  • Using only airborne vision + IMU, it completely gets rid of external localization and SLAM dependence, and the information conditions are consistent with human pilots.

  • With only 50 seconds of real flight data, Gaussian process + KNN can be used to quickly compensate for perception and dynamics errors and achieve zero-sample migration.

  • The two stages of perception and control realize the end-to-end reinforcement learning strategy.

  • The reward function takes into account both speed and field of view, and “aiming the camera at the gate” is written into the reward to ensure stable passage at high speeds.

  • High-fidelity parallel simulation, 1e8 steps of training in 50 minutes, accurately replicating the PID, ESC and battery models, greatly shortening the iteration cycle.

The picture comes from the paper "Champion-level Drone Racing Using Deep Reinforcement Learning", Elia Kaufmann et al., Nature 2023

02

Reinforcement learning outperforms optimal control

In a study published in Science Robotics in September 2023, the RPG team of the University of Zurich compared the performance of reinforcement learning and optimal control in UAV racing. The results show that reinforcement learning can directly optimize task-level goals and demonstrate stronger adaptability and performance when facing complex environments and model uncertainties.

Comparison of three control schemes

  • Trajectory Tracking: Find the shortest trajectory offline, and track MPC online.

  • Contouring Control: Online MPC simultaneously maximizes path progress and minimizes deviations.

  • Gate-Progress RL: End-to-end RL directly maximizes the displacement toward the center of the next gate without reference to the trajectory.

Main conclusions

  • The optimization method is not the decisive factor, the optimization goal is the key.

  • It is demonstrated that in highly dynamic robotic tasks, choosing the appropriate optimization objective allows reinforcement learning (RL) to surpass optimal control.

  • The next step is to get rid of external VICONs and improve resilience to environmental changes and post-collision.

03

End-to-end visual flight controller

In professional UAV racing, human pilots can complete high-speed door crossings by relying only on first-person perspective (FPV) video streams; and the most “agile” autonomous quadcopter in academia still relies on the explicit state estimation of VIO/SLAM. In 2024, the University of Zurich RPG team verified it in a real environment for the first time—It does not rely on IMU and state estimation at all, and can complete three laps of racing at 40 km/h and 2 g acceleration using vision alone.

The picture comes from the paper "Demonstrating Agile Flight from Pixels without State Estimation", Ismail Geles et al., Robotics: Science and Systems 2024

Technical Highlights

  • Visual feature driven control: For the first time, thrust and angular velocity instructions are directly generated based on camera visual characteristics (rather than state estimation), eliminating dependence on IMU and SLAM.

  • Abstract modeling of the inner edge of the door frame: It is proposed to use the inner edge features of the door frame as visual input to efficiently simulate training and accelerate the reinforcement learning process.

  • Asymmetric Actor-Critic architecture: Privileged information is introduced to guide Critic during the training process to improve the sample efficiency and stability of learning complex control strategies from visual input.

  • Swin Transformer visual perceptron: Develop a highly robust gate detector based on Swin Transformer V2 to cope with challenges such as illumination and blur in actual environments.

  • Real deployment with zero state estimation: In the real world, high-speed flight missions were completed with a 100% success rate, verifying the ability to directly transfer from simulation to reality.

04

Let the environment also "think"

In the past few years, reinforcement learning (RL) has achieved great success in the field of robot control, from dexterous operation and quadruped running to highly dynamic UAV racing. However, a key problem remains unresolved: once the RL agent changes to a new environment (such as a change in track layout), it is almost impossible to adapt and must be retrained, which greatly limits its practical application. This year the University of Zurich RPG team presented at the ICRA conference by introducing a environmental strategy(Environment Policy), dynamically adjust the track layout, the system can cultivate a system that can operate in various situations.Strange track Universal UAV strategy for on-flight without the need to retrain every time.

Image source: Paper "Environment as Policy: Learning to Race in Unseen Tracks", Hongze Wang et al., ICRA 2025

Technical Highlights

  • Introduction of environmental adaptation strategies: For the first time, the environmental strategy (SAC) based on reinforcement learning is introduced into UAV racing to dynamically shape the training track and guide the growth of flight strategies.

  • Relative ranking reward mechanism: The reward design of the environmental strategy is based on the relative performance of the flight strategy in different tracks, automatically balancing the difficulty of the track.

  • A single flight strategy can conquer many unknown tracks: The trained single strategy can complete the flight on 6 complex 2D and 3D unseen tracks without retraining, with a 100\% success rate.

  • Dynamic track flight test: On the moving door track, the flight strategy showed far greater adaptability than traditional methods and could cope with large dynamic disturbances.

  • Real world migration verification: It was directly deployed on a real physical UAV platform and successfully completed all unseen flight missions, verifying the universal migration ability from simulation to reality.

05

The future of AIUAV is here

Application prospects

  • Industrial inspection and maintenance: There is no need to install external localization equipment and it can fly efficiently in restricted environments such as complex pipeline corridors and nuclear power plants.

  • Search, rescue and emergency response:A lightweight purely visual system, suitable for low-visibility scenes such as night or smoke, to achieve fast automated search.

  • Smart logistics and urban air transportation: Large-scale deployment is more economical and supports adaptive planning of diverse routes and robust flight.

  • Film and television aerial photography and live event broadcast: Precise tracking and high-dynamic image capture bring a more impactful imaging experience.

Resource Express

Championship-level UAV racing based on deep reinforcement learning
Champion-level Drone Racing Using Deep _Reinforcement Learning
Paper link:_https://www.nature.com/articles/s41586-023-06419-4.pdf

Challenging the Limits: Comparative Study of Optimal Control and Reinforcement Learning in Autonomous Racing
Reaching the Limit in Autonomous Racing: Optimal Control versus Reinforcement Learning_
Paper link:_https://www.science.org/doi/10.1126/scirobotics.adg1462

Vision-driven agile flight without state estimation
Demonstrating Agile Flight from Pixels without State Estimation_
Paper link:_https://rpg.ifi.uzh.ch/docs/RSS24\_Geles.pdf

In vision-based agile flight, imitation learning assists reinforcement learning.
Bootstrapping Reinforcement Learning with Imitation for Vision-Based Agile Flight_
Paper link:_https://rpg.ifi.uzh.ch/docs/CoRL24\_Xing.pdf

Multi-task reinforcement learning method for quadcopter
Multi-Task Reinforcement Learning for Quadrotors
_Paper link:_https://rpg.ifi.uzh.ch/docs/RAL24\_Xing.pdf

Using the environment as strategy: learning to race autonomously on unknown tracks
Environment as Policy: Learning to Race in Unseen Tracks Paper link: https://rpg.ifi.uzh.ch/docs/ICRA25\_Wang.pdf

发表评论

All blog comments are checked prior to publishing

感谢您订阅

This email has been registered!

Shop the look

Choose Options

Recently Viewed

Edit Option
Back In Stock Notification
Terms & Conditions
What is Lorem Ipsum? Lorem Ipsum is simply dummy text of the printing and typesetting industry. Lorem Ipsum has been the industry's standard dummy text ever since the 1500s, when an unknown printer took a galley of type and scrambled it to make a type specimen book. It has survived not only five centuries, but also the leap into electronic typesetting, remaining essentially unchanged. It was popularised in the 1960s with the release of Letraset sheets containing Lorem Ipsum passages, and more recently with desktop publishing software like Aldus PageMaker including versions of Lorem Ipsum. Why do we use it? It is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. The point of using Lorem Ipsum is that it has a more-or-less normal distribution of letters, as opposed to using 'Content here, content here', making it look like readable English. Many desktop publishing packages and web page editors now use Lorem Ipsum as their default model text, and a search for 'lorem ipsum' will uncover many web sites still in their infancy. Various versions have evolved over the years, sometimes by accident, sometimes on purpose (injected humour and the like).
this is just a warning
登录
Shopping Cart
0 items