avatarPatrick Langechuan Liu

Summary

The website content discusses the challenges of mass production autonomous driving in China, with a focus on the complexities of traffic light systems, and highlights the recent progress made by Xpeng Motors in 2023 in addressing these issues.

Abstract

The article provides an in-depth look at the unique challenges faced by autonomous driving systems in China, emphasizing the dynamic nature of traffic participants, the complexity of road structures, and the intricate long-tail corner cases associated with traffic light systems. It delves into the specific difficulties posed by dynamic objects such as vulnerable road users and animals, as well as static challenges like complex intersections and left-turn waiting areas. The content also explores the variety of traffic lights, including those for buses and multibulb signals with countdown timers, and the need for accurate perception and interpretation by autonomous vehicles. The piece concludes by showcasing Xpeng Motors' advancements in autonomous driving solutions, notably their Surround Reality (SR) display and the deployment of XNet, an end-to-end perception stack, which together represent a significant step forward in the mass production of autonomous driving technology in China.

Opinions

  • The author suggests that human driving in China is particularly challenging, which complicates the development of autonomous driving systems.
  • The complexity of traffic light systems in China, with their static geometry but dynamic semantics, is presented as a unique and significant challenge for autonomous vehicles.
  • The left-turn waiting area design in China, while intended to improve traffic flow, is noted to be potentially confusing for both human drivers and autonomous driving systems.
  • The article implies that the perception of traffic lights by autonomous vehicles is a non-trivial task, requiring the correct matching of traffic lights to lanes and the interpretation of various signals and signs.
  • The author expresses that Xpeng Motors has successfully navigated these challenges, delivering a state-of-the-art autonomous driving solution that is currently best-in-class in the Chinese market.
  • The use of actual recordings from mass-produced Xpeng vehicles to demonstrate the capabilities of their autonomous driving system suggests a confidence in the robustness and real-world applicability of their technology.

Challenges of Mass Production Autonomous Driving in China

And the Recent Progress from Xpeng Motors in 2023

This blog post is based on the keynote speech in the End-to-end Autonomous Driving Workshop at CVPR 2023 held in Vancouver, titled “The Practice of Mass Production Autonomous Driving in China”. The recording of the keynote can be found here.

Autonomous driving is a daunting challenge, especially in China, where human driving is already one of the most challenging in the world. There are three main factors that comes into play: dynamic traffic participants, static road structure, and traffic signals. In particular, traffic light control signals pose a unique challenge as they are static in geometry but dynamic in semantics. In the following sessions, we will review the dynamic objects and static environments briefly, and do a deep dive on the interesting and special topic of traffic light.

Dynamic and Static Challenges

Dynamic traffic participants, such as vulnerable road users (VRUs), pose significant challenges for autonomous vehicles in China. VRUs are often unpredictable, taking on different poses and appearing where drivers least expect them. Large animals can suddenly appear on rural roads, while pets may wander onto urban streets. In addition, fully loaded vehicles or tricycles can be difficult to pinpoint the exact vehicle type. Consider the last photo in the middle row, it is actually even very challenging for humans to recognize the scene at first sight. The vehicle, loaded with tree branches, is inadvertently in perfect camouflage.

Various dynamic road users

Static road structure and topology can pose a significant challenge for autonomous vehicles as well. For example, the complex intersection shown here highlights the level of complexities that needs to be addressed here. While resembling a screenshot from a sci-fi movie, this intersection is, in fact, a real place viewable on Google Earth.

Satellite images of an complex intersection

If we zoom in, we will find an interesting road element which is perhaps unique in China, the Left-turn Waiting Area. It is designed to increase left turn traffic throughput, allowing more cars to go through the intersection within one cycle of traffic light. Note the design may not be symmetric, and each direction are designed individually depending on the traffic pattern. And we can even find academic papers about it and its effectiveness. Although it was proposed out of good intention, it could be really confusing for new drivers and the autonomous driving vehicle.

Turning left at an intersection with a waiting area involves a two-step process. Both of them involves different combination of traffic light signals. Here I only showed the most common traffic light pattern. The traffic light combination could be more complex, sometimes involving special traffic lights dedicated to waiting areas.

Left turn waiting areas are backed up by scientific papers

The King of Corner Case: Traffic Lights

Now we can take a deep dive into all the corner cases of traffic lights. Traffic lights are perhaps the category of objects with the most long-tail corner cases. The perception of traffic lights are complicated for two different reasons. First we have to recognize the location and type and color of the traffic light, then we also need to know out of all the traffic lights we detected, here we have six, which one our vehicle should pay attention to. To make this decision, it is essential to obtain the correct matching between traffic lights and different lanes.

A typical traffic light scene in China

One special type of light is traffic lights designed for buses. We have to recognize them correctly for two different reasons. First of all, for planning and control of ego vehicle, we need to recognize them in order to correctly ignore them, as they may carry information conflicting with the lights we should pay attention to and cause confusion for our autonomous vehicles. Yet to predict how a potential bus nearby would maneuver, we need to know its status correctly as well.

Traffic light for buses

Traffic lights designed for buses in China come in many forms, including LED lights with labels such as “BRT,” “SRT,” “Bus,” or a single letter “B”. They can also feature specific Chinese characters like “公交” (bus) or “有轨电车” (monorail), and sometimes include icons depicting a cute little bus. Alongside these features, traffic sign modifiers may also be included, making it essential for autonomous vehicles to detect and recognize these features and associate them accurately with the corresponding traffic lights.

Multibulb traffic light

In addition to the traffic lights dedicated to buses, another complex type of traffic light is the multibulb traffic light. Unlike traditional traffic lights where only one bulb is lit up at a time, multibulb traffic lights may have multiple bulbs illuminated simultaneously within the same socket. Therefore, detecting a traffic light box is not enough; it is equally important to detect the individual light bulbs and interpret their semantic meaning accurately.

In the additonal image of multibulb traffic light, we also see some additional numbers here. They are countdown timer until the next color change. We see countdown timers for pedestrians quite often in north America, but these timers are meant for vehicles. If this piece of information is used correctly, they could be helpful for planning to improve the smoothness of the ride.

Traffic light for countdown timers

Countdown timers can take on a variety of forms and be presented in different ways. They could be standalone displays or integrated with the traffic light system. The format of the digits could vary, including the use or absence of leading zeros, and the fonts used could differ as well, with some being more artistic than others. Furthermore, there are even traffic lights designed in the style of a progress bar. This involves an animation where the progress bar gradually shortens before changing to a full progress bar of a different color. While this design may be considered the most innovative, it can also pose challenges for our perception engineers.

Traffic light for Left Turn Waiting Areas

Finally and here are the traffic lights dedicated to waiting areas, they can take on the form of an icon, or text. The icon ones are also typically involves an animation, with lights gradually lighting up to guide you to the waiting area. Text ones could be on LED display or traffic sign boards. For text, there is no standard pattern either, which requires Optical Character Recognition (OCR) and a bit of natural language processing to extract the semantic meaning.

XNGP: Xpeng’s Autonomous Driving Solution

Despite these challenges, our team at Xpeng have succeeded in delivering the best-in-class autonomous driving solution in China, as of 2023. The Surround Reality (SR) display shows only the information obtained from the onboard perception system. On the left, the display accurately detects road geometries, while on the right, it depicts a common scenario where a group of pedestrians is crossing the road. It is worth noting that the footage shown represents actual recordings from mass-produced Xpeng vehicles that are available on the market today, without any aftermarket modification.

If you are interested in how this was made possible, please refer to the other post on the deployment of XNet — an end-to-end perception stack in the BEV (birds-eye-view) paradigm. It is the key enabler for the above mass produced autonomous driving solution, and lays a solid foundation towards a fully end-to-end autonomous driving solution.

Takeaways

  • Driving in China is challenging due to ubiquitous dynamic objects, complex road topology, and extreme long-tail corner cases of traffic lights.
  • Traffic lights have many special types, each type with many corner cases. Sometimes multiple traffic lights need to be reasoned together, and also with nearby countdown timers or traffic signs.
  • Xpeng Motors has taken on these challenges head-on and delivered best-in-class mass production autonomous driving product in China, as of 2023.

References

  • Ma, Wanjing, et al. “Increasing the capacity of signalized intersections with left-turn waiting areas.” Transportation Research Part A: Policy and Practice 105 (2017): 181–196.
Autonomous Cars
Machine Learning
Deep Learning
Recommended from ReadMedium