Decentralized Multi-UAV Cooperative Exploration

Please note this is a comparison between Version 1 by Tianyou Yu and Version 2 by Dean Liu.

Efficient exploration is a critical problem for UAV swarms and has attracted substantial research interest because of its applications in search and rescue missions. In this study, we propose a cooperative exploration approach that uses multiple unmanned aerial vehicles (UAVs).

path planning
collaborative exploration
area partition
swarm UAVs

1. Multi-UAV Coordination for Exploration

Using multiple UAVs to increase exploration efficiency is a common practice, and related issues have been extensively studied ^[1][16]. One classic method involves maximizing overall utility while minimizing the potential overlap of measurements among UAVs ^[2][8]. This idea has been employed in many works such as ^[3][4][5][10,11,17]. However, as the number of UAVs increases, uncertainty ^[6][18] and redundant scanning between them become more prevalent, especially in larger environments where the sensor range is relatively small compared to the scale of the environment.

In conventional multi-agent allocation problems, a TSP-greedy allocation (TSGA) planner that assumes an ideal centralized architecture and ideal communication is used to optimize global utility ^[7][19]. Because this approach considers the entire global task, collecting all tasks at the central node can be time-consuming. Alternatively, a dynamic Voronoi partition has been used in ^[3][8][7,10] to assign different target locations to individual UAVs, which keeps the UAVs separated from one another. However, this partition-based method may not always be optimal because it does not consider the exploration progress of each UAV, which results in less efficient task allocation.

Therefore, in this paper, a dynamic centroid-based area partition is proposed, which considers the exploration process of each UAV for more reasonable task allocation. When a UAV has an insufficient number of candidates, it will be assigned a larger partitioned area to explore. The partition is processed dynamically to adapt to changing situations.
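The idea of a centroid-based partition that grows the area of a UAV with few candidates can be illustrated with a minimal sketch. This is not the authors' implementation: the 2D grid of cells, the function names `partition_cells` and `update_centroids`, and the weighting rule `1 + candidate_count` are all assumptions chosen for illustration. Each cell is assigned to the UAV whose virtual centroid has the smallest weighted distance, so a UAV holding fewer candidates claims a larger region.

```python
import numpy as np


def partition_cells(cells, centroids, candidate_counts):
    """Assign each unexplored cell to one UAV (hypothetical weighting rule).

    cells: (N, 2) cell centres still to be explored.
    centroids: (K, 2) one virtual centroid per UAV.
    candidate_counts: (K,) number of viewpoint candidates each UAV holds.
    Returns an (N,) array of UAV indices.
    """
    cells = np.asarray(cells, dtype=float)
    centroids = np.asarray(centroids, dtype=float)
    # Assumed rule: fewer candidates -> smaller weight -> shorter effective
    # distance -> larger share of the area.
    weights = 1.0 + np.asarray(candidate_counts, dtype=float)
    dist = np.linalg.norm(cells[:, None, :] - centroids[None, :, :], axis=2)
    return np.argmin(dist * weights[None, :], axis=1)


def update_centroids(cells, labels, centroids):
    """Move each centroid to the mean of its assigned cells (kept if none)."""
    cells = np.asarray(cells, dtype=float)
    new_centroids = np.array(centroids, dtype=float)
    for k in range(len(new_centroids)):
        mask = labels == k
        if mask.any():
            new_centroids[k] = cells[mask].mean(axis=0)
    return new_centroids
```

Calling `partition_cells` and `update_centroids` repeatedly as the set of unexplored cells shrinks gives a partition that adapts over time; with equal candidate counts the assignment reduces to an ordinary Voronoi partition.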

2. Multi-UAV Mapping for Exploration

To perform target selection and quantitative calculation in planning, a map is needed that depicts the environment and the areas that remain to be explored. Two representative volumetric mapping methods used in UAV exploration are the truncated signed distance field (TSDF) ^[9][20] and the occupancy map ^[10][14]. In multi-UAV mapping, the key issue is often map merging ^[11][21]. In previous works such as ^[2][5][8,17], each UAV maintains its own local map and corrects odometry errors while exploring. The UAVs then transmit their local maps, together with uncertainty information, to a central workstation that combines them into a global map for further optimization. In ^[12][22], sensor messages are shared among UAVs, and Gaussian mixture models (GMMs) are adopted to assist the exploration planner of each UAV. In ^[4][11], two maps are used: a low-resolution map for navigation and a high-resolution map for reconstruction. To achieve efficient coordination in a decentralized method, it is crucial to share global map information among UAVs as quickly as possible. This is one of the central issues that we address in this paper.
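To make the map-merging step concrete, the following sketch fuses local occupancy grids by summing their log-odds, a standard technique for occupancy maps. It is an illustration only and not the merging scheme of any cited work. It assumes that all grids are already aligned in a common frame, that the prior occupancy is 0.5, and that the observations are independent; the function names are hypothetical.

```python
import numpy as np


def to_log_odds(p):
    """Convert occupancy probabilities to log-odds, clipped for stability."""
    p = np.clip(np.asarray(p, dtype=float), 1e-6, 1.0 - 1e-6)
    return np.log(p / (1.0 - p))


def merge_occupancy(local_maps):
    """Fuse aligned occupancy grids by summing log-odds (prior 0.5 assumed).

    Cells a UAV has not observed hold 0.5, i.e. zero log-odds, so they do
    not change what another UAV has measured there.
    """
    total = sum(to_log_odds(m) for m in local_maps)
    return 1.0 / (1.0 + np.exp(-total))
```

With this rule, two UAVs that both report a cell as probably free reinforce each other, while a cell seen by only one UAV keeps that UAV's estimate.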

3. Exploration in Unknown Environments

Fully functional UAVs possess autonomous sensing and computing capabilities, and the exploration planner is what enables them to perform tasks independently in unknown environments. Existing works on exploration in unknown environments fall into two categories: frontier-based methods ^[13][14][15][23,24,25] and sampling-based methods ^[4][16][17][18][11,13,26,27]. Given frontier clusters ^[19][28] or sampled viewpoints ^[18][27], an information-theoretic measure is optimized to calculate the information gain, which reduces map uncertainty. The frontier-based method explicitly computes the boundary between the known and unknown areas and assigns UAVs to frontiers iteratively, but frontier selection can be time-consuming because it traverses all surface voxels in a large environment ^[13][23]. Some methods reject unsuitable frontiers during selection ^[20][1] to ease the computational burden. The sampling-based method, in contrast, randomly selects viewpoints in free areas, for example with the rapidly-exploring random tree (RRT) ^[16][13] or the probabilistic roadmap planner (PRM) ^[21][3], which deliver speed and probabilistic completeness. However, both methods can converge to local optima.
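The notion of a frontier as the boundary between known and unknown areas can be shown with a short sketch on a 2D occupancy grid. The cell encoding (`FREE`, `OCCUPIED`, `UNKNOWN`), the 4-neighbour rule, and the function name are assumptions made for illustration; the cited works operate on 3D volumetric maps and typically also cluster the frontier cells.

```python
import numpy as np

# Assumed cell encoding for this sketch.
FREE, OCCUPIED, UNKNOWN = 0, 1, -1


def find_frontiers(grid):
    """Return a boolean mask of free cells that touch an unknown cell.

    Uses 4-connectivity. Cells outside the map are treated as known, so
    the map border alone never creates a frontier.
    """
    grid = np.asarray(grid)
    unknown = grid == UNKNOWN
    padded = np.pad(unknown, 1, constant_values=False)
    near_unknown = (
        padded[:-2, 1:-1]    # neighbour above
        | padded[2:, 1:-1]   # neighbour below
        | padded[1:-1, :-2]  # neighbour to the left
        | padded[1:-1, 2:]   # neighbour to the right
    )
    return (grid == FREE) & near_unknown
```

Because the whole grid is scanned on every call, the cost grows with map size, which reflects the computational burden of frontier selection in large environments noted above.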

The two categories above have been widely used in exploration planning for a single UAV. For multi-UAV exploration, however, a coordination module is required to prevent collisions and redundant coverage. The next-best-view (NBV) method ^[22][12] is commonly used in such scenarios. This method iteratively selects viewpoints in free space to refresh the candidate paths, ensuring a consistent update rate. Our proposed method follows this approach by integrating the strengths of the sampling-based method. This enables frequent resampling of viewpoints to avoid collisions and facilitates flexible collaboration between UAVs.
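A single NBV iteration can be sketched as follows: sample viewpoints in free space, score each one, and move toward the best. This is a simplified single-UAV illustration and not the planner proposed here. The 2D grid (0 for free, -1 for unknown), the gain that counts unknown cells within a sensor radius while ignoring occlusion, the exponential distance discount with parameter `lam`, and all function names are assumptions.

```python
import numpy as np


def information_gain(grid, viewpoint, sensor_range):
    """Count unknown cells (-1) within sensor_range; occlusion is ignored."""
    rows, cols = np.indices(grid.shape)
    dist = np.hypot(rows - viewpoint[0], cols - viewpoint[1])
    return int(np.sum((grid == -1) & (dist <= sensor_range)))


def next_best_view(grid, position, n_samples=50, sensor_range=2.0,
                   lam=0.2, rng=None):
    """Sample free cells and return the one with the best discounted gain.

    Returns None when no sampled viewpoint would reveal any unknown cell.
    """
    rng = np.random.default_rng() if rng is None else rng
    free_cells = np.argwhere(grid == 0)
    if len(free_cells) == 0:
        return None
    n = min(n_samples, len(free_cells))
    picks = rng.choice(len(free_cells), size=n, replace=False)
    best, best_score = None, 0.0
    for cell in free_cells[picks]:
        gain = information_gain(grid, cell, sensor_range)
        cost = np.linalg.norm(cell - np.asarray(position, dtype=float))
        score = gain * np.exp(-lam * cost)  # assumed discount on travel cost
        if score > best_score:
            best, best_score = tuple(int(c) for c in cell), score
    return best
```

Re-running the selection after each map update yields the iterative viewpoint refresh described above. Collision avoidance and coordination between UAVs are deliberately left out of this sketch.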

4. Evaluation in Practical Experiments

To further validate the proposed method, practical indoor experiments were conducted with three self-assembled UAVs equipped with depth cameras flying in a room with obstacles, as shown in Figure 1b. A $10 \times 8 \times 3\ \mathrm{m}^{3}$ bounding box was used to constrain the exploration space. Due to the UAV structure, the cameras could only be mounted on the front side with a downward pitch angle of $5^{\circ}$, and the UAVs’ precise locations were provided by VICON, a motion capture system, for safe piloting. Parameter values for the practical experiments were set based on the simulation experiments conducted for the indoor scenario. Although limitations such as hardware restrictions, network bandwidth, and flight trajectory control were not the primary focus of this paper, multiple trials were carried out to confirm the proposed method’s usability.

Figure 1. The practical experiments. (a) shows the initial status of the three UAVs, which are placed on the same side of a room. (b) shows the three UAVs performing exploration in one trial; a $10 \times 8 \times 3\ \mathrm{m}^{3}$ virtual boundary bounds the exploration space.

The proposed algorithm was tested in a practical experiment involving a team of three UAVs, as shown in Figure 1. The UAVs were initially positioned close together on the same side of the exploration area, which is typical for real deployments. The exploration process was repeated 20 times, with a maximum exploration time of 232.2 s, a minimum of 194.6 s, and an average of 209.4 s. The decentralized nature of the planner ensures that the UAVs can perform their tasks robustly, with an interruption to one UAV having no impact on the work of the others, as demonstrated in Figure 2. The effectiveness and usability of the proposed method in a practical scenario are demonstrated by the exploration maps at six different sampling times, as shown in Figure 3. The virtual centroids, shown in three colors, change dynamically during the exploration process, and the working area of each UAV is partitioned reasonably and iteratively. The UAV shown in yellow on the left moves gradually to the lower-left area after completing its task in the upper-left corner and collaborates with the UAV in the lower-right section to adjust the task areas. In the final map of Figure 3, gaps remain on the ground; these are attributable to the limited range of the depth camera. We can assume that using more robust sensors such as 3D LiDAR could alleviate this phenomenon, but such an approach requires greater consideration of the comprehensiveness of the experimental system and its applicability to different settings, which is left to future research.

Figure 2. Robust coordination case. (a) shows that the UAV represented by the red arrow has stopped exploring at an early stage due to insufficient power; (b) shows that the other UAVs continue and finish the task. The yellow UAV helps to explore the bottom-right corner of the environment in place of the red one.

Figure 3. The mapping process of one practical experiment. The sampling times display the complete process of three UAVs collaborating on exploration, with each UAV’s designated area being continuously updated. The UAV represented by the yellow icon on the left gradually moves towards the lower region, collaborating with the other UAVs to adjust the exploration area. In the final map, gaps on the ground were influenced by the depth camera’s perception range. It is assumed that using more powerful sensors such as 3D LiDAR ^[23][29] may mitigate this phenomenon, but this approach necessitates further consideration of the experimental system’s applicability.