1000/1000
Hot
Most Recent
This video is adapted from 10.3390/buildings14030578
Street view imagery (SVI) is a rich source of information for architectural and urban analysis using computer vision techniques. Still, its integration with other building-level data sources requires an additional step of visual building identification. This step is particularly challenging in architecturally homogeneous, dense residential streets featuring narrow buildings, due to a combination of SVI geolocation errors and occlusions that significantly increase the risk of confusing a building with its neighboring buildings.
This video introduces a robust deep learning-based method to identify buildings across multiple street views taken at different angles and times. It uses global optimization to correct the position and orientation of street view panoramas relative to their surrounding building footprints. Evaluating the method on a dataset of 2000 street views shows that its identification accuracy (88%) outperforms previous deep learning-based methods (79%), while methods solely relying on geometric parameters correctly show the intended building less than 50% of the time.
These results indicate that previous identification methods lack robustness to panorama pose errors when buildings are narrow, densely packed, and subject to occlusions. Collecting multiple views per building can be leveraged to increase the robustness of visual identification by ensuring that building views are consistent.