With the development of remote sensing technology, remote sensing images of buildings are of great significance in urban planning, disaster response, and other directions. When we use a neural network containing batch normalization layers for semantic segmentation, the neural network is sensitive to batch size and has low segmentation accuracy for occluded and dense buildings. This paper proposes a method for building segmentation in remote sensing images based on Nested UNet (UNet++) deep neural network. First, the UNet++ network is used to extract features, and the Group Normalization (GN) method is used instead of Batch Normalization (BN) to alleviate the model's sensitivity to batch size. Then, the weighted combination of Cross-Entropy Loss (CELoss) and DiceLoss is used as the loss function to improve the feature extraction ability of the neural network for unbalanced buildings. Finally, experiments are carried out on the WHUBuilding dataset. The experimental results show that the improved model (UNet++-GN) improves Mean Intersection over Union (MIoU) and Mean Pixel Accuracy (Macc) by 12.16% and 2.92%, respectively, compared with the original model (UNet++-BN).
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.