Abstract
U-Net performs well on small-scale datasets because its skip connections merge low-level and high-level features, and it has been widely used in biomedical image segmentation and, more recently, in microstructure image segmentation of materials. To further improve prediction accuracy, three representative visual attention modules, namely squeeze-and-excitation networks, the convolutional block attention module, and the extended calibration algorithm, were introduced into the traditional U-Net architecture. Compared with the original U-Net, the improved architectures achieved significantly better evaluation indices for microstructure segmentation of steels with ferrite/martensite composite microstructure, pearlite/ferrite composite microstructure, and complex martensite/austenite island/bainite microstructure, which demonstrates the advantage of visual attention mechanisms in microstructure segmentation. The reasons for the accuracy improvement are discussed on the basis of feature map analysis.