Wheat lodging is one of the key isuess threatening stable and high yields. Lodging detection technology based on deep learning generally limited to identifying lodging at a single growth stage of wheat, while lodging may occur at various stages of the growth cycle. Moreover, the morphological characteristics of lodging vary significantly as the growth period progresses, posing a challenge to the feature capturing ability of deep learning models. The aim is exploring a deep learning-based method for detecting wheat lodging boundaries across multiple growth stages to achieve automatic and accurate monitoring of wheat lodging.
A model called Lodging2Former was proposed, which integrates the innovative hierarchical interactive feature pyramid network (HI-FPN) on top of the advanced segmentation model Mask2Former. The key focus of this network design lies in enhancing the fusion and interaction between feature maps at adjacent hierarchical levels, enabling the model to effectively integrate feature information at different scales. Building upon this, even in complex field backgrounds, the Lodging2Former model significantly enhances the recognition and capturing capabilities of wheat lodging features at multiple growth stages.
The Lodging2Former model demonstrated superiority in mean average precision (mAP) compared to several mainstream algorithms such as mask region-based convolutional neural network (Mask R-CNN), segmenting objects by locations (SOLOv2), and Mask2Former. When applied to the scenario of detecting lodging in mixed growth stage wheat, the model achieved mAP values of 79.5%, 40.2%, and 43.4% at thresholds of 0.5, 0.75, and 0.5 to 0.95, respectively. Compared to Mask2Former, the performance of the improved model was enhanced by 1.3% to 4.3%. Compared to SOLOv2, a growth of 9.9% to 30.7% in mAP was achieved; and compared to the classic Mask R-CNN, a significant improvement of 24.2% to 26.4% was obtained. Furthermore, regardless of the IoU threshold standard, the Lodging2Former exhibited the best detection performance, demonstrating good robustness and adaptability in the face of potential influencing factors such as field environment changes.
The experimental results indicated that the proposed HI-FPN network could effectively utilize contextual semantics and detailed information in images. By extracting rich multi-scale features, it enabled the Lodging2Former model to more accurately detect lodging areas of wheat across different growth stages, confirming the potential and value of HI-FPN in detecting lodging in multi-growth-stage wheat.