Transformer-Based Hybrid Architecture for Semantic Segmentation Using Multispectral Imagery in Precision Agriculture

Math and Computer Science
MSCS

Weed infestation continues to be a significant impediment to sustainable and efficient crop production. Precision agriculture aims to overcome such hurdles through reliable management processes that account for the complexity of site-specific knowledge and data on weeds. UAV-based multispectral imaging can capture bands beyond the visible spectrum including Near-Infrared and Red-Edge, can improve crop–weed differentiation moving from traditional management practices based on traditional RGB images. However, robust semantic segmentation using multispectral data is still challenging due to spectral variation, occlusions, and the heterogeneous nature of field settings.

This thesis proposes a hybrid CNN–Transformer segmentation framework tailored for multispectral crop–weed mapping. The model integrates modality-specific ConvNeXt encoders for spectral feature extraction, Swin Transformer blocks for global contextual reasoning, a gated Feature Pyramid Network (FPN) for adaptive multispectral fusion, and a Pyramid Pooling Module (PPM) for multi-scale decoding.

When evaluated on the WeedsGalore dataset, the proposed model achieved a mean Intersection-over-Union (mIoU) of 90.04%, a considerable improvement over conventional CNN-based and RGB-only baselines. Furthermore, zero-shot and few-shot fine-tuning studies on carrot and onion field datasets show that the proposed model has promising cross-domain generalization ability while learning from limited labeled examples. These findings highlight the potential for multispectral fused learning in conjunction with hybrid architectures to drive site-specific weed management, paving the way towards more scalable and sustainable agricultural practices.

Read the Full Report [PDF]

 

 

» Involved Students

» View More

» Document Viewer

Use Your Cell Phone as a Document Camera in Zoom

  • What you will need to have and do
  • Download the mobile Zoom app (either App Store or Google Play)
  • Have your phone plugged in
  • Set up video stand phone holder

From Computer

Log in and start your Zoom session with participants

From Phone

  • Start the Zoom session on your phone app (suggest setting your phone to “Do not disturb” since your phone screen will be seen in Zoom)
  • Type in the Meeting ID and Join
  • Do not use phone audio option to avoid feedback
  • Select “share content” and “screen” to share your cell phone’s screen in your Zoom session
  • Select “start broadcast” from Zoom app. The home screen of your cell phone is now being shared with your participants.

To use your cell phone as a makeshift document camera

  • Open (swipe to switch apps) and select the camera app on your phone
  • Start in photo mode and aim the camera at whatever materials you would like to share
  • This is where you will have to position what you want to share to get the best view – but you will see ‘how you are doing’ in the main Zoom session.