U-Net + Transformer Encoder and Lora Adaptation for Rural Sheep Segmentation
Resumo
This study presents a lightweight and efficient sheep segmentation method by combining U-Net with Transformerbased encoders and Low-Rank Adaptation (LoRA). RGB videos of 99 sheep were collected using a custom framework with Kinect V2, generating a set of images. Three variants of U-Net were evaluated under identical training conditions. The MiTB5+LoRA model achieved an average IoU of 95.0%±0.04 with a reduced number of trainable parameters. The results show that the proposed approach balances accuracy and efficiency, making it suitable for deployment in resource-limited rural environments.
Palavras-chave:
Training, Performance evaluation, Image segmentation, Adaptation models, Accuracy, Computational modeling, Computer architecture, Transformers, Real-time systems, Videos
Publicado
30/09/2025
Como Citar
MARQUES, Júlio V. M.; SANTOS, Willians S.; LUZ, Anthony I. M.; ARAÚJO, Rafael L.; SOUSA, Alcilene D. de; SARMENTO, Jose L. R.; SILVA, Romuere R. V..
U-Net + Transformer Encoder and Lora Adaptation for Rural Sheep Segmentation. In: CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI), 38. , 2025, Salvador/BA.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2025
.
p. 445-450.
