A Novel Hybrid CNN-Mamba Framework with DySample-Enhanced YOLOv11 for Automated Pediatric Wrist Fracture Detection

Authors

    Jafar Tanha * Faculty of Electrical and Computer Engineering, University of Tabriz, Iran. tanha@tabrizu.ac.ir
    Mahdi Zarrin Faculty of Electrical and Computer Engineering, University of Tabriz, Iran.
    Haniyeh Nikkhah Faculty of Electrical and Computer Engineering, University of Tabriz, Iran.

Keywords:

Wrist Fractures Detection, Object Localization, CNN-Mamba Framework, Hybrid Deep Learning, Medical Imaging, Feature Fusion

Abstract

Wrist fractures, particularly distal radius and ulna fractures, are among the most common injuries in pediatric populations. Early and accurate detection of these injuries is critical for preventing long-term complications, yet interpreting pediatric wrist radiographs remains a challenging task due to the subtle nature of some abnormalities. In response to this challenge, we propose a novel hybrid framework for automated medical image detection, combining the strengths of convolutional neural networks (CNNs) and Mamba-based encoders to capture both local and global feature dependencies. To address the challenges in fusing features from these two distinct architectures, we introduce the Feature Aggregation Attention Module (FAAM), which dynamically combines the feature maps for more robust representation. Additionally, we enhance the YOLOv11 framework by replacing conventional upsampling in the neck with the Dysample technique, which improves feature propagation and refinement. We evaluate our method on the GRAZPEDWRI-DX dataset, a comprehensive collection of pediatric wrist trauma X-rays, demonstrating significant improvements in fracture detection. Our approach achieves an mAP@0.5 of 69.12% and an mAP@0.95 of 48.4%, showcasing its effectiveness in both general and challenging detection scenarios.

Downloads

Download data is not yet available.

Downloads

Published

2025-01-01

Submitted

2025-08-27

Revised

2025-10-25

Accepted

2025-12-19

How to Cite

Tanha, J., Zarrin, M., & Nikkhah, H. (2025). A Novel Hybrid CNN-Mamba Framework with DySample-Enhanced YOLOv11 for Automated Pediatric Wrist Fracture Detection. Journal of Artificial Intelligence, Applications and Innovations, 2(1), 11-30. https://aiaijournal.com/index.php/aiai/article/view/62