Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Overview
Medical Informatics
Authors
Affiliations
State-of-the-art object detection networks depend on region proposal algorithms to hypothesize object locations. Advances like SPPnet [1] and Fast R-CNN [2] have reduced the running time of these detection networks, exposing region proposal computation as a bottleneck. In this work, we introduce a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals. An RPN is a fully convolutional network that simultaneously predicts object bounds and objectness scores at each position. The RPN is trained end-to-end to generate high-quality region proposals, which are used by Fast R-CNN for detection. We further merge RPN and Fast R-CNN into a single network by sharing their convolutional features-using the recently popular terminology of neural networks with 'attention' mechanisms, the RPN component tells the unified network where to look. For the very deep VGG-16 model [3] , our detection system has a frame rate of 5 fps (including all steps) on a GPU, while achieving state-of-the-art object detection accuracy on PASCAL VOC 2007, 2012, and MS COCO datasets with only 300 proposals per image. In ILSVRC and COCO 2015 competitions, Faster R-CNN and RPN are the foundations of the 1st-place winning entries in several tracks. Code has been made publicly available.
Sattari M, Zonouri S, Salimi A, Izadi S, Rezaei A, Ghezelbash Z Sci Rep. 2025; 15(1):8721.
PMID: 40082561 PMC: 11906767. DOI: 10.1038/s41598-025-92423-9.
Improvement of RT-DETR model for ground glass pulmonary nodule detection.
Tang S, Bao Q, Ji Q, Wang T, Wang N, Yang M PLoS One. 2025; 20(3):e0317114.
PMID: 40067875 PMC: 11896049. DOI: 10.1371/journal.pone.0317114.
IMRMB-Net: A lightweight student behavior recognition model for complex classroom scenarios.
Feng C, Luo Z, Kong D, Ding Y, Liu J PLoS One. 2025; 20(3):e0318817.
PMID: 40063594 PMC: 11892879. DOI: 10.1371/journal.pone.0318817.
Novel cross-dimensional coarse-fine-grained complementary network for image-text matching.
Liu M, Khairuddin A, Hasikin K, Liu W PeerJ Comput Sci. 2025; 11:e2725.
PMID: 40062258 PMC: 11888920. DOI: 10.7717/peerj-cs.2725.
YOLOv8-DEE: a high-precision model for printed circuit board defect detection.
Yi F, Mohamed A, Noor M, Che Ani F, Zolkefli Z PeerJ Comput Sci. 2025; 10:e2548.
PMID: 40061243 PMC: 11888845. DOI: 10.7717/peerj-cs.2548.