HCTNet: A Hybrid ConvNet-Transformer Network for Retinal Optical Coherence Tomography Image Classification

Overview

Journal Biosensors (Basel)

Specialty Biotechnology

Date 2022 Jul 27

PMID 35884345

Authors

Zongqing Ma

Qiaoxue Xie

Pinxue Xie

Fan Fan

Xinxiao Gao

Jiang Zhu

Affiliations

Soon will be listed here.

Abstract

Automatic and accurate optical coherence tomography (OCT) image classification is of great significance to computer-assisted diagnosis of retinal disease. In this study, we propose a hybrid ConvNet-Transformer network (HCTNet) and verify the feasibility of a Transformer-based method for retinal OCT image classification. The HCTNet first utilizes a low-level feature extraction module based on the residual dense block to generate low-level features for facilitating the network training. Then, two parallel branches of the Transformer and the ConvNet are designed to exploit the global and local context of the OCT images. Finally, a feature fusion module based on an adaptive re-weighting mechanism is employed to combine the extracted global and local features for predicting the category of OCT images in the testing datasets. The HCTNet combines the advantage of the convolutional neural network in extracting local features and the advantage of the vision Transformer in establishing long-range dependencies. A verification on two public retinal OCT datasets shows that our HCTNet method achieves an overall accuracy of 91.56% and 86.18%, respectively, outperforming the pure ViT and several ConvNet-based classification methods.

Citing Articles

Discriminative, generative artificial intelligence, and foundation models in retina imaging.

Ruamviboonsuk P, Arjkongharn N, Vongsa N, Pakaymaskul P, Kaothanthong N Taiwan J Ophthalmol. 2025; 14(4):473-485.

PMID: 39803410 PMC: 11717344. DOI: 10.4103/tjo.TJO-D-24-00064.

Multiscale attention-over-attention network for retinal disease recognition in OCT radiology images.

Alenezi A, Aloqalaa D, Singh S, Alrabiah R, Habib S, Islam M Front Med (Lausanne). 2024; 11:1499393.

PMID: 39582968 PMC: 11583944. DOI: 10.3389/fmed.2024.1499393.

L2NLF: a novel linear-to-nonlinear framework for multi-modal medical image registration.

Deng L, Zou Y, Yang X, Wang J, Huang S Biomed Eng Lett. 2024; 14(3):497-509.

PMID: 38645595 PMC: 11026354. DOI: 10.1007/s13534-023-00344-1.

Multi-Scale-Denoising Residual Convolutional Network for Retinal Disease Classification Using OCT.

Peng J, Lu J, Zhuo J, Li P Sensors (Basel). 2024; 24(1).

PMID: 38203011 PMC: 10781341. DOI: 10.3390/s24010150.

Vision transformers: The next frontier for deep learning-based ophthalmic image analysis.

Wu J, Koseoglu N, Jones C, Liu T Saudi J Ophthalmol. 2023; 37(3):173-178.

PMID: 38074310 PMC: 10701151. DOI: 10.4103/sjopt.sjopt_91_23.

References

Hussain M, Bhuiyan A, Luu C, Smith R, Guymer R, Ishikawa H . Classification of healthy and diseased retina using SD-OCT imaging and Random Forest algorithm. PLoS One. 2018; 13(6):e0198281. PMC: 5986153. DOI: 10.1371/journal.pone.0198281. View

Venhuizen F, van Ginneken B, van Asten F, van Grinsven M, Fauser S, Hoyng C . Automated Staging of Age-Related Macular Degeneration Using Optical Coherence Tomography. Invest Ophthalmol Vis Sci. 2017; 58(4):2318-2328. DOI: 10.1167/iovs.16-20541. View

Hwang D, Hsu C, Chang K, Chao D, Sun C, Jheng Y . Artificial intelligence-based decision-making for age-related macular degeneration. Theranostics. 2019; 9(1):232-245. PMC: 6332801. DOI: 10.7150/thno.28447. View

Xu Z, Wang W, Yang J, Zhao J, Ding D, He F . Automated diagnoses of age-related macular degeneration and polypoidal choroidal vasculopathy using bi-modal deep convolutional neural networks. Br J Ophthalmol. 2020; 105(4):561-566. DOI: 10.1136/bjophthalmol-2020-315817. View

Srinivasan P, Kim L, Mettu P, Cousins S, Comer G, Izatt J . Fully automated detection of diabetic macular edema and dry age-related macular degeneration from optical coherence tomography images. Biomed Opt Express. 2014; 5(10):3568-77. PMC: 4206325. DOI: 10.1364/BOE.5.003568. View

Sun Y, Li S, Sun Z . Fully automated macular pathology detection in retina optical coherence tomography images using sparse coding and dictionary learning. J Biomed Opt. 2017; 22(1):16012. DOI: 10.1117/1.JBO.22.1.016012. View

. Retinal Diseases and VISION 2020. Community Eye Health. 2007; 16(46):19-20. PMC: 1705858. View

Lim L, Mitchell P, Seddon J, Holz F, Wong T . Age-related macular degeneration. Lancet. 2012; 379(9827):1728-38. DOI: 10.1016/S0140-6736(12)60282-7. View

Fang L, Wang C, Li S, Rabbani H, Chen X, Liu Z . Attention to Lesion: Lesion-Aware Convolutional Neural Network for Retinal Optical Coherence Tomography Image Classification. IEEE Trans Med Imaging. 2019; 38(8):1959-1970. DOI: 10.1109/TMI.2019.2898414. View

10.

Tsuji T, Hirose Y, Fujimori K, Hirose T, Oyama A, Saikawa Y . Classification of optical coherence tomography images using a capsule network. BMC Ophthalmol. 2020; 20(1):114. PMC: 7082944. DOI: 10.1186/s12886-020-01382-4. View

11.

He X, Deng Y, Fang L, Peng Q . Multi-Modal Retinal Image Classification With Modality-Specific Attention Network. IEEE Trans Med Imaging. 2021; 40(6):1591-1602. DOI: 10.1109/TMI.2021.3059956. View

12.

Rasti R, Rabbani H, Mehridehnavi A, Hajizadeh F . Macular OCT Classification Using a Multi-Scale Convolutional Neural Network Ensemble. IEEE Trans Med Imaging. 2018; 37(4):1024-1034. DOI: 10.1109/TMI.2017.2780115. View

13.

Yoo T, Choi J, Seo J, Ramasubramanian B, Selvaperumal S, Kim D . The possibility of the combination of OCT and fundus images for improving the diagnostic accuracy of deep learning for age-related macular degeneration: a preliminary experiment. Med Biol Eng Comput. 2018; 57(3):677-687. DOI: 10.1007/s11517-018-1915-z. View

14.

Lemaitre G, Rastgoo M, Massich J, Cheung C, Wong T, Lamoureux E . Classification of SD-OCT Volumes Using Local Binary Patterns: Experimental Validation for DME Detection. J Ophthalmol. 2016; 2016:3298606. PMC: 4983398. DOI: 10.1155/2016/3298606. View

15.

Attebo K, Mitchell P, Cumming R, Smith W . Knowledge and beliefs about common eye diseases. Aust N Z J Ophthalmol. 1997; 25(4):283-7. DOI: 10.1111/j.1442-9071.1997.tb01516.x. View

16.

Huang D, Swanson E, Lin C, Schuman J, Stinson W, Chang W . Optical coherence tomography. Science. 1991; 254(5035):1178-81. PMC: 4638169. DOI: 10.1126/science.1957169. View

17.

Lee C, Baughman D, Lee A . Deep learning is effective for the classification of OCT images of normal versus Age-related Macular Degeneration. Ophthalmol Retina. 2019; 1(4):322-327. PMC: 6347658. DOI: 10.1016/j.oret.2016.12.009. View

18.

Thomas A, Harikrishnan P, Ramachandran R, Ramachandran S, Manoj R, Palanisamy P . A novel multiscale and multipath convolutional neural network based age-related macular degeneration detection using OCT images. Comput Methods Programs Biomed. 2021; 209:106294. DOI: 10.1016/j.cmpb.2021.106294. View

19.

Karri S, Chakraborty D, Chatterjee J . Transfer learning based classification of optical coherence tomography images with diabetic macular edema and dry age-related macular degeneration. Biomed Opt Express. 2017; 8(2):579-592. PMC: 5330546. DOI: 10.1364/BOE.8.000579. View

20.

Saha S, Nassisi M, Wang M, Lindenberg S, Kanagasingam Y, Sadda S . Automated detection and classification of early AMD biomarkers using deep learning. Sci Rep. 2019; 9(1):10990. PMC: 6662691. DOI: 10.1038/s41598-019-47390-3. View