Introduction

Thoracic radiographs are part of the routine clinical evaluation of patients with confirmed or suspected thoracic pathology in both human and veterinary medicine. Nevertheless, interpreting thoracic radiographs is a challenging and error-prone task for medical doctors1,2 and veterinary practitioners alike3. In human medicine, despite efforts to improve radiology residents’ training programmes, the prevalence of interpretation errors has not decreased significantly in recent decades1,2. The prevalence and impact of interpretation errors on thoracic radiographs have seldom been investigated in veterinary medicine4. Conversely, this topic has been widely studied in human medicine, and the most common causes of interpretation errors have been identified5,6,7. Different strategies to reduce interpretation errors have been proposed in both human1,8 and veterinary medicine3; among these is the use of computer-aided detection (CAD) tools to support the practitioner in everyday practice6,9.

The high performance shown by deep learning algorithms in several radiology-related tasks has driven very active research in this field, with an increasing number of publications10. In particular, deep learning algorithms for the detection of specific pathologies or conditions such as pneumothorax11, pneumonia12, malignant nodules13 and COVID-1914 have been proposed. In addition, broader applications of these algorithms, such as automatic triaging15 and automatic labelling of chest radiographs16, have been investigated. Furthermore, several artificial intelligence-based products for the automatic detection of specific conditions, both on plain radiographs and on computed tomographic images, have been approved by the Food and Drug Administration in the last few years, thereafter becoming commercially available.

To date, the possibilities offered by deep learning in veterinary medicine have been investigated for the classification of magnetic resonance images17,18, for the detection of liver degeneration from ultrasound images19, and for the automatic classification of corneal lesions from photographs20. Multi-label algorithms allow for the detection of different objects (in our case, lesions) on the same image: in multi-label training, each image is annotated with multiple labels according to the lesions evident on the radiograph21. To the best of the authors’ knowledge, both in human11,12,22 and in veterinary medicine22,23, most studies applying convolutional neural networks (CNNs) to thoracic radiographs have focused on detecting individual pathologies or conditions, whereas studies using a multi-label approach are relatively scarce in the human medical literature16,21,24,25, and the use of multi-label algorithms on canine thoracic radiographs has not yet been explored. Therefore, the aims of this study are: (1) to develop a multi-label deep learning-based network capable of detecting some of the most common lesions found on plain radiographs of the canine thorax; (2) to test the generalization ability of the developed algorithm on an external data set of radiographs.
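As a minimal illustration of the multi-label setting (a sketch with an illustrative subset of findings, not the labelling scheme or software used in this study), each radiograph can be encoded as a multi-hot binary vector with one position per finding:

```python
# Minimal sketch of multi-label (multi-hot) annotation: one binary indicator
# per finding, so a single radiograph can carry several labels at once.
FINDINGS = ["cardiomegaly", "alveolar pattern", "pleural effusion"]  # illustrative subset

def to_multi_hot(present: set[str]) -> list[int]:
    """Encode the set of findings seen on one radiograph as a 0/1 vector."""
    return [1 if finding in present else 0 for finding in FINDINGS]

# A radiograph showing both cardiomegaly and pleural effusion:
print(to_multi_hot({"cardiomegaly", "pleural effusion"}))  # [1, 0, 1]
```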

Results

Database

The complete database was composed of 3839 latero-lateral (LL) radiographs. Data Set 1 comprised 3063 LL images; 632 LL images were discarded due to incorrect positioning or poor image quality. Data Set 2 comprised 776 LL images; 77 LL radiographs were excluded because of positioning errors or poor image quality. In both data sets, “unremarkable” and “cardiomegaly” were the two most represented findings. The distribution of the different radiographic findings was uneven between the two data sets, with some findings over-represented and some under-represented in Data Set 2 compared to Data Set 1.

Table 1 Number of LL radiographs showing each of the included radiographic findings.

Selection of the radiographic findings

Only a limited number of radiographs showing tracheal collapse, hernia, fracture, and pneumomediastinum were available in Data Set 1 (Table 1); therefore, these radiographic findings were excluded from training. Thus, the radiographic findings used to train the network were: unremarkable, cardiomegaly, alveolar pattern, bronchial pattern, interstitial pattern, mass, pleural effusion, pneumothorax, and megaoesophagus.

Classification results

ResNet-50 had a higher classification accuracy than DenseNet-121, both on Data Set 1 and on Data Set 2, for all the considered radiographic findings except pleural effusion. The classification accuracy of the two architectures on Data Set 1 and Data Set 2 is reported in Tables 2 and 3. For some radiographic findings, the classification accuracy of both ResNet-50 and DenseNet-121 was higher on Data Set 2 than on Data Set 1. In particular, both architectures showed a higher accuracy on Data Set 2 than on Data Set 1 for alveolar pattern. Furthermore, DenseNet-121 also showed higher accuracy on Data Set 2 than on Data Set 1 for bronchial pattern, cardiomegaly, megaoesophagus, unremarkable, and pneumothorax. For the remaining radiographic findings, accuracy on Data Set 2 was lower than on Data Set 1. Statistically significant differences in accuracy on Data Set 2 (generalization accuracy) between ResNet-50 and DenseNet-121 were evident for: (1) alveolar pattern (Z = 3.813, P = 0.0001); (2) interstitial pattern (Z = 3.283, P = 0.0010); (3) megaoesophagus (Z = 2.257, P = 0.0240); (4) pneumothorax (Z = 3.314, P = 0.0009). No differences were evident for: cardiomegaly (Z = 0.800, P = 0.427); mass (Z = 1.580, P = 0.1142); unremarkable (Z = 0.817, P = 0.4137); pleural effusion (Z = 0.347, P = 0.7286). A graphical representation of the classification results of the model is reported in Fig. 1.

Table 2 Performance of ResNet-50 on Data Set 1 and Data Set 2 (95% CIs in parentheses).
Table 3 Performance of DenseNet-121 on Data Set 1 and Data Set 2 (95% CIs in parentheses).
Figure 1

Visual assessment of the ResNet-50 classification results for a radiograph of a dog showing an alveolar pattern in the cranial lung lobe. The activations of the last layer are visualized superimposed on the radiographs; each image corresponds to the activations for a specific radiographic finding. The alveolar pattern was correctly identified by the model (B); however, the model also falsely identified the presence of a mass (E). (A) Original image, (B) alveolar pattern, (C) bronchial pattern, (D) cardiomegaly, (E) mass, (F) interstitial pattern, (G) pleural effusion, (H) pneumothorax, (I) unremarkable.

Discussion

A new deep learning-based multi-label classification method for the automatic detection of several radiographic findings in canine thoracic radiographs is proposed. The high classification accuracy shown by both tested architectures on Data Set 2, for almost all the radiographic findings, suggests that multi-label CNNs can be successfully trained even on relatively small and highly unbalanced databases. On the other hand, differences in the classification of several radiographic findings between the veterinary and the human medical literature make comparison with similar studies21,25 not entirely straightforward. Moreover, some of the radiographic findings that are common in humans (e.g. emphysema, fibrosis) are rarely found in dogs. Nonetheless, a direct comparison between human and veterinary results is feasible for some radiographic findings, such as cardiomegaly, pleural effusion, pneumothorax, consolidation (labelled “alveolar pattern” in this study), and unremarkable21,25. Interestingly, for all the above-mentioned radiographic findings, the AUC of the developed CNN was similar to or higher than that reported in similar studies on humans21,25, both for Data Set 1 and for Data Set 2.

Another interesting aspect of this research relates to the large variability in body size and shape typical of the dog, which directly translates into a wide range of normality in the radiographic appearance of the canine thorax. Indeed, the dog is the only known species with a 50-fold variability in size among individuals; it is therefore easy to appreciate that the radiographic appearance of the thorax of, for example, a bulldog, a dachshund, or a German shepherd is very different. Despite such variability, the developed CNN was able to detect most of the included radiographic findings with an accuracy ranging from moderate to very good. In particular, ResNet-50 displayed an AUC above 0.8 in the detection of alveolar pattern, cardiomegaly, megaoesophagus, pleural effusion, and pneumothorax. In addition, it showed high accuracy in identifying normal radiographs (labelled “unremarkable”); interestingly, in similar experiments in humans, the accuracy in identifying radiologically normal images was lower25. Conversely, accuracy was below 0.8 for bronchial pattern, interstitial pattern, and mass. It is the authors’ opinion that the limited generalization ability shown by ResNet-50 in the detection of bronchial and interstitial patterns might be related to the difference in image quality of the original DICOM images between Data Set 1 and Data Set 2: the radiographs acquired using the CR system had a lower image quality than those acquired through the DR system. Another possible explanation is that bronchial and interstitial patterns were not assessed on VD images. The low accuracy in the detection of masses shown by both ResNet-50 and DenseNet-121, on Data Set 1 and Data Set 2 alike, could be related to the inability of the network to consider orthogonal views simultaneously, and to the fact that several mass-like structures (for example, nipples, degeneration of the costochondral joints in older animals, or pleural mineralizations) are often present in normal radiographs. Interestingly, in the experiments by Wang et al. 201724 and Yao et al. 201826, accuracy in detecting masses and nodules in humans was also low (AUC below 0.8). The developed CNN had variable performance in the detection of the different lesions; therefore, results obtained with the current version of the CNN should be confirmed with other methods (e.g. interpretation by a radiologist, computed tomography, magnetic resonance imaging) before clinical decisions are based on them.

ResNet-50 and DenseNet-121 are the two most commonly used pre-trained CNNs for multi-label chest X-ray image classification21,24,26. In this study, ResNet-50 showed a significantly higher generalization ability than DenseNet-121 in the detection of alveolar pattern, interstitial pattern, megaoesophagus, and pneumothorax, whereas no differences were evident for cardiomegaly, mass, unremarkable, and pleural effusion. In previous human studies, these two architectures demonstrated variable accuracy in the detection of radiographic lesions, with ResNet-50 performing better than DenseNet-121 for some lesions and vice versa21. Furthermore, in some studies, both ResNet-50 and DenseNet-121 were used as backbones for category-wise residual operations and attention-based mechanisms21; incorporating such modules within the network is reported to increase the average AUC21. These modules were not included in the present study, mainly because of the limited data set size and the highly imbalanced lesion distribution.

Models trained on a specific data set do not always achieve comparable performance when tested on data sets from a different institution; accuracy increases if data sets acquired from multiple institutions are used for training27. A limitation of this study is that both data sets were acquired at the same institution, and a data set from an external veterinary clinic was not available. However, to take cross-centre generalization into account, Data Set 1 and Data Set 2 (used for training and testing, respectively) were acquired using two different radiograph acquisition systems. Further studies, possibly including radiographs acquired at multiple veterinary clinics, could help clarify the generalization performance of the developed CNN. Furthermore, it is also possible that the exclusion of incorrectly positioned and incorrectly exposed radiographs from both the training and the test set biased the classification accuracy towards more favourable results. The possibility of automatically detecting positioning or exposure abnormalities has not yet been explored.

Another limitation of the present study is that the radiographic findings included in the training set do not, of course, fully represent all the lesion types that might occur in thoracic radiographs of dogs. Furthermore, due to the limited number of available cases, radiographs showing the least represented radiographic findings (tracheal collapse, hernia, fracture, and pneumomediastinum) were not included in the training. For the above reasons, the real “in-field” generalization ability of the developed CNN has yet to be fully tested.

The developed CNN is prospectively intended to assist veterinary clinicians, both general practitioners and radiology specialists, in their daily work. It is the authors’ opinion that the use of deep learning-based tools during routine clinical activity will increase productivity while decreasing the error rate. Generally speaking, veterinary facilities are smaller than human hospitals, and the global number of veterinary specialists across all disciplines is significantly lower than the global number of specialist medical doctors. Therefore, veterinary general practitioners are required to develop expertise in several different fields of medicine, such as radiology, surgery, internal medicine, and pathology. In such a scenario, veterinarians could greatly benefit from deep learning-based tools to assist them in their clinical routine. Indeed, several application cases for these algorithms have been proposed and analysed in the human medical literature: for instance, the use of deep learning-based algorithms is reported to increase the accuracy of skilled radiologists in the detection of pulmonary nodules9 and to decrease the average reporting delay in a clinical setting15. The possible impact of CNN use in the veterinary field has not yet been evaluated.

Methods

Database creation

Radiographic findings

All the images were reviewed by three experienced veterinary radiologists (AZ, TB, and SB, with more than 20, 10, and 3 years’ experience, respectively). Before interpretation, image quality was assessed; in particular, radiograph exposure and patient positioning were evaluated. Only properly exposed images with the animal positioned correctly were included in both data sets. Radiographs of immature dogs and images with evident artefacts (double exposure, dirt on the cassette, etc.) were also excluded. When available, both LL and VD radiographs of the same patient were reviewed simultaneously. The radiographs were classified strictly based on the presence or absence of individual radiographic findings, not on the presence or absence of pathologies (e.g. pneumonia) or conditions (e.g. oedema) that might be characterized by the simultaneous presence of several radiographic findings. All the radiographs were labelled according to the following radiographic findings: alveolar pattern, interstitial pattern, bronchial pattern, mass, cardiomegaly, pleural effusion, pneumothorax, hernia, megaoesophagus, fracture, pneumomediastinum, and tracheal collapse. If no radiographic findings were evident, the image was classified as unremarkable. The distribution (focal vs. diffuse) of both alveolar and interstitial patterns was not considered. Interstitial and bronchial patterns were graded as mild, moderate, or severe; mild bronchial and interstitial patterns were considered normal variations in the radiographic appearance of the canine thorax and were therefore not included in the training. If only mild bronchial and interstitial patterns were evident, the radiograph was classified as unremarkable. Cases showing either segmental or diffuse megaoesophagus were classified as megaoesophagus. The presence of cardiomegaly was assessed based on the authors’ experience; in unclear cases, the vertebral heart score28 was calculated and compared with the breed-specific reference intervals reported in the literature. Mediastinal and thoracic wall masses were included under the mass label. Both diaphragmatic and abdominal wall hernias were classified as hernia. Likewise, both rib and vertebral column fractures were classified as fracture; fractures of the long bones were not considered. No grading score was assigned to tracheal collapse. All the images were reviewed simultaneously by the three authors, and all the labels were assigned following a consensus discussion.
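The labelling rules above can be summarized as a mapping from the radiologists’ consensus annotations to binary training labels. The following sketch (with hypothetical data structures; the actual annotation workflow of this study was not implemented in code presented here) illustrates, in particular, how mild bronchial and interstitial patterns were folded into the unremarkable class and how radiographs showing excluded findings were left out of training:

```python
# Sketch of the labelling rules: mild bronchial/interstitial patterns count as
# normal variation, radiographs showing excluded findings are dropped, and a
# radiograph with no remaining findings is labelled "unremarkable".
TRAINED_FINDINGS = [
    "unremarkable", "cardiomegaly", "alveolar pattern", "bronchial pattern",
    "interstitial pattern", "mass", "pleural effusion", "pneumothorax",
    "megaoesophagus",
]
EXCLUDED = {"tracheal collapse", "hernia", "fracture", "pneumomediastinum"}

def consensus_to_labels(findings: dict[str, str]) -> dict[str, int] | None:
    """Map consensus annotations (finding -> grade) to binary labels.

    Returns None for radiographs showing under-represented findings,
    which were excluded from training.
    """
    if EXCLUDED & findings.keys():
        return None
    labels = {f: 0 for f in TRAINED_FINDINGS}
    for finding, grade in findings.items():
        # Mild bronchial and interstitial patterns are normal variation.
        if finding in ("bronchial pattern", "interstitial pattern") and grade == "mild":
            continue
        labels[finding] = 1
    # No findings left after filtering -> classified as unremarkable.
    if not any(labels[f] for f in TRAINED_FINDINGS if f != "unremarkable"):
        labels["unremarkable"] = 1
    return labels

# Example: a mild bronchial pattern alone is classified as unremarkable.
print(consensus_to_labels({"bronchial pattern": "mild"}))
```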

Image processing and deep learning

The deep-learning analysis was performed on a dedicated workstation (Linux operating system, Ubuntu 18.04, Canonical) equipped with four graphics processing units (Tesla V100; NVIDIA), a 2.2 GHz processor (Intel Xeon E5-2698 v4; Intel), and 256 GB of random-access memory. Before being fed to the CNN, the images were downsampled to 224 × 224 pixels. The images were not cropped during the test phase, nor were they lossy-compressed or converted to JPEG; instead, the lossless MHA format was used. Radiograph classification was performed using convolutional neural networks (CNNs), a class of deep-learning algorithms specifically designed to work with images; two different CNN architectures were tested: (1) DenseNet-12129 and (2) ResNet-5030. Both architectures were pre-trained on ImageNet, a large-scale data set of everyday images, and then fine-tuned. Different radiographic findings are usually evident on the same radiograph, often as a result of a single condition or pathology; therefore, a multi-label approach was used. Binary cross-entropy was used as the objective function. The same training parameters were used for all the networks. Training was performed until convergence using the Adam optimizer and a learning-rate scheduler with exponential decay. The weights from the epoch with the lowest loss on the validation set were chosen and used for testing. The training set was augmented with random horizontal/vertical flips, cropping, affine warping, and linear contrast changes. All the images were normalized to the 0-1 range, where 0 denotes the background. The split ratio for training, validation, and test sets (for Data Set 1) was 8:1:1. The training scheme did not directly optimize any of the evaluation metrics (e.g. AUC, sensitivity, or specificity). No information from Data Set 2 was used during training.
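A condensed sketch of the training setup described above, written in PyTorch (the framework actually used is not stated here, and the hyperparameter and augmentation values shown are assumptions for illustration only):

```python
# Sketch of the described pipeline: an ImageNet-pretrained ResNet-50 fine-tuned
# as a 9-label classifier with binary cross-entropy, the Adam optimizer, and an
# exponentially decaying learning rate. Hyperparameter values are assumptions.
import torch
import torch.nn as nn
from torchvision import models, transforms

NUM_FINDINGS = 9  # unremarkable, cardiomegaly, ..., megaoesophagus

model = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V1)
model.fc = nn.Linear(model.fc.in_features, NUM_FINDINGS)  # multi-label head
# models.densenet121(...) with a replaced classifier would be the analogous
# DenseNet-121 setup.

# Augmentations analogous to those described (flips, cropping, affine warping,
# linear contrast changes); exact parameters were not reported and are assumed.
train_transform = transforms.Compose([
    transforms.Grayscale(num_output_channels=3),  # replicate the single channel
    transforms.RandomHorizontalFlip(),
    transforms.RandomVerticalFlip(),
    transforms.RandomAffine(degrees=10, translate=(0.05, 0.05)),
    transforms.RandomResizedCrop(224, scale=(0.8, 1.0)),
    transforms.ColorJitter(contrast=0.2),
    transforms.ToTensor(),  # yields tensors in the 0-1 range
])

criterion = nn.BCEWithLogitsLoss()  # binary cross-entropy over the 9 labels
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
scheduler = torch.optim.lr_scheduler.ExponentialLR(optimizer, gamma=0.95)

def train_one_epoch(loader: torch.utils.data.DataLoader) -> None:
    model.train()
    for images, targets in loader:  # targets: (batch, 9) multi-hot floats
        optimizer.zero_grad()
        loss = criterion(model(images), targets)
        loss.backward()
        optimizer.step()
    scheduler.step()
```

As described above, the weights from the epoch with the lowest validation loss would then be retained for testing.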

Statistical analysis

We assessed the individual architectures, both on Data Set 1 and Data Set 2, using the area under the receiver operating characteristic curve (AUC), computed with a commercially available statistical software package (MedCalc). Sensitivity was calculated as: true positives / (true positives + false negatives); specificity as: true negatives / (true negatives + false positives); positive likelihood ratio (PLR) as: sensitivity / (1 − specificity); and negative likelihood ratio (NLR) as: (1 − sensitivity) / specificity. The performances of the two architectures were compared, on Data Set 2 only, with the DeLong test; the differences in the AUCs of the considered tests are expressed as Z scores. All p-values were assessed at an alpha of 0.05.
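For reference, the reported per-finding metrics can be reproduced from confusion counts and prediction scores as follows (a sketch only: the original analysis was performed in MedCalc, and scikit-learn is used here purely for illustration, with made-up values):

```python
# Sketch of the per-finding metrics: sensitivity, specificity, and likelihood
# ratios from confusion counts, plus AUC from continuous prediction scores.
from sklearn.metrics import roc_auc_score

def diagnostic_metrics(tp: int, fp: int, tn: int, fn: int) -> dict[str, float]:
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "PLR": sensitivity / (1 - specificity),  # positive likelihood ratio
        "NLR": (1 - sensitivity) / specificity,  # negative likelihood ratio
    }

# AUC for one finding, given ground-truth labels and predicted scores
# (illustrative values, not data from this study):
y_true = [1, 0, 1, 1, 0, 0]
y_score = [0.9, 0.2, 0.7, 0.6, 0.4, 0.1]
print(diagnostic_metrics(tp=3, fp=1, tn=2, fn=0))
print(roc_auc_score(y_true, y_score))
```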

Conclusions

A multi-label CNN-based network for the automatic classification of canine LL radiographs was developed and tested. The developed network had variable accuracy in the detection of radiographic findings in an external test set. Further studies, ideally including a larger number of radiographs acquired at several different veterinary institutions, could allow the development of a network with broader generalization ability; a larger database would also allow the network to be tested on VD images. CNN-based tools could, prospectively, assist veterinarians in their everyday work, allowing for higher-quality veterinary care. Nonetheless, for a successful application of these tools in the clinical workflow, their advantages and pitfalls must be clearly known by the operator.