Improving generalization of deep learning models for diagnostic pathology by increasing variability in training data: Experiments on osteosarcoma subtypes

Haiming Tang; Nanfei Sun; Steven Shen

doi:10.4103/jpi.jpi_78_20

Improving generalization of deep learning models for diagnostic pathology by increasing variability in training data: Experiments on osteosarcoma subtypes

Haiming Tang, Nanfei Sun, Steven Shen

Research output: Contribution to journal › Article › peer-review

16 Scopus citations

Abstract

Background: Artificial intelligence has an emerging progress in diagnostic pathology. A large number of studies of applying deep learning models to histopathological images have been published in recent years. While many studies claim high accuracies, they may fall into the pitfalls of overfitting and lack of generalization due to the high variability of the histopathological images. Aims and Objects: Use the model training of osteosarcoma as an example to illustrate the pitfalls of overfitting and how the addition of model input variability can help improve model performance. Materials and Methods: We use the publicly available osteosarcoma dataset to retrain a previously published classification model for osteosarcoma. We partition the same set of images into the training and testing datasets differently than the original study: The test dataset consists of images from one patient while the training dataset consists images of all other patients. We also show the influence of training data variability on model performance by collecting a minimal dataset of 10 osteosarcoma subtypes as well as benign tissues and benign bone tumors of differentiation. Results: The performance of the re-Trained model on the test set using the new partition schema declines dramatically, indicating a lack of model generalization and overfitting. We show the additions of more and moresubtypes into the training data step by step under the same model schema yield a series of coherent models with increasing performances. Conclusions: In conclusion, we bring forward data preprocessing and collection tactics for histopathological images of high variability to avoid the pitfalls of overfitting and build deep learning models of higher generalization abilities.

Original language	English (US)
Article number	30
Pages (from-to)	30
Journal	Journal of Pathology Informatics
Volume	12
Issue number	1
DOIs	https://doi.org/10.4103/jpi.jpi_78_20
State	Published - Jan 1 2021

Keywords

Artificial intelligence
computer vision
deep learning
diagnostic pathology
osteosarcoma
overfitting

ASJC Scopus subject areas

Pathology and Forensic Medicine
Health Informatics
Computer Science Applications

Access to Document

10.4103/jpi.jpi_78_20

Cite this

@article{ca34b6c837974b36a8eb526612b8d39b,

title = "Improving generalization of deep learning models for diagnostic pathology by increasing variability in training data: Experiments on osteosarcoma subtypes",

abstract = "Background: Artificial intelligence has an emerging progress in diagnostic pathology. A large number of studies of applying deep learning models to histopathological images have been published in recent years. While many studies claim high accuracies, they may fall into the pitfalls of overfitting and lack of generalization due to the high variability of the histopathological images. Aims and Objects: Use the model training of osteosarcoma as an example to illustrate the pitfalls of overfitting and how the addition of model input variability can help improve model performance. Materials and Methods: We use the publicly available osteosarcoma dataset to retrain a previously published classification model for osteosarcoma. We partition the same set of images into the training and testing datasets differently than the original study: The test dataset consists of images from one patient while the training dataset consists images of all other patients. We also show the influence of training data variability on model performance by collecting a minimal dataset of 10 osteosarcoma subtypes as well as benign tissues and benign bone tumors of differentiation. Results: The performance of the re-Trained model on the test set using the new partition schema declines dramatically, indicating a lack of model generalization and overfitting. We show the additions of more and moresubtypes into the training data step by step under the same model schema yield a series of coherent models with increasing performances. Conclusions: In conclusion, we bring forward data preprocessing and collection tactics for histopathological images of high variability to avoid the pitfalls of overfitting and build deep learning models of higher generalization abilities.",

keywords = "Artificial intelligence, computer vision, deep learning, diagnostic pathology, osteosarcoma, overfitting",

author = "Haiming Tang and Nanfei Sun and Steven Shen",

note = "Copyright: {\textcopyright} 2021 Journal of Pathology Informatics.",

year = "2021",

month = jan,

day = "1",

doi = "10.4103/jpi.jpi_78_20",

language = "English (US)",

volume = "12",

pages = "30",

journal = "Journal of Pathology Informatics",

issn = "2229-5089",

publisher = "Medknow Publications and Media Pvt. Ltd",

number = "1",

}

TY - JOUR

T1 - Improving generalization of deep learning models for diagnostic pathology by increasing variability in training data

T2 - Experiments on osteosarcoma subtypes

AU - Tang, Haiming

AU - Sun, Nanfei

AU - Shen, Steven

PY - 2021/1/1

Y1 - 2021/1/1

N2 - Background: Artificial intelligence has an emerging progress in diagnostic pathology. A large number of studies of applying deep learning models to histopathological images have been published in recent years. While many studies claim high accuracies, they may fall into the pitfalls of overfitting and lack of generalization due to the high variability of the histopathological images. Aims and Objects: Use the model training of osteosarcoma as an example to illustrate the pitfalls of overfitting and how the addition of model input variability can help improve model performance. Materials and Methods: We use the publicly available osteosarcoma dataset to retrain a previously published classification model for osteosarcoma. We partition the same set of images into the training and testing datasets differently than the original study: The test dataset consists of images from one patient while the training dataset consists images of all other patients. We also show the influence of training data variability on model performance by collecting a minimal dataset of 10 osteosarcoma subtypes as well as benign tissues and benign bone tumors of differentiation. Results: The performance of the re-Trained model on the test set using the new partition schema declines dramatically, indicating a lack of model generalization and overfitting. We show the additions of more and moresubtypes into the training data step by step under the same model schema yield a series of coherent models with increasing performances. Conclusions: In conclusion, we bring forward data preprocessing and collection tactics for histopathological images of high variability to avoid the pitfalls of overfitting and build deep learning models of higher generalization abilities.

AB - Background: Artificial intelligence has an emerging progress in diagnostic pathology. A large number of studies of applying deep learning models to histopathological images have been published in recent years. While many studies claim high accuracies, they may fall into the pitfalls of overfitting and lack of generalization due to the high variability of the histopathological images. Aims and Objects: Use the model training of osteosarcoma as an example to illustrate the pitfalls of overfitting and how the addition of model input variability can help improve model performance. Materials and Methods: We use the publicly available osteosarcoma dataset to retrain a previously published classification model for osteosarcoma. We partition the same set of images into the training and testing datasets differently than the original study: The test dataset consists of images from one patient while the training dataset consists images of all other patients. We also show the influence of training data variability on model performance by collecting a minimal dataset of 10 osteosarcoma subtypes as well as benign tissues and benign bone tumors of differentiation. Results: The performance of the re-Trained model on the test set using the new partition schema declines dramatically, indicating a lack of model generalization and overfitting. We show the additions of more and moresubtypes into the training data step by step under the same model schema yield a series of coherent models with increasing performances. Conclusions: In conclusion, we bring forward data preprocessing and collection tactics for histopathological images of high variability to avoid the pitfalls of overfitting and build deep learning models of higher generalization abilities.

KW - Artificial intelligence

KW - computer vision

KW - deep learning

KW - diagnostic pathology

KW - osteosarcoma

KW - overfitting

UR - http://www.scopus.com/inward/record.url?scp=85114417611&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85114417611&partnerID=8YFLogxK

U2 - 10.4103/jpi.jpi_78_20

DO - 10.4103/jpi.jpi_78_20

M3 - Article

C2 - 34497734

AN - SCOPUS:85114417611

SN - 2229-5089

VL - 12

SP - 30

JO - Journal of Pathology Informatics

JF - Journal of Pathology Informatics

IS - 1

M1 - 30

ER -

Improving generalization of deep learning models for diagnostic pathology by increasing variability in training data: Experiments on osteosarcoma subtypes

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this