TY - JOUR
T1 - A dataset of microscopic peripheral blood cell images for development of automatic recognition systems
AU - Acevedo, Andrea
AU - Merino, Anna
AU - Alférez, Santiago
AU - Molina, Ángel
AU - Boldú, Laura
AU - Rodellar, José
PY - 2020/6
Y1 - 2020/6
N2 - This article makes available a dataset that was used for the development of an automatic recognition system of peripheral blood cell images using convolutional neural networks [1]. The dataset contains a total of 17,092 images of individual normal cells, which were acquired using the analyzer CellaVision DM96 in the Core Laboratory at the Hospital Clinic of Barcelona. The dataset is organized in the following eight groups: neutrophils, eosinophils, basophils, lymphocytes, monocytes, immature granulocytes (promyelocytes, myelocytes, and metamyelocytes), erythroblasts and platelets or thrombocytes. The size of the images is 360 × 363 pixels, in format jpg, and they were annotated by expert clinical pathologists. The images were captured from individuals without infection, hematologic or oncologic disease and free of any pharmacologic treatment at the moment of blood collection. This high-quality labelled dataset may be used to train and test machine learning and deep learning models to recognize different types of normal peripheral blood cells. To our knowledge, this is the first publicly available set with large numbers of normal peripheral blood cells, so that it is expected to be a canonical dataset for model benchmarking.
AB - This article makes available a dataset that was used for the development of an automatic recognition system of peripheral blood cell images using convolutional neural networks [1]. The dataset contains a total of 17,092 images of individual normal cells, which were acquired using the analyzer CellaVision DM96 in the Core Laboratory at the Hospital Clinic of Barcelona. The dataset is organized in the following eight groups: neutrophils, eosinophils, basophils, lymphocytes, monocytes, immature granulocytes (promyelocytes, myelocytes, and metamyelocytes), erythroblasts and platelets or thrombocytes. The size of the images is 360 × 363 pixels, in format jpg, and they were annotated by expert clinical pathologists. The images were captured from individuals without infection, hematologic or oncologic disease and free of any pharmacologic treatment at the moment of blood collection. This high-quality labelled dataset may be used to train and test machine learning and deep learning models to recognize different types of normal peripheral blood cells. To our knowledge, this is the first publicly available set with large numbers of normal peripheral blood cells, so that it is expected to be a canonical dataset for model benchmarking.
UR - http://www.scopus.com/inward/record.url?scp=85083451557&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85083451557&partnerID=8YFLogxK
U2 - 10.1016/j.dib.2020.105474
DO - 10.1016/j.dib.2020.105474
M3 - Research Article
C2 - 32346559
AN - SCOPUS:85083451557
SN - 2352-3409
VL - 30
JO - Data in Brief
JF - Data in Brief
M1 - 105474
ER -