Preliminary Text Analysis from Medical Records for TB Diagnosis Support

Andres Felipe Romero Gomez; Alvaro D. Orjuela-Canon; Andres L. Jutinico; Carlos Awad; Erika Vergara; Angelica Palencia

doi:10.1109/EMBC46164.2021.9631006

Preliminary Text Analysis from Medical Records for TB Diagnosis Support

Andres Felipe Romero Gomez, Alvaro D. Orjuela-Canon, Andres L. Jutinico, Carlos Awad, Erika Vergara, Angelica Palencia

Research output: Chapter in Book/Report › Conference contribution

Abstract

Tuberculosis is an infectious disease that is spread through the air from one person to another and is one of the top ten causes of death in the world according to the World Health Organization. From biomedical engineering, decision support systems based on artificial intelligence have shown advantages for healthcare personnel in tasks such as diagnosis and screening. A specific area of the artificial intelligence is the natural language processing, however, most of these approaches are based on available data. This paper shows the construction of a dataset based on medical records of subjects suspected of tuberculosis. In addition, an initial exploration of the contents of the constructed dataset and how this approach can be followed by a natural language processing to support tuberculosis diagnosis in data demanding scenarios are presented.Clinical Relevance - In some developing countries as Colombia, it is difficult to develop systems based on artificial intelligence due to the availability of data. This proposal holds a strategy to build a dataset to train machine learning models, and to obtain support diagnosis tools, employing natural language from the medical scenario from text written by health professionals in the medical record. In this way, trained models based on this information available can be employed in places where medical infrastructure is precarious.

Original language	English (US)
Title of host publication	43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	2468-2471
Number of pages	4
Edition	2021
ISBN (Electronic)	9781728111797
DOIs	https://doi.org/10.1109/EMBC46164.2021.9631006
State	Published - Nov 2021
Event	43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021 - Virtual, Online, Mexico Duration: Nov 1 2021 → Nov 5 2021

Publication series

Name	Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
ISSN (Print)	1557-170X

Conference

Conference	43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021
Country/Territory	Mexico
City	Virtual, Online
Period	11/1/21 → 11/5/21

All Science Journal Classification (ASJC) codes

Signal Processing
Biomedical Engineering
Computer Vision and Pattern Recognition
Health Informatics

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1109/EMBC46164.2021.9631006

Cite this

Romero Gomez, A. F., Orjuela-Canon, A. D., Jutinico, A. L., Awad, C., Vergara, E., & Palencia, A. (2021). Preliminary Text Analysis from Medical Records for TB Diagnosis Support. In 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021 (2021 ed., pp. 2468-2471). (Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/EMBC46164.2021.9631006

Romero Gomez, Andres Felipe ; Orjuela-Canon, Alvaro D. ; Jutinico, Andres L. et al. / Preliminary Text Analysis from Medical Records for TB Diagnosis Support. 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021. 2021. ed. Institute of Electrical and Electronics Engineers Inc., 2021. pp. 2468-2471 (Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS).

@inproceedings{b657404e56a94014a0ea472a5bd8a340,

title = "Preliminary Text Analysis from Medical Records for TB Diagnosis Support",

abstract = "Tuberculosis is an infectious disease that is spread through the air from one person to another and is one of the top ten causes of death in the world according to the World Health Organization. From biomedical engineering, decision support systems based on artificial intelligence have shown advantages for healthcare personnel in tasks such as diagnosis and screening. A specific area of the artificial intelligence is the natural language processing, however, most of these approaches are based on available data. This paper shows the construction of a dataset based on medical records of subjects suspected of tuberculosis. In addition, an initial exploration of the contents of the constructed dataset and how this approach can be followed by a natural language processing to support tuberculosis diagnosis in data demanding scenarios are presented.Clinical Relevance - In some developing countries as Colombia, it is difficult to develop systems based on artificial intelligence due to the availability of data. This proposal holds a strategy to build a dataset to train machine learning models, and to obtain support diagnosis tools, employing natural language from the medical scenario from text written by health professionals in the medical record. In this way, trained models based on this information available can be employed in places where medical infrastructure is precarious.",

author = "{Romero Gomez}, {Andres Felipe} and Orjuela-Canon, {Alvaro D.} and Jutinico, {Andres L.} and Carlos Awad and Erika Vergara and Angelica Palencia",

note = "Publisher Copyright: {\textcopyright} 2021 IEEE.; 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021 ; Conference date: 01-11-2021 Through 05-11-2021",

year = "2021",

month = nov,

doi = "10.1109/EMBC46164.2021.9631006",

language = "English (US)",

series = "Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "2468--2471",

booktitle = "43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021",

address = "United States",

edition = "2021",

}

Romero Gomez, AF, Orjuela-Canon, AD, Jutinico, AL, Awad, C, Vergara, E & Palencia, A 2021, Preliminary Text Analysis from Medical Records for TB Diagnosis Support. in 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021. 2021 edn, Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS, Institute of Electrical and Electronics Engineers Inc., pp. 2468-2471, 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021, Virtual, Online, Mexico, 11/1/21. https://doi.org/10.1109/EMBC46164.2021.9631006

Preliminary Text Analysis from Medical Records for TB Diagnosis Support. / Romero Gomez, Andres Felipe; Orjuela-Canon, Alvaro D.; Jutinico, Andres L. et al.
43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021. 2021. ed. Institute of Electrical and Electronics Engineers Inc., 2021. p. 2468-2471 (Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS).

Research output: Chapter in Book/Report › Conference contribution

TY - GEN

T1 - Preliminary Text Analysis from Medical Records for TB Diagnosis Support

AU - Romero Gomez, Andres Felipe

AU - Orjuela-Canon, Alvaro D.

AU - Jutinico, Andres L.

AU - Awad, Carlos

AU - Vergara, Erika

AU - Palencia, Angelica

PY - 2021/11

Y1 - 2021/11

N2 - Tuberculosis is an infectious disease that is spread through the air from one person to another and is one of the top ten causes of death in the world according to the World Health Organization. From biomedical engineering, decision support systems based on artificial intelligence have shown advantages for healthcare personnel in tasks such as diagnosis and screening. A specific area of the artificial intelligence is the natural language processing, however, most of these approaches are based on available data. This paper shows the construction of a dataset based on medical records of subjects suspected of tuberculosis. In addition, an initial exploration of the contents of the constructed dataset and how this approach can be followed by a natural language processing to support tuberculosis diagnosis in data demanding scenarios are presented.Clinical Relevance - In some developing countries as Colombia, it is difficult to develop systems based on artificial intelligence due to the availability of data. This proposal holds a strategy to build a dataset to train machine learning models, and to obtain support diagnosis tools, employing natural language from the medical scenario from text written by health professionals in the medical record. In this way, trained models based on this information available can be employed in places where medical infrastructure is precarious.

AB - Tuberculosis is an infectious disease that is spread through the air from one person to another and is one of the top ten causes of death in the world according to the World Health Organization. From biomedical engineering, decision support systems based on artificial intelligence have shown advantages for healthcare personnel in tasks such as diagnosis and screening. A specific area of the artificial intelligence is the natural language processing, however, most of these approaches are based on available data. This paper shows the construction of a dataset based on medical records of subjects suspected of tuberculosis. In addition, an initial exploration of the contents of the constructed dataset and how this approach can be followed by a natural language processing to support tuberculosis diagnosis in data demanding scenarios are presented.Clinical Relevance - In some developing countries as Colombia, it is difficult to develop systems based on artificial intelligence due to the availability of data. This proposal holds a strategy to build a dataset to train machine learning models, and to obtain support diagnosis tools, employing natural language from the medical scenario from text written by health professionals in the medical record. In this way, trained models based on this information available can be employed in places where medical infrastructure is precarious.

UR - http://www.scopus.com/inward/record.url?scp=85122493183&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85122493183&partnerID=8YFLogxK

U2 - 10.1109/EMBC46164.2021.9631006

DO - 10.1109/EMBC46164.2021.9631006

M3 - Conference contribution

C2 - 34891779

AN - SCOPUS:85122493183

T3 - Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS

SP - 2468

EP - 2471

BT - 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021

Y2 - 1 November 2021 through 5 November 2021

ER -

Romero Gomez AF, Orjuela-Canon AD, Jutinico AL, Awad C, Vergara E, Palencia A. Preliminary Text Analysis from Medical Records for TB Diagnosis Support. In 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021. 2021 ed. Institute of Electrical and Electronics Engineers Inc. 2021. p. 2468-2471. (Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS). doi: 10.1109/EMBC46164.2021.9631006

Preliminary Text Analysis from Medical Records for TB Diagnosis Support

Abstract

Publication series

Conference

All Science Journal Classification (ASJC) codes

UN SDGs

Access to Document

Other files and links

Fingerprint

Cite this