Data Fusion Analysis for Determining Localization of Proteins Associated to Escherichia coli

Research output: Chapter in Book/Report › Conference contribution

Abstract

In recent years, interest in protein analysis based on biomolecular features has grown rapidly. This has prompted the exploration of machine learning models as an alternative approach to the problems associated with these analyses. Support vector machines, artificial neural networks, and random forests were compared for the prediction of protein localization. Two main sources of data were used to train the models to determine the localization site of each protein: information from the targeting signal and information from the protein sequence. A third scenario employed a fusion of both data sources. Four classes were established according to the subcellular localization of the protein: cytoplasm, periplasmic space, outer membrane, and inner membrane. Balanced accuracy ranged from 77% to 92%. The best-performing models were based on random forests and support vector machines.
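
As an illustration of the metric named in the abstract, the following is a minimal sketch of balanced accuracy, computed as the mean of per-class recall, for a four-class localization problem. It also shows one possible form of data fusion, simple concatenation of feature vectors; the abstract does not state how the fusion was performed, so that part is an assumption. The function names and the toy labels are hypothetical and do not come from the paper.

```python
from collections import defaultdict


def balanced_accuracy(y_true, y_pred):
    """Return the mean of per-class recall over the classes present in y_true."""
    total = defaultdict(int)    # number of true samples per class
    correct = defaultdict(int)  # number of correctly predicted samples per class
    for truth, pred in zip(y_true, y_pred):
        total[truth] += 1
        if truth == pred:
            correct[truth] += 1
    recalls = [correct[c] / total[c] for c in total]
    return sum(recalls) / len(recalls)


def fuse(signal_features, sequence_features):
    """Assumed fusion scheme: concatenate both feature vectors of each protein."""
    return [s + q for s, q in zip(signal_features, sequence_features)]


# Toy, made-up labels with an imbalanced class distribution.
y_true = (["cytoplasm"] * 6 + ["periplasm"] * 2
          + ["outer membrane"] + ["inner membrane"])
y_pred = (["cytoplasm"] * 6 + ["periplasm", "cytoplasm"]
          + ["outer membrane"] + ["cytoplasm"])

plain = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(plain)                              # plain accuracy
print(balanced_accuracy(y_true, y_pred))  # balanced accuracy
```

In this toy case the plain accuracy is 0.8 while the balanced accuracy is 0.625, because the misclassified minority classes weigh as much as the majority class. This is why balanced accuracy is a common choice when localization classes are unevenly represented.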

Original language: English (US)
Title of host publication: 2022 IEEE Colombian Conference on Applications of Computational Intelligence, ColCACI 2022 - Proceedings
Editors: Alvaro David Orjuela-Canon
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9781665474702
ISBN (Print): 9781665474702
State: Published - Jul 27 2022
Event: 2022 IEEE Colombian Conference on Applications of Computational Intelligence, ColCACI 2022 - Cali, Colombia
Duration: Jul 27 2022 to Jul 29 2022

Publication series

Name: 2022 IEEE Colombian Conference on Applications of Computational Intelligence (ColCACI)

Conference

Conference: 2022 IEEE Colombian Conference on Applications of Computational Intelligence, ColCACI 2022
Country/Territory: Colombia
City: Cali
Period: 7/27/22 to 7/29/22

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Science Applications
  • Computer Vision and Pattern Recognition
  • Control and Optimization
