Automatic Silence Detection Employing Artificial Intelligence for Clinical Context Analyses

Research output: Knowledge networksConference proceedingspeer-review

Abstract

Automated speech and pause/silence detection is a crucial task in clinical and pathological environments, supporting diagnostic processes and providing essential information for treatment planning. This study evaluates three methods for automatic silence detection in clinical speech analysis: (1) a traditional energy-based method using zero-crossing detection, (2) a pretrained neural network model for voice activity detection (Silero-VAD), and (3) NVIDIA's speaker diarization and transcription tool. All methods demonstrated effective pause/silence detection with comparable error rates, though Silero-VAD exhibited superior precision and performance. Key metrics included a Dice coefficient of 0.917, an onset error of 500 ms, and an endpoint error of 370 ms, highlighting the importance of audio preprocessing.

Translated title of the contributionDetección automática de silencios mediante inteligencia artificial para análisis de contextos clínicos
Original languageEnglish (US)
DOIs
StatePublished - Dec 2024
Event3rd International Congress of Biomedical Engineering and Bioengineering, CIIBBI 2024 - Cali, Colombia
Duration: Nov 6 2024Nov 8 2024

Conference

Conference3rd International Congress of Biomedical Engineering and Bioengineering, CIIBBI 2024
Country/TerritoryColombia
CityCali
Period11/6/2411/8/24

All Science Journal Classification (ASJC) codes

  • Orthopedics and Sports Medicine
  • Bioengineering
  • Biomedical Engineering
  • Media Technology

Fingerprint

Dive into the research topics of 'Automatic Silence Detection Employing Artificial Intelligence for Clinical Context Analyses'. Together they form a unique fingerprint.

Cite this