Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Speech-to-Text based on Silero VAD + whisper.cpp (GGML Whisper) for ROS 2