Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2
Pybind11 bindings for whisper.cpp