Speech-to-text, text-to-speech, speaker diarization, speech enhancement,
source separation, and VAD using next-gen Kaldi with onnxruntime, without an
Internet connection. Supports embedded systems, Android, iOS, HarmonyOS,
Raspberry Pi, RISC-V, x86_64 servers, and websocket server/client, with
support for 12 programming languages.