There is a new open source STT (speech-to-text) model called "moonshine" that promises superior performance and lower size on embedded devices. I will check it out :) https://github.com/usefulsensors/moonshine
Seems promising.
@td8 It understands other languages than English, doesn't it?
@mattesilver from what I understand when reading their documentation, currently it is an English-only model: https://github.com/usefulsensors/moonshine/blob/main/model-card.md
Hopefully, they can add more languages soon!