mastodontech.de ist einer von vielen unabhängigen Mastodon-Servern, mit dem du dich im Fediverse beteiligen kannst.
Offen für alle (über 16) und bereitgestellt von Markus'Blog

Serverstatistik:

1,4 Tsd.
aktive Profile

#speechnote

0 Beiträge0 Beteiligte0 Beiträge heute
mkiol<p><a href="https://mastodon.social/tags/SpeechNote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechNote</span></a> has just reached 1K stars on GitHub. I know that doesn't mean anything, but this is a good opportunity to sum something up.</p><p>Right now you can install it via Flathub, Arch Linux AUR, OpenSUSE Pacman repo and OpenRepos if you use Sailfish OS. According to Flathub stats only, Speech Note is downloaded 300 times per day. The last update was installed on about 20K computers! This is much more than I could have ever foreseen. This is amazing and very rewarding. Thank you, dear users!</p>
Debby<p><span class="h-card" translate="no"><a href="https://infosec.exchange/@pancake" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>pancake</span></a></span> You could try <a href="https://hear-me.social/tags/SpeechNote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechNote</span></a> *(available as flatpak and <a href="https://github.com/mkiol/dsnote" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/mkiol/dsnote</span><span class="invisible"></span></a> ) It's a fantastic tool for quick and local voice transcription in multiple languages, but also a grate way to use and try different TTS voices - generally I like Piper voices, they sound grate and are FOSS</p>
Debby<p><span class="h-card" translate="no"><a href="https://mastodon.social/@thelinuxEXP" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>thelinuxEXP</span></a></span> I really like Speech Note! It's a fantastic tool for quick and local voice transcription in multiple languages, created by <span class="h-card" translate="no"><a href="https://mastodon.social/@mkiol" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>mkiol</span></a></span> </p><p>It's incredibly handy for capturing thoughts on the go, conducting interviews, or making voice memos without worrying about language barriers. The app uses strictly locally running LLMs, and its ease of use makes it a standout choice for anyone needing offline transcription services.</p><p>I primarily use <a href="https://hear-me.social/tags/WhisperAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WhisperAI</span></a> for transcription and Piper for voice, but many other models are available as well. </p><p>It is available as flatpak and <a href="https://github.com/mkiol/dsnote" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/mkiol/dsnote</span><span class="invisible"></span></a> </p><p><a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTS</span></a> <a href="https://hear-me.social/tags/transcription" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>transcription</span></a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextToSpeech</span></a> <a href="https://hear-me.social/tags/translator" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>translator</span></a> translation <a href="https://hear-me.social/tags/offline" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>offline</span></a> <a href="https://hear-me.social/tags/machinetranslation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>machinetranslation</span></a> <a href="https://hear-me.social/tags/sailfishos" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>sailfishos</span></a> <a href="https://hear-me.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://hear-me.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://hear-me.social/tags/speechtotext" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechtotext</span></a> <a href="https://hear-me.social/tags/nmt" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>nmt</span></a> <a href="https://hear-me.social/tags/linux" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>linux</span></a>-desktop <a href="https://hear-me.social/tags/stt" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>stt</span></a> <a href="https://hear-me.social/tags/asr" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>asr</span></a> <a href="https://hear-me.social/tags/flatpak" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>flatpak</span></a>-applications <a href="https://hear-me.social/tags/SpeechNote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechNote</span></a></p>
mkiol<p>If you speak Swedish and are looking for fast, accurate and offline Speech-to-text, check out KBLab's fine-tuned Whisper models. These models were trained using the resources of the National Library of Sweden. Even the "Tiny" model is much more accurate than the original Whisper model.</p><p>More about this cool project: <a href="https://kb-labb.github.io/posts/2025-03-07-welcome-KB-Whisper/index.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">kb-labb.github.io/posts/2025-0</span><span class="invisible">3-07-welcome-KB-Whisper/index.html</span></a></p><p>You can play with the KBLab models in the latest version of <a href="https://mastodon.social/tags/SpeechNote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechNote</span></a> app.</p>
moagee<p>völlig underrated:</p><p><a href="https://chaos.social/tags/SpeechNote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechNote</span></a> ist eine datenschutzfreundliche Linux-App, die Sprache in Text umwandelt (<a href="https://chaos.social/tags/STT" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STT</span></a>), Text vorliest (auch Dateien) (<a href="https://chaos.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTS</span></a>) und übersetzt – alles lokal ohne Internetverbindung.<br>Viele Sprachen und Open-Source-Modelle stehen zum einbinden zur Verfügung!</p>
moagee<p><span class="h-card" translate="no"><a href="https://mastodon.social/@mkiol" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>mkiol</span></a></span> small workaround under wayland: I created a bash script and laid it with a global shortcut. I can now have all the texts under wayland read out system-wide via "action start-reading-text" ;)<br>thx for your work! :)<br><a href="https://chaos.social/tags/speechnote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechnote</span></a></p>
mkiol<p><a href="https://mastodon.social/tags/SpeechNote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechNote</span></a> 4.8.0 is now available on <a href="https://mastodon.social/tags/Flathub" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Flathub</span></a>!</p><p>Release highlights:<br>- New TTS engines: Kokoro, Parler-TTS, F5-TTS<br>- Support for Global Keyboard Shortcuts and "Insert into active window" on Wayland<br>- Many new STT and TTS models</p><p>If you are using the GPU add-on, update it to version 1.4.0 as well.</p><p>Video presenting all the changes: <a href="https://youtu.be/ww6skKOOzZ8" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">youtu.be/ww6skKOOzZ8</span><span class="invisible"></span></a></p><p>Full changelog: <a href="https://github.com/mkiol/dsnote/releases/tag/v4.8.0" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/mkiol/dsnote/releas</span><span class="invisible">es/tag/v4.8.0</span></a></p>
DerBrumme<p>Hallo <a href="https://troet.cafe/tags/linux" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>linux</span></a> bubble, brauche mal euer Schwarmwissen: Habe <a href="https://troet.cafe/tags/speechnote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechnote</span></a> installiert (<a href="https://troet.cafe/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> -basierte offline- Spracherkennung, läuft also auf'm lokalen PC).</p><p>Speech2Text klappt sehr gut (Rest noch nicht getestet). </p><p>Mein Ziel wäre ne Verbindung zum Keyboard: Ich will nicht nur Text diktieren, den ich dann irgendwohin kopiere, sondern sondern ne Art direkte Spracheingabe haben, also die Tastatur ersetzen. </p><p>Geht das?</p><p>Meine Anleitung: <a href="https://www.youtube.com/watch?v=VDMbWUfHsbk" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">youtube.com/watch?v=VDMbWUfHsb</span><span class="invisible">k</span></a></p><p>Mehr Info: <a href="https://linuxnews.de/speech-note-notizen-und-mehr/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">linuxnews.de/speech-note-notiz</span><span class="invisible">en-und-mehr/</span></a></p>
mmcm<p>I just uninstalled 4 <a href="https://mastodon.social/tags/flatpak" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>flatpak</span></a> apps:</p><p>* <a href="https://mastodon.social/tags/speechNote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechNote</span></a> (+AMD addon)<br>* <a href="https://mastodon.social/tags/mongodb" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>mongodb</span></a> compass<br>* <span class="h-card" translate="no"><a href="https://fosstodon.org/@organicmaps" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>organicmaps</span></a></span> <br>* <a href="https://mastodon.social/tags/verso" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>verso</span></a> (which I installed for fun)</p><p>This literated a whopping 52GiB off my system drive. Especially the AMD "addon" with over 12GiB was shocking.</p><p>So, guess I'm in the market for a <a href="https://mastodon.social/tags/linux" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>linux</span></a> <a href="https://mastodon.social/tags/floss" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>floss</span></a> offline-only <a href="https://mastodon.social/tags/whisper" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>whisper</span></a> / <a href="https://mastodon.social/tags/STT" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STT</span></a> solution that integrates into a desktop.<br>And for <a href="https://mastodon.social/tags/organicMaps" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>organicMaps</span></a> I guess I'll wait until one day there'll be a .deb. 🙄</p>
Devin Prater :blind:<p>Made this issue on SpeechNote with Orca:</p><p><a href="https://github.com/mkiol/dsnote/issues/168" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/mkiol/dsnote/issues</span><span class="invisible">/168</span></a></p><p><a href="https://tweesecake.social/tags/accessibility" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>accessibility</span></a> <a href="https://tweesecake.social/tags/orca" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>orca</span></a> <a href="https://tweesecake.social/tags/linux" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>linux</span></a> <a href="https://tweesecake.social/tags/foss" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>foss</span></a> <a href="https://tweesecake.social/tags/blind" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>blind</span></a> <a href="https://tweesecake.social/tags/speechnote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechnote</span></a></p>
OSTechNix<p>Speech Note – Offline Speech Recognition, Text-to-Speech and Translation App for Linux <a href="https://floss.social/tags/Speechnote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Speechnote</span></a> <a href="https://floss.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextToSpeech</span></a> <a href="https://floss.social/tags/SpeechToText" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechToText</span></a> <a href="https://floss.social/tags/Translator" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Translator</span></a> <a href="https://floss.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTS</span></a> <a href="https://floss.social/tags/STT" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STT</span></a> <a href="https://floss.social/tags/Opensource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Opensource</span></a> <a href="https://floss.social/tags/Linux" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Linux</span></a> <br><a href="https://ostechnix.com/speech-note-speech-recognition-text-to-speech-translation-app-for-linux/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">ostechnix.com/speech-note-spee</span><span class="invisible">ch-recognition-text-to-speech-translation-app-for-linux/</span></a></p>
Vincent Batts<p><span class="h-card" translate="no"><a href="https://mastodon.social/@jzb" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>jzb</span></a></span> good question. I just did a simple comparison of a few models. I'm sure there is a better text to use for such a text...<br>It's neat that the Whisper models even get the punctuation more correct than Vosk.<br>And the Mozilla model is just not great.<br>Also, they all seem to process a little differently. Like, the Whisper models listen, and only output whole text once it has processed, whereas the Vosk will roll out words even if it amends them on the fly</p><p><a href="https://fosstodon.org/tags/speechnote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechnote</span></a> <a href="https://fosstodon.org/tags/linux" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>linux</span></a> <a href="https://fosstodon.org/tags/desktop" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>desktop</span></a> <a href="https://fosstodon.org/tags/speechtotext" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechtotext</span></a></p>
Vincent Batts<p>Speech to Text on Linux?!?!</p><p>I just found "Speech Note" when searching the Software app. <br>Installed it via flatpak.<br><a href="https://flathub.org/apps/net.mkiol.SpeechNote" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">flathub.org/apps/net.mkiol.Spe</span><span class="invisible">echNote</span></a></p><p>Used the "English (Vosk Large)" and "English (Vosk Small)" language model with very decent results. There are loads of models to choose from.<br>All processed locally. No network needed!<br>This is great!</p><p><a href="https://fosstodon.org/tags/accessibility" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>accessibility</span></a> <a href="https://fosstodon.org/tags/SpeechToText" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechToText</span></a> <a href="https://fosstodon.org/tags/Linux" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Linux</span></a> <a href="https://fosstodon.org/tags/debian" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>debian</span></a> <a href="https://fosstodon.org/tags/flatpak" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>flatpak</span></a> <a href="https://fosstodon.org/tags/flathub" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>flathub</span></a> <a href="https://fosstodon.org/tags/SpeechNote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechNote</span></a></p>
Fossery Tech :debian: :gnome:TTS voice change in my upcoming videos
ricardo :mastodon:<p><a href="https://fosstodon.org/tags/SpeechNote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechNote</span></a> Transcribes Voice to Text on <a href="https://fosstodon.org/tags/Linux" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Linux</span></a> 🗣️ </p><p><a href="https://www.omglinux.com/speech-note-transcribe-voice-to-text-on-linux/" rel="nofollow noopener" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">omglinux.com/speech-note-trans</span><span class="invisible">cribe-voice-to-text-on-linux/</span></a></p>
mkiol<p>I've just released Speech Note 4.0!<br>New version comes with shiny new offline machine Translator and many new Text to Speech voices.</p><p>To implement the Translator I borrowed some code and models from amazing <a href="https://mastodon.social/tags/BergamotProject" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>BergamotProject</span></a> and <a href="https://mastodon.social/tags/FirefoxTranslations" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FirefoxTranslations</span></a>.</p><p><a href="https://mastodon.social/tags/SpeechNote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechNote</span></a> is a Linux offline <a href="https://mastodon.social/tags/SpeechToText" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechToText</span></a>, <a href="https://mastodon.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextToSpeech</span></a> and <a href="https://mastodon.social/tags/MachineTranslation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineTranslation</span></a> app. You can download it from <a href="https://mastodon.social/tags/Flathub" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Flathub</span></a></p><p>Videos:<br><a href="https://mastodon.social/tags/Linux" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Linux</span></a> Desktop: <a href="https://youtu.be/psRT0UPFb04" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">youtu.be/psRT0UPFb04</span><span class="invisible"></span></a><br><a href="https://mastodon.social/tags/PinePhone" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PinePhone</span></a>: <a href="https://youtu.be/kTsM3kUxE2Q" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">youtu.be/kTsM3kUxE2Q</span><span class="invisible"></span></a><br><a href="https://mastodon.social/tags/SailfishOS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SailfishOS</span></a>: <a href="https://youtu.be/88cdPpvBmmI" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">youtu.be/88cdPpvBmmI</span><span class="invisible"></span></a></p>
mkiol<p>If you have to do Speech-to-Text and Text-to-Speech tasks and don't want to send your data to the Internet, I recommend you to try Speech Note (Linux desktop app). </p><p>It is easy to use, works offline and supports 57 languages!</p><p>Speech Note works thanks to powerful <a href="https://mastodon.social/tags/STT" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STT</span></a> and <a href="https://mastodon.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTS</span></a> engines underneath: <a href="https://mastodon.social/tags/DeepSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DeepSpeech</span></a> <a href="https://mastodon.social/tags/Coqui" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Coqui</span></a> <a href="https://mastodon.social/tags/Vosk" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Vosk</span></a> <a href="https://mastodon.social/tags/Whisper" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Whisper</span></a> <a href="https://mastodon.social/tags/Piper" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Piper</span></a> <a href="https://mastodon.social/tags/eSpeak" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>eSpeak</span></a> <a href="https://mastodon.social/tags/MBROLA" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MBROLA</span></a> <a href="https://mastodon.social/tags/RHVoice" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RHVoice</span></a></p><p>You can download <a href="https://mastodon.social/tags/SpeechNote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechNote</span></a> from <a href="https://mastodon.social/tags/Flathub" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Flathub</span></a>: <a href="https://flathub.org/apps/net.mkiol.SpeechNote" rel="nofollow noopener" target="_blank"><span class="invisible">https://</span><span class="ellipsis">flathub.org/apps/net.mkiol.Spe</span><span class="invisible">echNote</span></a></p><p>Video demo: <a href="https://youtu.be/EhUPvaHvssw" rel="nofollow noopener" target="_blank"><span class="invisible">https://</span><span class="">youtu.be/EhUPvaHvssw</span><span class="invisible"></span></a></p>