mastodontech.de ist einer von vielen unabhängigen Mastodon-Servern, mit dem du dich im Fediverse beteiligen kannst.
Offen für alle (über 16) und bereitgestellt von Markus'Blog

Serverstatistik:

1,4 Tsd.
aktive Profile

#speechrecognition

1 Beitrag1 Beteiligte*r0 Beiträge heute
Sir thalon :klingon:<p>PDFs became my real-world AI benchmark. I can’t fill forms by hand, so ChatGPT in Agent mode now handles flat scans, anchors, proofs, and signatures on my phone—showing both the limits and the leverage of assistive AI.</p><p><a href="https://embassy.social/tags/Accessibility" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Accessibility</span></a> <a href="https://embassy.social/tags/AssistiveTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AssistiveTech</span></a> <a href="https://embassy.social/tags/A11y" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>A11y</span></a> <a href="https://embassy.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://embassy.social/tags/AgenticAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AgenticAI</span></a> <a href="https://embassy.social/tags/PDFForms" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PDFForms</span></a> <a href="https://embassy.social/tags/AcroForms" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AcroForms</span></a> <a href="https://embassy.social/tags/Inclusion" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Inclusion</span></a> <a href="https://embassy.social/tags/MobileFirst" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MobileFirst</span></a> <a href="https://embassy.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://embassy.social/tags/Automation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Automation</span></a> <a href="https://embassy.social/tags/Productivity" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Productivity</span></a> <a href="https://embassy.social/tags/AGI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AGI</span></a> <a href="https://embassy.social/tags/DocumentAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DocumentAI</span></a></p><p><a href="https://www.linkedin.com/posts/christian-bayerlein-ba578a171_accessibility-assistivetech-a11y-activity-7360218369846833152-xG9X" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">linkedin.com/posts/christian-b</span><span class="invisible">ayerlein-ba578a171_accessibility-assistivetech-a11y-activity-7360218369846833152-xG9X</span></a></p>
AskUbuntu<p>Speech to Text extension for real-time transcription in your browser — any good ones? <a href="https://ubuntu.social/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechrecognition</span></a></p><p><a href="https://askubuntu.com/q/1554138/612" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">askubuntu.com/q/1554138/612</span><span class="invisible"></span></a></p>
Data Quine<p>Journal of Open Source Software: voice: A Comprehensive R Package for Audio Analysis <br>{voice}<br>"...a free, open-source toolkit designed to streamline audio analysis by integrating music theory and advanced computational techniques. It enables researchers to extract, summarize, and analyze voice data efficiently, supporting applications such as speech recognition, speaker identification, and mood inference..."</p><p><a href="https://joss.theoj.org/papers/10.21105/joss.08420" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">joss.theoj.org/papers/10.21105</span><span class="invisible">/joss.08420</span></a></p><p><a href="https://datasci.social/tags/RStats" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RStats</span></a> <a href="https://datasci.social/tags/Audio" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Audio</span></a> <a href="https://datasci.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://datasci.social/tags/AudioAnalysis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AudioAnalysis</span></a> <a href="https://datasci.social/tags/Speech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Speech</span></a></p>
Hacker News<p>Voxtral-Mini-3B-2507 – Open source speech understanding model</p><p><a href="https://huggingface.co/mistralai/Voxtral-Mini-3B-2507" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">huggingface.co/mistralai/Voxtr</span><span class="invisible">al-Mini-3B-2507</span></a></p><p><a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HackerNews</span></a> <a href="https://mastodon.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://mastodon.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/Models" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Models</span></a> <a href="https://mastodon.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://mastodon.social/tags/VoxtralMini3B" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoxtralMini3B</span></a> <a href="https://mastodon.social/tags/TechNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TechNews</span></a></p>
Ecologia Digital<p>"<a href="https://mato.social/tags/KarenHao" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>KarenHao</span></a> only really gets her teeth into this point in the book’s epilogue, “How the Empire Falls.” She takes inspiration from <a href="https://mato.social/tags/TeHiku" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TeHiku</span></a>, a <a href="https://mato.social/tags/M%C4%81ori" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Māori</span></a> AI <a href="https://mato.social/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechrecognition</span></a> project. Te Hiku seeks to revitalize the <a href="https://mato.social/tags/te_reo" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>te_reo</span></a> language through putting archived audio tapes of te reo speakers into an AI model, teaching new generations of Māori.<br>The tech has been developed on consent and active participation from the Māori community, and it is only licensed to organizations that respect Māori values"</p>
Jeremy KahnI don't know why they call it vibe coding
Debby<p><span class="h-card" translate="no"><a href="https://mastodon.social/@thelinuxEXP" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>thelinuxEXP</span></a></span> I really like Speech Note! It's a fantastic tool for quick and local voice transcription in multiple languages, created by <span class="h-card" translate="no"><a href="https://mastodon.social/@mkiol" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>mkiol</span></a></span> </p><p>It's incredibly handy for capturing thoughts on the go, conducting interviews, or making voice memos without worrying about language barriers. The app uses strictly locally running LLMs, and its ease of use makes it a standout choice for anyone needing offline transcription services.</p><p>I primarily use <a href="https://hear-me.social/tags/WhisperAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WhisperAI</span></a> for transcription and Piper for voice, but many other models are available as well. </p><p>It is available as flatpak and <a href="https://github.com/mkiol/dsnote" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/mkiol/dsnote</span><span class="invisible"></span></a> </p><p><a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTS</span></a> <a href="https://hear-me.social/tags/transcription" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>transcription</span></a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextToSpeech</span></a> <a href="https://hear-me.social/tags/translator" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>translator</span></a> translation <a href="https://hear-me.social/tags/offline" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>offline</span></a> <a href="https://hear-me.social/tags/machinetranslation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>machinetranslation</span></a> <a href="https://hear-me.social/tags/sailfishos" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>sailfishos</span></a> <a href="https://hear-me.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://hear-me.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://hear-me.social/tags/speechtotext" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechtotext</span></a> <a href="https://hear-me.social/tags/nmt" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>nmt</span></a> <a href="https://hear-me.social/tags/linux" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>linux</span></a>-desktop <a href="https://hear-me.social/tags/stt" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>stt</span></a> <a href="https://hear-me.social/tags/asr" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>asr</span></a> <a href="https://hear-me.social/tags/flatpak" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>flatpak</span></a>-applications <a href="https://hear-me.social/tags/SpeechNote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechNote</span></a></p>
Hacker News<p>DeepSpeech Is Discontinued</p><p><a href="https://github.com/mozilla/DeepSpeech" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/mozilla/DeepSpeech</span><span class="invisible"></span></a></p><p><a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HackerNews</span></a> <a href="https://mastodon.social/tags/DeepSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DeepSpeech</span></a> <a href="https://mastodon.social/tags/Discontinued" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Discontinued</span></a> <a href="https://mastodon.social/tags/Mozilla" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Mozilla</span></a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://mastodon.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a></p>
PLOS Biology<p>Slow amplitude fluctuations in sounds, critical for <a href="https://fediscience.org/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a>, seem poorly represented in the <a href="https://fediscience.org/tags/brainstem" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>brainstem</span></a>. This study shows that overlooked intricacies of <a href="https://fediscience.org/tags/SpikeTiming" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpikeTiming</span></a> represent these fluctuations, reconciling low-level neural processing with <a href="https://fediscience.org/tags/perception" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>perception</span></a> @plosbiology.org 🧪 <a href="https://plos.io/3FJ4adI" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">plos.io/3FJ4adI</span><span class="invisible"></span></a></p>
CITO Greenhouse<p>The Marvel of Auditory and Cognitive Networks Working Together in Your Brain</p><p><a href="https://mastodon.social/tags/AuditoryProcessing" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AuditoryProcessing</span></a> <a href="https://mastodon.social/tags/BrainScience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>BrainScience</span></a> <a href="https://mastodon.social/tags/NeuralNetworks" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>NeuralNetworks</span></a> <a href="https://mastodon.social/tags/CognitiveScience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CognitiveScience</span></a> <a href="https://mastodon.social/tags/Hearing" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Hearing</span></a> <a href="https://mastodon.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://mastodon.social/tags/BrainPlasticity" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>BrainPlasticity</span></a> <a href="https://mastodon.social/tags/CentralNervousSystem" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CentralNervousSystem</span></a> <a href="https://mastodon.social/tags/SoundProcessing" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SoundProcessing</span></a> <a href="https://mastodon.social/tags/Neuroscience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Neuroscience</span></a> <a href="https://mastodon.social/tags/ListeningSkills" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ListeningSkills</span></a> <a href="https://mastodon.social/tags/BrainHealth" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>BrainHealth</span></a> <a href="https://mastodon.social/tags/AuditoryDisorders" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AuditoryDisorders</span></a> <a href="https://mastodon.social/tags/LearningAndMemory" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LearningAndMemory</span></a></p><p><a href="https://youtube.com/shorts/7GO01YoqIHo?feature=share" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">youtube.com/shorts/7GO01YoqIHo</span><span class="invisible">?feature=share</span></a></p>
Debby<p>🌟 Excited to share Thorsten-Voice's YouTube channel! 🎥 🗣️🔊 ♿ 💬</p><p>Thorsten presents innovative TTS solutions and a variety of voice technologies, making it an excellent starting point for anyone interested in open-source text-to-speech. Whether you're a developer, accessibility advocate, or tech enthusiast, his channel offers valuable insights and resources. Don't miss out on this fantastic content! 🎬</p><p>follow hem here: <span class="h-card" translate="no"><a href="https://techhub.social/@thorstenvoice" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>thorstenvoice</span></a></span> <br>or on YouTube: <a href="https://www.youtube.com/@ThorstenMueller" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="">youtube.com/@ThorstenMueller</span><span class="invisible"></span></a> YouTube channel! </p><p><a href="https://hear-me.social/tags/Accessibility" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Accessibility</span></a> <a href="https://hear-me.social/tags/FLOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FLOSS</span></a> <a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTS</span></a> <a href="https://hear-me.social/tags/ParlerTTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ParlerTTS</span></a> <a href="https://hear-me.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://hear-me.social/tags/VoiceTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceTech</span></a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextToSpeech</span></a> <a href="https://hear-me.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://hear-me.social/tags/CoquiAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CoquiAI</span></a> <a href="https://hear-me.social/tags/VoiceAssistant" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceAssistant</span></a> <a href="https://hear-me.social/tags/Sprachassistent" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sprachassistent</span></a> <a href="https://hear-me.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://hear-me.social/tags/AccessibilityMatters" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AccessibilityMatters</span></a> <a href="https://hear-me.social/tags/FLOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FLOSS</span></a> <a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTS</span></a> <a href="https://hear-me.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://hear-me.social/tags/Inclusivity" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Inclusivity</span></a> <a href="https://hear-me.social/tags/FOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FOSS</span></a> <a href="https://hear-me.social/tags/Coqui" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Coqui</span></a> <a href="https://hear-me.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://hear-me.social/tags/CoquiAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CoquiAI</span></a> <a href="https://hear-me.social/tags/VoiceAssistant" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceAssistant</span></a> <a href="https://hear-me.social/tags/Sprachassistent" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sprachassistent</span></a> <a href="https://hear-me.social/tags/VoiceTechnology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceTechnology</span></a> <a href="https://hear-me.social/tags/K%C3%BCnstlicheStimme" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>KünstlicheStimme</span></a> <a href="https://hear-me.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://hear-me.social/tags/Python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Python</span></a> <a href="https://hear-me.social/tags/Rhasspy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Rhasspy</span></a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextToSpeech</span></a> <a href="https://hear-me.social/tags/VoiceTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceTech</span></a> <a href="https://hear-me.social/tags/STT" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STT</span></a> <a href="https://hear-me.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://hear-me.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://hear-me.social/tags/Sprachsynthese" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sprachsynthese</span></a> <a href="https://hear-me.social/tags/ArtificialVoice" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArtificialVoice</span></a> <a href="https://hear-me.social/tags/VoiceCloning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceCloning</span></a> <a href="https://hear-me.social/tags/Spracherkennung" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Spracherkennung</span></a> <a href="https://hear-me.social/tags/CoquiTTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CoquiTTS</span></a> <a href="https://hear-me.social/tags/voice" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>voice</span></a> <a href="https://hear-me.social/tags/a11y" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>a11y</span></a> <a href="https://hear-me.social/tags/ScreenReader" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ScreenReader</span></a></p>
Debby<p>Goode <span class="h-card" translate="no"><a href="https://techhub.social/@thorstenvoice" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>thorstenvoice</span></a></span>, just found your channel and I'm impressed! Your work on TTS is fantastic and so important for accessibility in the FLOSS community. Keep it up! <a href="https://hear-me.social/tags/AccessibilityMatters" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AccessibilityMatters</span></a> <a href="https://hear-me.social/tags/FLOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FLOSS</span></a> <a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTS</span></a> <a href="https://hear-me.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://hear-me.social/tags/Inclusivity" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Inclusivity</span></a> <a href="https://hear-me.social/tags/FOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FOSS</span></a> <a href="https://hear-me.social/tags/Coqui" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Coqui</span></a> <a href="https://hear-me.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://hear-me.social/tags/CoquiAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CoquiAI</span></a> <a href="https://hear-me.social/tags/VoiceAssistant" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceAssistant</span></a> <a href="https://hear-me.social/tags/Sprachassistent" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sprachassistent</span></a> <a href="https://hear-me.social/tags/VoiceTechnology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceTechnology</span></a> <a href="https://hear-me.social/tags/K%C3%BCnstlicheStimme" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>KünstlicheStimme</span></a> <a href="https://hear-me.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://hear-me.social/tags/Python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Python</span></a> <a href="https://hear-me.social/tags/Rhasspy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Rhasspy</span></a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextToSpeech</span></a> <a href="https://hear-me.social/tags/VoiceTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceTech</span></a> <a href="https://hear-me.social/tags/STT" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STT</span></a> <a href="https://hear-me.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://hear-me.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://hear-me.social/tags/Sprachsynthese" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sprachsynthese</span></a> <a href="https://hear-me.social/tags/ArtificialVoice" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArtificialVoice</span></a> <a href="https://hear-me.social/tags/VoiceCloning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceCloning</span></a> <a href="https://hear-me.social/tags/Spracherkennung" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Spracherkennung</span></a> <a href="https://hear-me.social/tags/CoquiTTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CoquiTTS</span></a> <a href="https://hear-me.social/tags/voice" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>voice</span></a> <a href="https://hear-me.social/tags/a11y" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>a11y</span></a> <a href="https://hear-me.social/tags/ScreenReader" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ScreenReader</span></a></p>
IT News<p>Christmas Comes Early With AI Santa Demo - With only two hundred odd days ’til Christmas, you just know we’re already feeling... - <a href="https://hackaday.com/2025/05/18/christmas-comes-early-with-ai-santa-demo/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">hackaday.com/2025/05/18/christ</span><span class="invisible">mas-comes-early-with-ai-santa-demo/</span></a> <a href="https://schleuss.online/tags/artificialintelligence" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>artificialintelligence</span></a> <a href="https://schleuss.online/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechrecognition</span></a> <a href="https://schleuss.online/tags/speechsynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechsynthesis</span></a> <a href="https://schleuss.online/tags/santaclaus" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>santaclaus</span></a> <a href="https://schleuss.online/tags/libpeer" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>libpeer</span></a> <a href="https://schleuss.online/tags/openai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>openai</span></a> <a href="https://schleuss.online/tags/llm" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>llm</span></a> <a href="https://schleuss.online/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a></p>
Hacker News<p>Jargonic Sets New SOTA for Japanese ASR</p><p><a href="https://aiola.ai/blog/jargonic-japanese-asr/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">aiola.ai/blog/jargonic-japanes</span><span class="invisible">e-asr/</span></a></p><p><a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HackerNews</span></a> <a href="https://mastodon.social/tags/Jargonic" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Jargonic</span></a> <a href="https://mastodon.social/tags/SOTA" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SOTA</span></a> <a href="https://mastodon.social/tags/Japanese" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Japanese</span></a> <a href="https://mastodon.social/tags/ASR" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ASR</span></a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/Technology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Technology</span></a> <a href="https://mastodon.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://mastodon.social/tags/Innovation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Innovation</span></a></p>
Winbuzzer<p>Nvidia Releases High-Speed Parakeet AI Speech Recognition Model, Claims Top Spot on Leaderboard</p><p><a href="https://mastodon.social/tags/Nvidia" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Nvidia</span></a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/ASR" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ASR</span></a> <a href="https://mastodon.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://mastodon.social/tags/SpeechToText" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechToText</span></a> <a href="https://mastodon.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://mastodon.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://mastodon.social/tags/Parakeet" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Parakeet</span></a> <a href="https://mastodon.social/tags/NeMo" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>NeMo</span></a> <a href="https://mastodon.social/tags/HuggingFace" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HuggingFace</span></a> <a href="https://mastodon.social/tags/AIModels" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AIModels</span></a></p><p><a href="https://winbuzzer.com/2025/05/06/nvidia-releases-high-speed-parakeet-ai-speech-recognition-model-claims-top-spot-on-leaderboard-xcxwbn/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">winbuzzer.com/2025/05/06/nvidi</span><span class="invisible">a-releases-high-speed-parakeet-ai-speech-recognition-model-claims-top-spot-on-leaderboard-xcxwbn/</span></a></p>
Richard Emling (DO9RE)<p>I'm exploring ways to improve audio preprocessing for speech recognition for my [midi2hamlib](<a href="https://github.com/DO9RE/midi2hamlib" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/DO9RE/midi2hamlib</span><span class="invisible"></span></a>) project. Do any of my followers have expertise with **SoX** or **speech recognition**? Specifically, I’m seeking advice on: 1️⃣ Best practices for audio preparation for speech recognition. 2️⃣ SoX command-line parameters that can optimize audio during recording or playback. <br> <a href="https://github.com/DO9RE/midi2hamlib/blob/main/tests/speech_menu.sh" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/DO9RE/midi2hamlib/b</span><span class="invisible">lob/main/tests/speech_menu.sh</span></a> <a href="https://metalhead.club/tags/SoX" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SoX</span></a> <a href="https://metalhead.club/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://metalhead.club/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://metalhead.club/tags/AudioProcessing" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AudioProcessing</span></a> <a href="https://metalhead.club/tags/ShellScripting" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ShellScripting</span></a> <a href="https://metalhead.club/tags/Sphinx" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sphinx</span></a> <a href="https://metalhead.club/tags/PocketSphinx" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PocketSphinx</span></a> <a href="https://metalhead.club/tags/Audio" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Audio</span></a> Retoot appreciated.</p>
Hacker News<p>Jargonic: Industry-Tunable ASR Model</p><p><a href="https://aiola.ai/blog/introducing-jargonic-asr/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">aiola.ai/blog/introducing-jarg</span><span class="invisible">onic-asr/</span></a></p><p><a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HackerNews</span></a> <a href="https://mastodon.social/tags/Jargonic" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Jargonic</span></a> <a href="https://mastodon.social/tags/ASR" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ASR</span></a> <a href="https://mastodon.social/tags/Industry" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Industry</span></a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/Model" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Model</span></a> <a href="https://mastodon.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a></p>
Pyrzout :vm:<p>Be Careful What You Ask For: Voice Control <a href="https://hackaday.com/2025/02/19/be-careful-what-you-ask-for-voice-control/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">hackaday.com/2025/02/19/be-car</span><span class="invisible">eful-what-you-ask-for-voice-control/</span></a> <a href="https://social.skynetcloud.site/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechrecognition</span></a> <a href="https://social.skynetcloud.site/tags/computerspeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>computerspeech</span></a> <a href="https://social.skynetcloud.site/tags/voicecommand" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>voicecommand</span></a> <a href="https://social.skynetcloud.site/tags/Featured" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Featured</span></a> <a href="https://social.skynetcloud.site/tags/Rants" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Rants</span></a> <a href="https://social.skynetcloud.site/tags/rants" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>rants</span></a></p>
Doug Holton<p>Vibe is an <a href="https://mastodon.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> desktop client (mac, windows, linux) for locally running Whisper to more accurately transcribe or caption videos &amp; audio <a href="https://thewh1teagle.github.io/vibe/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">thewh1teagle.github.io/vibe/</span><span class="invisible"></span></a> Source code: <a href="https://github.com/thewh1teagle/vibe/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/thewh1teagle/vibe/</span><span class="invisible"></span></a> Easier to use than what I was using before (WhisperDesktop). Default settings use the medium Whisper model, which has been good enough in my experience.<br><a href="https://mastodon.social/tags/Accessibility" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Accessibility</span></a> <a href="https://mastodon.social/tags/A11y" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>A11y</span></a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://mastodon.social/tags/EdTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>EdTech</span></a></p>
The Conversation U.S.<p>Speech recognition systems struggle with accents and dialects, risking problems in critical fields like healthcare and emergency services. Imagine calling 911 and the AI used to screen out non-emergency calls can’t understand you. </p><p>A Spanish language professor explains: <a href="https://theconversation.com/sorry-i-didnt-get-that-ai-misunderstands-some-peoples-words-more-than-others-239281" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">theconversation.com/sorry-i-di</span><span class="invisible">dnt-get-that-ai-misunderstands-some-peoples-words-more-than-others-239281</span></a> <a href="https://newsie.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://newsie.social/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechrecognition</span></a></p>