MastodonTech.de

Sir thalon :klingon:PDFs became my real-world AI benchmark. I can’t fill forms by hand, so ChatGPT in Agent mode now handles flat scans, anchors, proofs, and signatures on my phone—showing both the limits and the leverage of assistive AI.<a href="https://embassy.social/tags/Accessibility" class="mention hashtag" rel="nofollow noopener" target="_blank">#Accessibility</a> <a href="https://embassy.social/tags/AssistiveTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#AssistiveTech</a> <a href="https://embassy.social/tags/A11y" class="mention hashtag" rel="nofollow noopener" target="_blank">#A11y</a> <a href="https://embassy.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#AI</a> <a href="https://embassy.social/tags/AgenticAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#AgenticAI</a> <a href="https://embassy.social/tags/PDFForms" class="mention hashtag" rel="nofollow noopener" target="_blank">#PDFForms</a> <a href="https://embassy.social/tags/AcroForms" class="mention hashtag" rel="nofollow noopener" target="_blank">#AcroForms</a> <a href="https://embassy.social/tags/Inclusion" class="mention hashtag" rel="nofollow noopener" target="_blank">#Inclusion</a> <a href="https://embassy.social/tags/MobileFirst" class="mention hashtag" rel="nofollow noopener" target="_blank">#MobileFirst</a> <a href="https://embassy.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechRecognition</a> <a href="https://embassy.social/tags/Automation" class="mention hashtag" rel="nofollow noopener" target="_blank">#Automation</a> <a href="https://embassy.social/tags/Productivity" class="mention hashtag" rel="nofollow noopener" target="_blank">#Productivity</a> <a href="https://embassy.social/tags/AGI" class="mention hashtag" rel="nofollow noopener" target="_blank">#AGI</a> <a href="https://embassy.social/tags/DocumentAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#DocumentAI</a><a href="https://www.linkedin.com/posts/christian-bayerlein-ba578a171_accessibility-assistivetech-a11y-activity-7360218369846833152-xG9X" rel="nofollow noopener" translate="no" target="_blank">https://www.linkedin.com/posts/christian-bayerlein-ba578a171_accessibility-assistivetech-a11y-activity-7360218369846833152-xG9X</a>

AskUbuntuSpeech to Text extension for real-time transcription in your browser — any good ones? <a href="https://ubuntu.social/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#speechrecognition</a><a href="https://askubuntu.com/q/1554138/612" rel="nofollow noopener" translate="no" target="_blank">https://askubuntu.com/q/1554138/612</a>

Data QuineJournal of Open Source Software: voice: A Comprehensive R Package for Audio Analysis {voice} "...a free, open-source toolkit designed to streamline audio analysis by integrating music theory and advanced computational techniques. It enables researchers to extract, summarize, and analyze voice data efficiently, supporting applications such as speech recognition, speaker identification, and mood inference..."<a href="https://joss.theoj.org/papers/10.21105/joss.08420" rel="nofollow noopener" translate="no" target="_blank">https://joss.theoj.org/papers/10.21105/joss.08420</a><a href="https://datasci.social/tags/RStats" class="mention hashtag" rel="nofollow noopener" target="_blank">#RStats</a> <a href="https://datasci.social/tags/Audio" class="mention hashtag" rel="nofollow noopener" target="_blank">#Audio</a> <a href="https://datasci.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechRecognition</a> <a href="https://datasci.social/tags/AudioAnalysis" class="mention hashtag" rel="nofollow noopener" target="_blank">#AudioAnalysis</a> <a href="https://datasci.social/tags/Speech" class="mention hashtag" rel="nofollow noopener" target="_blank">#Speech</a>

Hacker NewsVoxtral-Mini-3B-2507 – Open source speech understanding model<a href="https://huggingface.co/mistralai/Voxtral-Mini-3B-2507" rel="nofollow noopener" translate="no" target="_blank">https://huggingface.co/mistralai/Voxtral-Mini-3B-2507</a><a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#HackerNews</a> <a href="https://mastodon.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#OpenSource</a> <a href="https://mastodon.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechRecognition</a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#AI</a> <a href="https://mastodon.social/tags/Models" class="mention hashtag" rel="nofollow noopener" target="_blank">#Models</a> <a href="https://mastodon.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#MachineLearning</a> <a href="https://mastodon.social/tags/VoxtralMini3B" class="mention hashtag" rel="nofollow noopener" target="_blank">#VoxtralMini3B</a> <a href="https://mastodon.social/tags/TechNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#TechNews</a>

Ecologia Digital"<a href="https://mato.social/tags/KarenHao" class="mention hashtag" rel="nofollow noopener" target="_blank">#KarenHao</a> only really gets her teeth into this point in the book’s epilogue, “How the Empire Falls.” She takes inspiration from <a href="https://mato.social/tags/TeHiku" class="mention hashtag" rel="nofollow noopener" target="_blank">#TeHiku</a>, a <a href="https://mato.social/tags/M%C4%81ori" class="mention hashtag" rel="nofollow noopener" target="_blank">#Māori</a> AI <a href="https://mato.social/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#speechrecognition</a> project. Te Hiku seeks to revitalize the <a href="https://mato.social/tags/te_reo" class="mention hashtag" rel="nofollow noopener" target="_blank">#te_reo</a> language through putting archived audio tapes of te reo speakers into an AI model, teaching new generations of Māori. The tech has been developed on consent and active participation from the Māori community, and it is only licensed to organizations that respect Māori values"

Jeremy KahnI don't know why they call it vibe coding

Debby<a href="https://mastodon.social/@thelinuxEXP" class="u-url mention" rel="nofollow noopener" target="_blank">@thelinuxEXP</a> I really like Speech Note! It's a fantastic tool for quick and local voice transcription in multiple languages, created by <a href="https://mastodon.social/@mkiol" class="u-url mention" rel="nofollow noopener" target="_blank">@mkiol</a> It's incredibly handy for capturing thoughts on the go, conducting interviews, or making voice memos without worrying about language barriers. The app uses strictly locally running LLMs, and its ease of use makes it a standout choice for anyone needing offline transcription services.I primarily use <a href="https://hear-me.social/tags/WhisperAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#WhisperAI</a> for transcription and Piper for voice, but many other models are available as well. It is available as flatpak and <a href="https://github.com/mkiol/dsnote" rel="nofollow noopener" translate="no" target="_blank">https://github.com/mkiol/dsnote</a> <a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#TTS</a> <a href="https://hear-me.social/tags/transcription" class="mention hashtag" rel="nofollow noopener" target="_blank">#transcription</a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#TextToSpeech</a> <a href="https://hear-me.social/tags/translator" class="mention hashtag" rel="nofollow noopener" target="_blank">#translator</a> translation <a href="https://hear-me.social/tags/offline" class="mention hashtag" rel="nofollow noopener" target="_blank">#offline</a> <a href="https://hear-me.social/tags/machinetranslation" class="mention hashtag" rel="nofollow noopener" target="_blank">#machinetranslation</a> <a href="https://hear-me.social/tags/sailfishos" class="mention hashtag" rel="nofollow noopener" target="_blank">#sailfishos</a> <a href="https://hear-me.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechSynthesis</a> <a href="https://hear-me.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechRecognition</a> <a href="https://hear-me.social/tags/speechtotext" class="mention hashtag" rel="nofollow noopener" target="_blank">#speechtotext</a> <a href="https://hear-me.social/tags/nmt" class="mention hashtag" rel="nofollow noopener" target="_blank">#nmt</a> <a href="https://hear-me.social/tags/linux" class="mention hashtag" rel="nofollow noopener" target="_blank">#linux</a>-desktop <a href="https://hear-me.social/tags/stt" class="mention hashtag" rel="nofollow noopener" target="_blank">#stt</a> <a href="https://hear-me.social/tags/asr" class="mention hashtag" rel="nofollow noopener" target="_blank">#asr</a> <a href="https://hear-me.social/tags/flatpak" class="mention hashtag" rel="nofollow noopener" target="_blank">#flatpak</a>-applications <a href="https://hear-me.social/tags/SpeechNote" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechNote</a>

Hacker NewsDeepSpeech Is Discontinued<a href="https://github.com/mozilla/DeepSpeech" rel="nofollow noopener" translate="no" target="_blank">https://github.com/mozilla/DeepSpeech</a><a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#HackerNews</a> <a href="https://mastodon.social/tags/DeepSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#DeepSpeech</a> <a href="https://mastodon.social/tags/Discontinued" class="mention hashtag" rel="nofollow noopener" target="_blank">#Discontinued</a> <a href="https://mastodon.social/tags/Mozilla" class="mention hashtag" rel="nofollow noopener" target="_blank">#Mozilla</a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#AI</a> <a href="https://mastodon.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechRecognition</a> <a href="https://mastodon.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#MachineLearning</a>

PLOS BiologySlow amplitude fluctuations in sounds, critical for <a href="https://fediscience.org/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechRecognition</a>, seem poorly represented in the <a href="https://fediscience.org/tags/brainstem" class="mention hashtag" rel="nofollow noopener" target="_blank">#brainstem</a>. This study shows that overlooked intricacies of <a href="https://fediscience.org/tags/SpikeTiming" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpikeTiming</a> represent these fluctuations, reconciling low-level neural processing with <a href="https://fediscience.org/tags/perception" class="mention hashtag" rel="nofollow noopener" target="_blank">#perception</a> @plosbiology.org 🧪 <a href="https://plos.io/3FJ4adI" rel="nofollow noopener" translate="no" target="_blank">https://plos.io/3FJ4adI</a>

CITO GreenhouseThe Marvel of Auditory and Cognitive Networks Working Together in Your Brain<a href="https://mastodon.social/tags/AuditoryProcessing" class="mention hashtag" rel="nofollow noopener" target="_blank">#AuditoryProcessing</a> <a href="https://mastodon.social/tags/BrainScience" class="mention hashtag" rel="nofollow noopener" target="_blank">#BrainScience</a> <a href="https://mastodon.social/tags/NeuralNetworks" class="mention hashtag" rel="nofollow noopener" target="_blank">#NeuralNetworks</a> <a href="https://mastodon.social/tags/CognitiveScience" class="mention hashtag" rel="nofollow noopener" target="_blank">#CognitiveScience</a> <a href="https://mastodon.social/tags/Hearing" class="mention hashtag" rel="nofollow noopener" target="_blank">#Hearing</a> <a href="https://mastodon.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechRecognition</a> <a href="https://mastodon.social/tags/BrainPlasticity" class="mention hashtag" rel="nofollow noopener" target="_blank">#BrainPlasticity</a> <a href="https://mastodon.social/tags/CentralNervousSystem" class="mention hashtag" rel="nofollow noopener" target="_blank">#CentralNervousSystem</a> <a href="https://mastodon.social/tags/SoundProcessing" class="mention hashtag" rel="nofollow noopener" target="_blank">#SoundProcessing</a> <a href="https://mastodon.social/tags/Neuroscience" class="mention hashtag" rel="nofollow noopener" target="_blank">#Neuroscience</a> <a href="https://mastodon.social/tags/ListeningSkills" class="mention hashtag" rel="nofollow noopener" target="_blank">#ListeningSkills</a> <a href="https://mastodon.social/tags/BrainHealth" class="mention hashtag" rel="nofollow noopener" target="_blank">#BrainHealth</a> <a href="https://mastodon.social/tags/AuditoryDisorders" class="mention hashtag" rel="nofollow noopener" target="_blank">#AuditoryDisorders</a> <a href="https://mastodon.social/tags/LearningAndMemory" class="mention hashtag" rel="nofollow noopener" target="_blank">#LearningAndMemory</a><a href="https://youtube.com/shorts/7GO01YoqIHo?feature=share" rel="nofollow noopener" translate="no" target="_blank">https://youtube.com/shorts/7GO01YoqIHo?feature=share</a>

Debby🌟 Excited to share Thorsten-Voice's YouTube channel! 🎥 🗣️🔊 ♿ 💬Thorsten presents innovative TTS solutions and a variety of voice technologies, making it an excellent starting point for anyone interested in open-source text-to-speech. Whether you're a developer, accessibility advocate, or tech enthusiast, his channel offers valuable insights and resources. Don't miss out on this fantastic content! 🎬follow hem here: <a href="https://techhub.social/@thorstenvoice" class="u-url mention" rel="nofollow noopener" target="_blank">@thorstenvoice</a> or on YouTube: <a href="https://www.youtube.com/@ThorstenMueller" rel="nofollow noopener" translate="no" target="_blank">https://www.youtube.com/@ThorstenMueller</a> YouTube channel! <a href="https://hear-me.social/tags/Accessibility" class="mention hashtag" rel="nofollow noopener" target="_blank">#Accessibility</a> <a href="https://hear-me.social/tags/FLOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#FLOSS</a> <a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#TTS</a> <a href="https://hear-me.social/tags/ParlerTTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#ParlerTTS</a> <a href="https://hear-me.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#OpenSource</a> <a href="https://hear-me.social/tags/VoiceTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#VoiceTech</a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#TextToSpeech</a> <a href="https://hear-me.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#AI</a> <a href="https://hear-me.social/tags/CoquiAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#CoquiAI</a> <a href="https://hear-me.social/tags/VoiceAssistant" class="mention hashtag" rel="nofollow noopener" target="_blank">#VoiceAssistant</a> <a href="https://hear-me.social/tags/Sprachassistent" class="mention hashtag" rel="nofollow noopener" target="_blank">#Sprachassistent</a> <a href="https://hear-me.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#MachineLearning</a> <a href="https://hear-me.social/tags/AccessibilityMatters" class="mention hashtag" rel="nofollow noopener" target="_blank">#AccessibilityMatters</a> <a href="https://hear-me.social/tags/FLOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#FLOSS</a> <a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#TTS</a> <a href="https://hear-me.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#OpenSource</a> <a href="https://hear-me.social/tags/Inclusivity" class="mention hashtag" rel="nofollow noopener" target="_blank">#Inclusivity</a> <a href="https://hear-me.social/tags/FOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#FOSS</a> <a href="https://hear-me.social/tags/Coqui" class="mention hashtag" rel="nofollow noopener" target="_blank">#Coqui</a> <a href="https://hear-me.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#AI</a> <a href="https://hear-me.social/tags/CoquiAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#CoquiAI</a> <a href="https://hear-me.social/tags/VoiceAssistant" class="mention hashtag" rel="nofollow noopener" target="_blank">#VoiceAssistant</a> <a href="https://hear-me.social/tags/Sprachassistent" class="mention hashtag" rel="nofollow noopener" target="_blank">#Sprachassistent</a> <a href="https://hear-me.social/tags/VoiceTechnology" class="mention hashtag" rel="nofollow noopener" target="_blank">#VoiceTechnology</a> <a href="https://hear-me.social/tags/K%C3%BCnstlicheStimme" class="mention hashtag" rel="nofollow noopener" target="_blank">#KünstlicheStimme</a> <a href="https://hear-me.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#MachineLearning</a> <a href="https://hear-me.social/tags/Python" class="mention hashtag" rel="nofollow noopener" target="_blank">#Python</a> <a href="https://hear-me.social/tags/Rhasspy" class="mention hashtag" rel="nofollow noopener" target="_blank">#Rhasspy</a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#TextToSpeech</a> <a href="https://hear-me.social/tags/VoiceTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#VoiceTech</a> <a href="https://hear-me.social/tags/STT" class="mention hashtag" rel="nofollow noopener" target="_blank">#STT</a> <a href="https://hear-me.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechSynthesis</a> <a href="https://hear-me.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechRecognition</a> <a href="https://hear-me.social/tags/Sprachsynthese" class="mention hashtag" rel="nofollow noopener" target="_blank">#Sprachsynthese</a> <a href="https://hear-me.social/tags/ArtificialVoice" class="mention hashtag" rel="nofollow noopener" target="_blank">#ArtificialVoice</a> <a href="https://hear-me.social/tags/VoiceCloning" class="mention hashtag" rel="nofollow noopener" target="_blank">#VoiceCloning</a> <a href="https://hear-me.social/tags/Spracherkennung" class="mention hashtag" rel="nofollow noopener" target="_blank">#Spracherkennung</a> <a href="https://hear-me.social/tags/CoquiTTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#CoquiTTS</a> <a href="https://hear-me.social/tags/voice" class="mention hashtag" rel="nofollow noopener" target="_blank">#voice</a> <a href="https://hear-me.social/tags/a11y" class="mention hashtag" rel="nofollow noopener" target="_blank">#a11y</a> <a href="https://hear-me.social/tags/ScreenReader" class="mention hashtag" rel="nofollow noopener" target="_blank">#ScreenReader</a>

DebbyGoode <a href="https://techhub.social/@thorstenvoice" class="u-url mention" rel="nofollow noopener" target="_blank">@thorstenvoice</a>, just found your channel and I'm impressed! Your work on TTS is fantastic and so important for accessibility in the FLOSS community. Keep it up! <a href="https://hear-me.social/tags/AccessibilityMatters" class="mention hashtag" rel="nofollow noopener" target="_blank">#AccessibilityMatters</a> <a href="https://hear-me.social/tags/FLOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#FLOSS</a> <a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#TTS</a> <a href="https://hear-me.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#OpenSource</a> <a href="https://hear-me.social/tags/Inclusivity" class="mention hashtag" rel="nofollow noopener" target="_blank">#Inclusivity</a> <a href="https://hear-me.social/tags/FOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#FOSS</a> <a href="https://hear-me.social/tags/Coqui" class="mention hashtag" rel="nofollow noopener" target="_blank">#Coqui</a> <a href="https://hear-me.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#AI</a> <a href="https://hear-me.social/tags/CoquiAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#CoquiAI</a> <a href="https://hear-me.social/tags/VoiceAssistant" class="mention hashtag" rel="nofollow noopener" target="_blank">#VoiceAssistant</a> <a href="https://hear-me.social/tags/Sprachassistent" class="mention hashtag" rel="nofollow noopener" target="_blank">#Sprachassistent</a> <a href="https://hear-me.social/tags/VoiceTechnology" class="mention hashtag" rel="nofollow noopener" target="_blank">#VoiceTechnology</a> <a href="https://hear-me.social/tags/K%C3%BCnstlicheStimme" class="mention hashtag" rel="nofollow noopener" target="_blank">#KünstlicheStimme</a> <a href="https://hear-me.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#MachineLearning</a> <a href="https://hear-me.social/tags/Python" class="mention hashtag" rel="nofollow noopener" target="_blank">#Python</a> <a href="https://hear-me.social/tags/Rhasspy" class="mention hashtag" rel="nofollow noopener" target="_blank">#Rhasspy</a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#TextToSpeech</a> <a href="https://hear-me.social/tags/VoiceTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#VoiceTech</a> <a href="https://hear-me.social/tags/STT" class="mention hashtag" rel="nofollow noopener" target="_blank">#STT</a> <a href="https://hear-me.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechSynthesis</a> <a href="https://hear-me.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechRecognition</a> <a href="https://hear-me.social/tags/Sprachsynthese" class="mention hashtag" rel="nofollow noopener" target="_blank">#Sprachsynthese</a> <a href="https://hear-me.social/tags/ArtificialVoice" class="mention hashtag" rel="nofollow noopener" target="_blank">#ArtificialVoice</a> <a href="https://hear-me.social/tags/VoiceCloning" class="mention hashtag" rel="nofollow noopener" target="_blank">#VoiceCloning</a> <a href="https://hear-me.social/tags/Spracherkennung" class="mention hashtag" rel="nofollow noopener" target="_blank">#Spracherkennung</a> <a href="https://hear-me.social/tags/CoquiTTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#CoquiTTS</a> <a href="https://hear-me.social/tags/voice" class="mention hashtag" rel="nofollow noopener" target="_blank">#voice</a> <a href="https://hear-me.social/tags/a11y" class="mention hashtag" rel="nofollow noopener" target="_blank">#a11y</a> <a href="https://hear-me.social/tags/ScreenReader" class="mention hashtag" rel="nofollow noopener" target="_blank">#ScreenReader</a>

IT NewsChristmas Comes Early With AI Santa Demo - With only two hundred odd days ’til Christmas, you just know we’re already feeling... - <a href="https://hackaday.com/2025/05/18/christmas-comes-early-with-ai-santa-demo/" rel="nofollow noopener" translate="no" target="_blank">https://hackaday.com/2025/05/18/christmas-comes-early-with-ai-santa-demo/</a> <a href="https://schleuss.online/tags/artificialintelligence" class="mention hashtag" rel="nofollow noopener" target="_blank">#artificialintelligence</a> <a href="https://schleuss.online/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#speechrecognition</a> <a href="https://schleuss.online/tags/speechsynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#speechsynthesis</a> <a href="https://schleuss.online/tags/santaclaus" class="mention hashtag" rel="nofollow noopener" target="_blank">#santaclaus</a> <a href="https://schleuss.online/tags/libpeer" class="mention hashtag" rel="nofollow noopener" target="_blank">#libpeer</a> <a href="https://schleuss.online/tags/openai" class="mention hashtag" rel="nofollow noopener" target="_blank">#openai</a> <a href="https://schleuss.online/tags/llm" class="mention hashtag" rel="nofollow noopener" target="_blank">#llm</a> <a href="https://schleuss.online/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#ai</a>

Hacker NewsJargonic Sets New SOTA for Japanese ASR<a href="https://aiola.ai/blog/jargonic-japanese-asr/" rel="nofollow noopener" translate="no" target="_blank">https://aiola.ai/blog/jargonic-japanese-asr/</a><a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#HackerNews</a> <a href="https://mastodon.social/tags/Jargonic" class="mention hashtag" rel="nofollow noopener" target="_blank">#Jargonic</a> <a href="https://mastodon.social/tags/SOTA" class="mention hashtag" rel="nofollow noopener" target="_blank">#SOTA</a> <a href="https://mastodon.social/tags/Japanese" class="mention hashtag" rel="nofollow noopener" target="_blank">#Japanese</a> <a href="https://mastodon.social/tags/ASR" class="mention hashtag" rel="nofollow noopener" target="_blank">#ASR</a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#AI</a> <a href="https://mastodon.social/tags/Technology" class="mention hashtag" rel="nofollow noopener" target="_blank">#Technology</a> <a href="https://mastodon.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechRecognition</a> <a href="https://mastodon.social/tags/Innovation" class="mention hashtag" rel="nofollow noopener" target="_blank">#Innovation</a>

WinbuzzerNvidia Releases High-Speed Parakeet AI Speech Recognition Model, Claims Top Spot on Leaderboard<a href="https://mastodon.social/tags/Nvidia" class="mention hashtag" rel="nofollow noopener" target="_blank">#Nvidia</a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#AI</a> <a href="https://mastodon.social/tags/ASR" class="mention hashtag" rel="nofollow noopener" target="_blank">#ASR</a> <a href="https://mastodon.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechRecognition</a> <a href="https://mastodon.social/tags/SpeechToText" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechToText</a> <a href="https://mastodon.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#OpenSource</a> <a href="https://mastodon.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#MachineLearning</a> <a href="https://mastodon.social/tags/Parakeet" class="mention hashtag" rel="nofollow noopener" target="_blank">#Parakeet</a> <a href="https://mastodon.social/tags/NeMo" class="mention hashtag" rel="nofollow noopener" target="_blank">#NeMo</a> <a href="https://mastodon.social/tags/HuggingFace" class="mention hashtag" rel="nofollow noopener" target="_blank">#HuggingFace</a> <a href="https://mastodon.social/tags/AIModels" class="mention hashtag" rel="nofollow noopener" target="_blank">#AIModels</a><a href="https://winbuzzer.com/2025/05/06/nvidia-releases-high-speed-parakeet-ai-speech-recognition-model-claims-top-spot-on-leaderboard-xcxwbn/" rel="nofollow noopener" translate="no" target="_blank">https://winbuzzer.com/2025/05/06/nvidia-releases-high-speed-parakeet-ai-speech-recognition-model-claims-top-spot-on-leaderboard-xcxwbn/</a>

Richard Emling (DO9RE)I'm exploring ways to improve audio preprocessing for speech recognition for my [midi2hamlib](<a href="https://github.com/DO9RE/midi2hamlib" rel="nofollow noopener" translate="no" target="_blank">https://github.com/DO9RE/midi2hamlib</a>) project. Do any of my followers have expertise with **SoX** or **speech recognition**? Specifically, I’m seeking advice on: 1️⃣ Best practices for audio preparation for speech recognition. 2️⃣ SoX command-line parameters that can optimize audio during recording or playback. <a href="https://github.com/DO9RE/midi2hamlib/blob/main/tests/speech_menu.sh" rel="nofollow noopener" translate="no" target="_blank">https://github.com/DO9RE/midi2hamlib/blob/main/tests/speech_menu.sh</a> <a href="https://metalhead.club/tags/SoX" class="mention hashtag" rel="nofollow noopener" target="_blank">#SoX</a> <a href="https://metalhead.club/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechRecognition</a> <a href="https://metalhead.club/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#OpenSource</a> <a href="https://metalhead.club/tags/AudioProcessing" class="mention hashtag" rel="nofollow noopener" target="_blank">#AudioProcessing</a> <a href="https://metalhead.club/tags/ShellScripting" class="mention hashtag" rel="nofollow noopener" target="_blank">#ShellScripting</a> <a href="https://metalhead.club/tags/Sphinx" class="mention hashtag" rel="nofollow noopener" target="_blank">#Sphinx</a> <a href="https://metalhead.club/tags/PocketSphinx" class="mention hashtag" rel="nofollow noopener" target="_blank">#PocketSphinx</a> <a href="https://metalhead.club/tags/Audio" class="mention hashtag" rel="nofollow noopener" target="_blank">#Audio</a> Retoot appreciated.

Hacker NewsJargonic: Industry-Tunable ASR Model<a href="https://aiola.ai/blog/introducing-jargonic-asr/" rel="nofollow noopener" translate="no" target="_blank">https://aiola.ai/blog/introducing-jargonic-asr/</a><a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#HackerNews</a> <a href="https://mastodon.social/tags/Jargonic" class="mention hashtag" rel="nofollow noopener" target="_blank">#Jargonic</a> <a href="https://mastodon.social/tags/ASR" class="mention hashtag" rel="nofollow noopener" target="_blank">#ASR</a> <a href="https://mastodon.social/tags/Industry" class="mention hashtag" rel="nofollow noopener" target="_blank">#Industry</a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#AI</a> <a href="https://mastodon.social/tags/Model" class="mention hashtag" rel="nofollow noopener" target="_blank">#Model</a> <a href="https://mastodon.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechRecognition</a>

Pyrzout :vm:Be Careful What You Ask For: Voice Control <a href="https://hackaday.com/2025/02/19/be-careful-what-you-ask-for-voice-control/" rel="nofollow noopener" translate="no" target="_blank">https://hackaday.com/2025/02/19/be-careful-what-you-ask-for-voice-control/</a> <a href="https://social.skynetcloud.site/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#speechrecognition</a> <a href="https://social.skynetcloud.site/tags/computerspeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#computerspeech</a> <a href="https://social.skynetcloud.site/tags/voicecommand" class="mention hashtag" rel="nofollow noopener" target="_blank">#voicecommand</a> <a href="https://social.skynetcloud.site/tags/Featured" class="mention hashtag" rel="nofollow noopener" target="_blank">#Featured</a> <a href="https://social.skynetcloud.site/tags/Rants" class="mention hashtag" rel="nofollow noopener" target="_blank">#Rants</a> <a href="https://social.skynetcloud.site/tags/rants" class="mention hashtag" rel="nofollow noopener" target="_blank">#rants</a>

Doug HoltonVibe is an <a href="https://mastodon.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#OpenSource</a> desktop client (mac, windows, linux) for locally running Whisper to more accurately transcribe or caption videos & audio <a href="https://thewh1teagle.github.io/vibe/" rel="nofollow noopener" translate="no" target="_blank">https://thewh1teagle.github.io/vibe/</a> Source code: <a href="https://github.com/thewh1teagle/vibe/" rel="nofollow noopener" translate="no" target="_blank">https://github.com/thewh1teagle/vibe/</a> Easier to use than what I was using before (WhisperDesktop). Default settings use the medium Whisper model, which has been good enough in my experience. <a href="https://mastodon.social/tags/Accessibility" class="mention hashtag" rel="nofollow noopener" target="_blank">#Accessibility</a> <a href="https://mastodon.social/tags/A11y" class="mention hashtag" rel="nofollow noopener" target="_blank">#A11y</a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#AI</a> <a href="https://mastodon.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#SpeechRecognition</a> <a href="https://mastodon.social/tags/EdTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#EdTech</a>

The Conversation U.S.Speech recognition systems struggle with accents and dialects, risking problems in critical fields like healthcare and emergency services. Imagine calling 911 and the AI used to screen out non-emergency calls can’t understand you. A Spanish language professor explains: <a href="https://theconversation.com/sorry-i-didnt-get-that-ai-misunderstands-some-peoples-words-more-than-others-239281" rel="nofollow noopener" translate="no" target="_blank">https://theconversation.com/sorry-i-didnt-get-that-ai-misunderstands-some-peoples-words-more-than-others-239281</a> <a href="https://newsie.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#AI</a> <a href="https://newsie.social/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#speechrecognition</a>

Frühere Suchanfragen

Suchoptionen

Verwaltet von:

Serverstatistik:

#speechrecognition