mastodontech.de is one of many independent Mastodon servers you can use to participate in the Fediverse.
Open to everyone (over 16) and provided by Markus'Blog

Server statistics:

1.5K active profiles

#multimodal

1 post · 1 participant · 0 posts today

🚗 x 🚌 On the road together: the strong partnership between stadtmobil, VVS and SSB makes mobility in Stuttgart even smarter! Many trips can be made comfortably by bus & train – and whenever things need to be flexible, spontaneous, or involve a lot of luggage, stadtmobil is the perfect complement. 🤝

When do you take the bus & train, and when do you prefer to fall back on a stadtmobil?

#stadtmobil #VVS #SSB

Want an economically competitive city? A fiscally smart city? A healthy city? A sustainable, climate-responsible & resilient city? An equitable & accessible city? A livable city? A city with more choices? A successful city today that’s positioned for a successful future? Build a #multimodal city.

Continued thread

The stadtnavi system, built on Trufi’s open-source platform, shows how cities can democratize mobility. Real-time updates, CO₂ comparisons, and weather alerts aren’t exclusive to Herrenberg—they’re open for any city to implement. The project’s success lies in its adaptability: a white-label solution that lets cities rebrand and expand it freely.

tinyurl.com/yn9t9ro7

Trufi Association · Trufi’s Tour-de-Force of Multimodal Possibilities: stadtnavi · Active transport, motorized transport, public toilets and more – they're in the stadtnavi app: bus, train, bike, car, rideshare, taxi...

SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems. Multi-modal LLM system simulates human communication using speech and generates human-like dialogues with consistent content, rhythm, & emotion.

Funnily enough, they also elaborate on a "think before you speak" design aspect. This might also be applicable to our everyday lives.

doi: 10.48550/arXiv.2401.03945

#30DayChartChallenge Day 10: Diving into the VIX Distribution! 🌊

Instead of just looking at the VIX line, today we analyze its "probability distribution" by US presidency (Clinton -> Trump 2nd). Shape is everything!

Using #rstats and #ggplot2, these faceted densities let us investigate:
* Dominant modes: What was the "normal" VIX level (the tallest peak)? Did it change much?
* Multimodality: Is there evidence of multiple volatility states (secondary peaks) within a single term? 🤔
* Tail risk: How likely was "panic" (VIX > 35)? Compare the right tails!

These patterns reflect the different volatility regimes and the perception of systemic risk. It's not just the level, but the "structure" of the uncertainty that matters!

Data: Yahoo Finance via #quantmod.
📂 Code: t.ly/kikdo
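
For the gist without following the link, here is a minimal sketch (not the posted script) of how such a chart could be built with the tools named above: quantmod for the Yahoo Finance download and ggplot2 for the faceted densities. The presidency breakpoints, the VIX > 35 "panic" line, and all styling are illustrative assumptions.

```r
library(quantmod)   # getSymbols() pulls ^VIX from Yahoo Finance
library(ggplot2)

# Daily VIX closes since Clinton's first inauguration (illustrative range)
getSymbols("^VIX", src = "yahoo", from = "1993-01-20", auto.assign = TRUE)
vix <- data.frame(date = as.Date(index(VIX)), close = as.numeric(Cl(VIX)))

# Label each observation with its presidency (inauguration-day breakpoints)
vix$presidency <- cut(
  vix$date,
  breaks = as.Date(c("1993-01-20", "2001-01-20", "2009-01-20",
                     "2017-01-20", "2021-01-20", "2025-01-20", "2100-01-01")),
  labels = c("Clinton", "Bush", "Obama", "Trump 1st", "Biden", "Trump 2nd"),
  right = FALSE
)

# Faceted densities: one panel per presidency, so shape (not just level)
# and the right tail beyond the dashed "panic" line are easy to compare
ggplot(vix, aes(x = close)) +
  geom_density(fill = "steelblue", alpha = 0.6) +
  geom_vline(xintercept = 35, linetype = "dashed") +
  facet_wrap(~ presidency, ncol = 2) +
  labs(x = "VIX close", y = "Density",
       title = "VIX distribution by US presidency")
```

Faceting by term gives each density its own panel, which is what makes secondary modes and differences in right-tail mass stand out at a glance.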

#Day10 #Multimodal #dataviz

NEWS: Meta has unveiled Llama 4, its latest AI model, featuring advanced multimodal capabilities that integrate text, video, images, and audio processing. This release includes Llama 4 Scout and Llama 4 Maverick, both open-source and designed to enhance Meta’s AI assistant across platforms like WhatsApp, Messenger, and Instagram. Is this a new benchmark in AI versatility?
#Llama4 #AI #Multimodal
ai.meta.com/blog/llama-4-multi

Meta AI · The Llama 4 herd: The beginning of a new era of natively multimodal AI innovation · We’re introducing Llama 4 Scout and Llama 4 Maverick, the first open-weight natively multimodal models with unprecedented context support and our first built using a mixture-of-experts (MoE) architecture.

🚀 Exciting news for devs & AI enthusiasts! Introducing Qwen2.5-Omni, the latest multimodal model from Alibaba Cloud 🌟. It excels in text, vision, and audio tasks—chat, image gen, speech recog, you name it! Access now via 🌐 Qwen's GitHub or try it on ModelScope. Open weights soon! #AI #Multimodal #Tech #Qwen

SmolDocling: An ultra-compact VLM for end-to-end multi-modal document conversion

arxiv.org/abs/2503.11576

arXiv.org · SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion · We introduce SmolDocling, an ultra-compact vision-language model targeting end-to-end document conversion. Our model comprehensively processes entire pages by generating DocTags, a new universal markup format that captures all page elements in their full context with location. Unlike existing approaches that rely on large foundational models, or ensemble solutions that rely on handcrafted pipelines of multiple specialized models, SmolDocling offers an end-to-end conversion for accurately capturing content, structure and spatial location of document elements in a 256M parameters vision-language model. SmolDocling exhibits robust performance in correctly reproducing document features such as code listings, tables, equations, charts, lists, and more across a diverse range of document types including business documents, academic papers, technical reports, patents, and forms -- significantly extending beyond the commonly observed focus on scientific papers. Additionally, we contribute novel publicly sourced datasets for charts, tables, equations, and code recognition. Experimental results demonstrate that SmolDocling competes with other Vision Language Models that are up to 27 times larger in size, while reducing computational requirements substantially. The model is currently available, datasets will be publicly available soon.
#HackerNews #SmolDocling #VLM