All activity
Zac Zuoleft a comment
Hi everyone! Claude can write the code, run it, open the app, click through the interface, and check what actually happened, all from the CLI. You can enable computer-use from /mcp, grant macOS Accessibility and Screen Recording, approve apps per session, and stop everything instantly with Esc at any time. Research preview for macOS on Pro and Max plans for now.

Computer Use in Claude CodeLet Claude use your computer from the CLI
Qwen3.5-Omni is Qwen"s new native omni model for text, images, audio, and video, with stronger multilingual speech, realtime voice interaction, web search, function calling, voice cloning, and long-context audio/video understanding.

Qwen3.5-OmniA native omni model for voice, video, and tools
Zac Zuoleft a comment
Hi everyone! Qwen3.5-Omni is the latest native omni model from the Qwen family. It handles text, images, audio, and video in one system, pushes hard on multilingual speech, and adds a lot of the interaction stuff that actually matters in practice: semantic interruption, realtime voice control, WebSearch, Function Calling, and voice cloning. The audio/video captioning and "audio-visual vibe...

Qwen3.5-OmniA native omni model for voice, video, and tools
Zac Zuoleft a comment
Hi everyone! I only got into FreeCAD recently, and what surprised me most is how powerful it already feels for a tool that is, quite literally, free. FreeCAD 1.1 is a HUGE release. There is a lot of real workflow polish here: transparent Part Design previews, interactive draggers for tools like Fillet and Chamfer, three-point lighting, Clarify Selection, Assembly and FEM improvements, and a...

FreeCAD 1.1Extremely powerful, completely free 3D CAD modeling
FreeCAD 1.1 is a massive update to the highly capable, free, and open-source 3D CAD/CAM/FEA modeler. It introduces major quality-of-life improvements including transparent previews, interactive draggers, new CAM tools, and enhanced assembly features.

FreeCAD 1.1Extremely powerful, completely free 3D CAD modeling
Cohere Transcribe is a state-of-the-art, 2B open-weights speech recognition model. Optimized for enterprise workloads, it delivers high throughput and a leading 5.42% WER across 14 languages, making it ideal for private, local, or desktop deployment.

Cohere TranscribeNew state-of-the-art in open source speech recognition
Zac Zuoleft a comment
I’d actually read this a lot. I already check it on PH every day, just not through email. I usually open the Newsletter tab directly. One reason I like it is that it helps me catch products I would otherwise miss. The daily leaderboard can sometimes have a strong Matthew effect, so a lot of great products in the middle or lower ranks naturally get less attention during a quick scan. The...
Would you read a topic digest newsletter?
Mike KerzhnerJoin the discussion
Zac Zuoleft a comment
Hi everyone! Suno v5.5 is really about making the experience feel more like yours. The new pieces all point in the same direction: Voices lets you create with your own voice, Custom Models let you tune the model on music you made, and My Taste starts shaping results around what you actually gravitate toward. Put together, it feels like Suno is moving toward something more like a personal...

Suno v5.5Create with your voice, tune models to your sound
Gemini 3.1 Flash Live is Google’s new state-of-the-art native audio model. Built for low-latency, real-time dialogue, it excels at complex reasoning and function calling. It is the exact engine currently powering Gemini Live and Google Search Live.

Gemini 3.1 Flash LiveMaking audio AI more natural and reliable
Zac Zuoleft a comment
Hi everyone! Cohere just open-sourced Transcribe, and the core metrics here, especially the throughput and the 5.42% average WER, are genuinely impressive. From an engineering point of view, this looks like a fantastic model for Mac/PC local apps or private enterprise servers. At 2B parameters, though, it still feels a bit heavy for raw on-device mobile deployment. It is also worth noting that...

Cohere TranscribeNew state-of-the-art in open source speech recognition
Zac Zuoleft a comment
Hi everyone! The most important thing here is simple: this is now the voice model behind Gemini Live and Google Search Live. It is the speech engine @Google is actually putting into its consumer products. Google is pitching 3.1 Flash Live as its highest-quality audio and voice model yet, with lower latency, better reasoning, and more natural dialogue. The benchmark jump is also pretty...

Gemini 3.1 Flash LiveMaking audio AI more natural and reliable
Zac Zuoleft a comment
Hi everyone! For the first time, Arm is actually shipping its own production silicon for the AI data center. ...the CPU becomes the pacing element of modern infrastructure... performance is no longer defined by a single server—it is defined at the rack level. That is the core of the whole pitch. Arm is not positioning AGI CPU as a general server part. It is framing it as a rack-first CPU for...

Arm AGI CPUThe world’s most efficient agentic CPU
Zac Zuoleft a comment
Hi everyone! SongDNA lets you explore what sits behind a song on @Spotify, not just in a technical credits sense, but as a whole creative world around the people who made it. The barrier to making music is getting lower fast, but taste, strange inspiration, and the artists who cannot be reduced to one fixed style still matter the most. Those are exactly the things worth seeing more clearly, and...

Spotify SongDNAThe interactive creative network behind your favorite music
The Arm AGI CPU is production silicon for AI infrastructure, delivering high performance and extreme density for agentic AI in modern data centers.

Arm AGI CPUThe world’s most efficient agentic CPU
Spotify's SongDNA is a new interactive feature that reveals the creative lineage of your favorite tracks. Built into the Now Playing view, it lets you explore the writers, producers, samples, and the entire human network that brought a song to life.

Spotify SongDNAThe interactive creative network behind your favorite music
Uni-1 is the new unified image model from Luma for generation and editing. It reasons through prompts, follows references closely, and handles style, text, memes, and manga unusually well, so outputs feel less generic and more usable for real creative work.

Uni-1 by LumaA unified foundation model that thinks in pixels
A cloud of orchestrated, vision-enabled AI agents - autonomously browsing the web like a human would.
/\_/\
( ^.^ ) -> visit magine.cloud
= " =
Magine AI is purposely built for autonomous zero-human interference where AI can now see, dream, train in real-time, and think like humans where the internet will be for bots humans are the watchers.
MagineSpawn vision-enabled AI agents autonomously browsing the web
Zac Zuoleft a comment
Hi everyone! With Uni-1, Luma is making a very strong statement about where image models are going. Generation without understanding can only go so far. By unifying understanding and generation in one architecture, Uni-1 is Luma's first serious step on the path toward unified intelligence. That is what makes this more interesting than a normal image model launch. A model that can actually...

Uni-1 by LumaA unified foundation model that thinks in pixels
Zac Zuoleft a comment
@sagar4nfs PPS: And for pitching itself to me! 🤖
MagineSpawn vision-enabled AI agents autonomously browsing the web


