Enable computer use in the Claude Code CLI so Claude can open apps, click, type, and see your screen on macOS. Test native apps, debug visual issues, and automate GUI-only tools without leaving your terminal.
Qwen3.5-Omni is Qwen"s new native omni model for text, images, audio, and video, with stronger multilingual speech, realtime voice interaction, web search, function calling, voice cloning, and long-context audio/video understanding.
FreeCAD 1.1 is a massive update to the highly capable, free, and open-source 3D CAD/CAM/FEA modeler. It introduces major quality-of-life improvements including transparent previews, interactive draggers, new CAM tools, and enhanced assembly features.
Hello Product Hunt! We are thinking of spinning up topic specific, weekly digest newsletters that break up the firehose of goodness that is the Product Hunt leaderboard. What topics would you subscribe to? Who would you like to see sponsor these newsletters?
Here is an early prototype of what an AI Agent Digest newsletter might look like: https://gist.github.com/kerzhner...
Suno v5.5 is its most personal music model yet. Use your own voice, train custom models on your catalog, and let My Taste learn what you actually like, so the songs feel less generic and much more like you.
Gemini 3.1 Flash Live is Google’s new state-of-the-art native audio model. Built for low-latency, real-time dialogue, it excels at complex reasoning and function calling. It is the exact engine currently powering Gemini Live and Google Search Live.
Cohere Transcribe is a state-of-the-art, 2B open-weights speech recognition model. Optimized for enterprise workloads, it delivers high throughput and a leading 5.42% WER across 14 languages, making it ideal for private, local, or desktop deployment.
Spotify's SongDNA is a new interactive feature that reveals the creative lineage of your favorite tracks. Built into the Now Playing view, it lets you explore the writers, producers, samples, and the entire human network that brought a song to life.
Uni-1 is the new unified image model from Luma for generation and editing. It reasons through prompts, follows references closely, and handles style, text, memes, and manga unusually well, so outputs feel less generic and more usable for real creative work.
Library in ChatGPT gives your uploads and created files one place to live, so you can browse, search, reuse, and attach them again without hunting through old threads.
WeixinClawBot is an official plugin that connects OpenClaw directly to WeChat/Weixin. It provides a native, sanctioned pipeline to interact with your local or cloud-based AI agents right from your chat list, turning WeChat into a universal AI interface.
A cloud of orchestrated, vision-enabled AI agents - autonomously browsing the web like a human would.
/\_/\
( ^.^ ) -> visit magine.cloud
= " =
Magine AI is purposely built for autonomous zero-human interference where AI can now see, dream, train in real-time, and think like humans where the internet will be for bots humans are the watchers.
MAI-Image-2 is Microsoft's new text-to-image model built with photographers, designers, and visual storytellers in mind. It pushes hard on photoreal lighting, reliable in-image text, and rich cinematic scenes for actual creative work.
Claude Code Channels let you control your local coding session from anywhere. Using MCP servers, you can bridge Claude to Telegram and Discord to push events, receive alerts, and reply to your terminal assistant directly from your phone.
MiMo-V2-Pro and MiMo-V2-Omni are Xiaomi’s new agent foundation models. Pro is built for long-chain coding, tool use, and OpenClaw-style workflows, while Omni adds vision and audio to push the same agentic stack into the real world.