Forums

Zac Zuo

18h ago

Computer Use in Claude Code - Let Claude use your computer from the CLI

Enable computer use in the Claude Code CLI so Claude can open apps, click, type, and see your screen on macOS. Test native apps, debug visual issues, and automate GUI-only tools without leaving your terminal.
Zac Zuo

18h ago

Qwen3.5-Omni - A native omni model for voice, video, and tools

Qwen3.5-Omni is Qwen"s new native omni model for text, images, audio, and video, with stronger multilingual speech, realtime voice interaction, web search, function calling, voice cloning, and long-context audio/video understanding.
Zac Zuo

2d ago

FreeCAD 1.1 - Extremely powerful, completely free 3D CAD modeling

FreeCAD 1.1 is a massive update to the highly capable, free, and open-source 3D CAD/CAM/FEA modeler. It introduces major quality-of-life improvements including transparent previews, interactive draggers, new CAM tools, and enhanced assembly features.

Would you read a topic digest newsletter?

Hello Product Hunt! We are thinking of spinning up topic specific, weekly digest newsletters that break up the firehose of goodness that is the Product Hunt leaderboard. What topics would you subscribe to? Who would you like to see sponsor these newsletters?

Here is an early prototype of what an AI Agent Digest newsletter might look like: https://gist.github.com/kerzhner...

Zac Zuo

5d ago

Suno v5.5 - Create with your voice, tune models to your sound

Suno v5.5 is its most personal music model yet. Use your own voice, train custom models on your catalog, and let My Taste learn what you actually like, so the songs feel less generic and much more like you.
Zac Zuo

5d ago

Gemini 3.1 Flash Live - Making audio AI more natural and reliable

Gemini 3.1 Flash Live is Google’s new state-of-the-art native audio model. Built for low-latency, real-time dialogue, it excels at complex reasoning and function calling. It is the exact engine currently powering Gemini Live and Google Search Live.
Zac Zuo

4d ago

Cohere Transcribe - New state-of-the-art in open source speech recognition

Cohere Transcribe is a state-of-the-art, 2B open-weights speech recognition model. Optimized for enterprise workloads, it delivers high throughput and a leading 5.42% WER across 14 languages, making it ideal for private, local, or desktop deployment.
Product Huntp/producthuntGabe Perez

6d ago

Introducing Randomized Leaderboard Day on Product Hunt!

If you re launching today, the leaderboard is about to get a lot more interesting.

We are running a Randomized Day to give products launching more of an opportunity to get seen!

The Mechanics

To level the playing field, we are cycling the homepage layout throughout the day:
The Loop: This cycle repeats every 30 minutes, all day long.

Product Huntp/producthuntGabe Perez

6d ago

Introducing Randomized Leaderboard Day on Product Hunt!

If you re launching today, the leaderboard is about to get a lot more interesting.

We are running a Randomized Day to give products launching more of an opportunity to get seen!

The Mechanics

To level the playing field, we are cycling the homepage layout throughout the day:
The Loop: This cycle repeats every 30 minutes, all day long.

Zac Zuo

6d ago

Arm AGI CPU - The world’s most efficient agentic CPU

The Arm AGI CPU is production silicon for AI infrastructure, delivering high performance and extreme density for agentic AI in modern data centers.
Zac Zuo

6d ago

Spotify SongDNA - The interactive creative network behind your favorite music

Spotify's SongDNA is a new interactive feature that reveals the creative lineage of your favorite tracks. Built into the Now Playing view, it lets you explore the writers, producers, samples, and the entire human network that brought a song to life.
Zac Zuo

7d ago

Uni-1 by Luma - A unified foundation model that thinks in pixels

Uni-1 is the new unified image model from Luma for generation and editing. It reasons through prompts, follows references closely, and handles style, text, memes, and manga unusually well, so outputs feel less generic and more usable for real creative work.
Zac Zuo

8d ago

Library in ChatGPT - Find and reuse files across all your ChatGPT conversations

Library in ChatGPT gives your uploads and created files one place to live, so you can browse, search, reuse, and attach them again without hunting through old threads.
Zac Zuo

9d ago

WeixinClawBot - The official WeChat pipeline for OpenClaw

WeixinClawBot is an official plugin that connects OpenClaw directly to WeChat/Weixin. It provides a native, sanctioned pipeline to interact with your local or cloud-based AI agents right from your chat list, turning WeChat into a universal AI interface.
Zac Zuo

7d ago

Magine - Spawn vision-enabled AI agents autonomously browsing the web

A cloud of orchestrated, vision-enabled AI agents - autonomously browsing the web like a human would. /\_/\ ( ^.^ ) -> visit magine.cloud = " = Magine AI is purposely built for autonomous zero-human interference where AI can now see, dream, train in real-time, and think like humans where the internet will be for bots humans are the watchers.
Zac Zuo

11d ago

How do you like the new face of Kitty Coin?

Hi everyone!

A few days ago I spotted @rohanrecommends sharing PH s brand new Kitty Coin leaderboard. This is definitely one of the biggest changes on PH recently.

Now it s baked right into every profile homepage:

Zac Zuo

12d ago

MAI-Image-2 - Microsoft's top-tier text-to-image model for creatives

MAI-Image-2 is Microsoft's new text-to-image model built with photographers, designers, and visual storytellers in mind. It pushes hard on photoreal lighting, reliable in-image text, and rich cinematic scenes for actual creative work.
Zac Zuo

12d ago

Claude Code Channels - Push events and chat with Claude Code via Telegram & Discord

Claude Code Channels let you control your local coding session from anywhere. Using MCP servers, you can bridge Claude to Telegram and Discord to push events, receive alerts, and reply to your terminal assistant directly from your phone.
Zac Zuo

13d ago

MiMo-V2-Pro & Omni - Xiaomi's flagship agentic and omni-modal foundation models

MiMo-V2-Pro and MiMo-V2-Omni are Xiaomi’s new agent foundation models. Pro is built for long-chain coding, tool use, and OpenClaw-style workflows, while Omni adds vision and audio to push the same agentic stack into the real world.
Zac Zuo

13d ago

Machine Payments Protocol - The internet-native payment standard for AI agents

Machine Payments Protocol (MPP) is the open standard that lets AI agents pay for services programmatically.