All activity
WUPHF is a collaborative office of AI employees who build and maintain their own knowledge base to never lose context for the tasks you give them.
Supports Claude Code, Codex, OpenClaw agents and local LLMs via OpenCode.
Chat with your agents via TUI, Web or Telegram.
Open source. Runs on your machine, with your keys.
WUPHF by Nex.aiAI employees who build their own knowledge base
Najmuzzaman Mohammadleft a comment
The structured spec approach is a smart bet. Most agent frameworks right now ask you to wire everything imperatively, which makes it really hard to reason about what the agent is supposed to do versus what it actually does. Curious how the spec handles cases where an agent needs to adapt its behavior based on context it did not have at definition time. Is there a way to express conditional...

LogicBuild and operate fleets of agents
Najmuzzaman Mohammadleft a comment
The knowledge ingestion piece is interesting. When you bring in docs from drives and websites, how does the system handle freshness? Is it a one-time import or does it stay in sync with the source? That feels like the part that makes or breaks trust in agent answers over time.
Jet AI AgentsBuild business AI agents in minutes
Najmuzzaman Mohammadleft a comment
The part about early users walking away when you turn the wheel is a real one, and most public retrospectives skip it. Did you find the users who stuck around through the pivot were measurably different in how they used the product, or was it more about how invested they already were in the original promise? That distinction would seem to change a lot about how you communicate the next big change.
6 Months After Getting #1 on Product Hunt, What Really Happened?
Yuze Join the discussion
Najmuzzaman Mohammadleft a comment
The framing of hardware never getting the abstraction layers software did really lands. Curious about the boundary you draw between what the chat layer generates versus what is baked in at the module level. When I add a new sensor module, does the model see the schema and rewrite firmware on the fly, or is it more declarative than that?

AtechSnap-together electronics built from a chat
Najmuzzaman Mohammadleft a comment
Having every PR consolidated across providers in one menubar is the kind of thing I keep meaning to build for myself and never finish. Curious how you handle review state synchronization when reviewers are working in the actual web UIs in parallel. Do you poll on focus, or is there something smarter going on under the hood?

GitBarEvery pull request, one menubar. GitHub, GitLab & Azure
Najmuzzaman Mohammadleft a comment
Hey PH. I am Nazz - the Creator 👋. I built WUPHF for myself at Nex. I had five Claude Code windows, a Codex session, and a couple of OpenClaw agents running at the same time, and I was re-pasting the same context into all of them (lots of token, energy and mental health burn). When I wanted them to coordinate, I was the manual relay. Copy the engineer's update, paste to the PM, paste to the GTM...
WUPHF by Nex.aiAI employees who build their own knowledge base
Najmuzzaman Mohammadleft a comment
The observation that raw memory without curation turns into noise matches what I keep seeing in practice. I am curious how Memory Studio handles the refresh cycle. When a project constraint changes six months in, do users have to manually edit the bucket, or does the system flag contradictions it detects between new conversation turns and the stored context?
Most people's AI memory is a disaster they don't even realize they have.
hira siddiquiJoin the discussion
Najmuzzaman Mohammadleft a comment
The Orchestrator and Task Owner split is a cleaner hierarchy than I usually see in swarm setups. I am curious how the Orchestrator decides when to fan out versus keep something in a single thread. Is that driven by the task graph it builds up front, or is there a heuristic about codebase scope that triggers parallel branching?

Cosine SwarmParallel AI agents for long-horizon, complex software tasks
Najmuzzaman Mohammadleft a comment
The biological decay framing is interesting. I am curious how you calibrate the decay rate across different use cases. A long refactor probably needs a slower decay than a quick debugging session. Does the graph engine infer that from access patterns, or is it something the user tunes directly?
I got tired of my AI agents "losing the plot," so I gave them a brain
sachit mishraJoin the discussion
Najmuzzaman Mohammadleft a comment
Reading every git diff line by line before merging. I know AI can summarize PRs and flag risky changes, and I have even built review workflows that do exactly that. But I still open the diff myself every single time. Part of it is trust, part of it is that the act of reading code written by someone else is how I stay calibrated on what the codebase actually looks like right now. The moment I...
What's something you're embarrassed to admit you still do manually even though AI could do it?
Imed RadhouaniJoin the discussion
Najmuzzaman Mohammadleft a comment
Using the Ebbinghaus curve for context decay is a genuinely clever framing. The biggest failure mode I have seen with agent memory is not forgetting too much but forgetting the wrong things. Architectural decisions that were made months ago and rarely referenced can still be load-bearing. Does the graph engine help protect those kinds of low-frequency but high-importance memories from decay?

YourMemoryAI that remembers and forgets like humans.
Najmuzzaman Mohammadleft a comment
The orchestrator/task owner/worker hierarchy is a smart decomposition for parallel agent work. Most multi-agent coding setups I have tried fall apart when two subagents touch the same files. Curious how Cosine handles merge conflicts between parallel workers. Does the orchestrator prevent overlapping file edits upfront, or is there a resolution layer that reconciles after the fact?

Cosine SwarmParallel AI agents for long-horizon, complex software tasks
Najmuzzaman Mohammadleft a comment
This is painfully relatable. The instinct to build enterprise features before having enterprise customers is one of the most common traps I have seen in B2B SaaS. The features themselves are not the problem. The problem is treating them as a growth strategy instead of a retention tool. Enterprise features work when existing customers are actively asking for them because they need SSO or audit...
We spent 6 months building for enterprise. Nobody bought it.
Imed RadhouaniJoin the discussion
Najmuzzaman Mohammadleft a comment
Really interesting approach to benchmarking cloud browser providers. I have been evaluating a few of these services for running automated workflows and the lack of standardized benchmarks has made it difficult to compare them objectively. The fact that all results and code are public is a huge plus. Do you plan to add latency benchmarks for dynamic page interactions, or is the focus mainly on...

Browser ArenaOpen-source benchmarks for cloud browser infrastructure
Najmuzzaman Mohammadleft a comment
Trust for me is less about the category of task and more about reversibility. I will happily let an agent draft emails, reorganize files, or write code all day. But the moment an action is hard to undo, like sending money, deleting data, or posting publicly, I want a human checkpoint. The real unlock will be agents that understand their own confidence level and ask for approval only when the...
Najmuzzaman Mohammadleft a comment
The parallel agent workspace is the feature I have been waiting for. Running multiple agents side by side without them colliding on file changes has been a real pain point in every other tool I have tried. Curious how the MCP support works in practice. Does each agent get its own MCP server connection, or do they share a single pool?

Cursor 3Unified workspace for parallel local/cloud agents and MCPs
Najmuzzaman Mohammadleft a comment
The one-line-of-code setup is what sold me on trying this. I have been looking for an analytics tool that does not require a PhD in configuration just to see who is visiting my site in real time. Curious about one thing though: how does Sleek handle high-traffic spikes? Does the real-time view stay responsive when you are getting thousands of concurrent visitors?

Sleek AnalyticsSee who's on your site. Right now.
Najmuzzaman Mohammadleft a comment
Great list. The Plurality MCP setup with Claude Desktop is a good find. Being able to pull context directly into the agent session without switching tools is a big deal for workflow. The ones I have been spending the most time with lately are MCP servers that connect business tools (email, CRM, Slack) into Claude Code and other agents. The pattern I keep seeing is that the real value is not...
𝗪𝗵𝗶𝗰𝗵 𝗠𝗖𝗣 𝗦𝗲𝗿𝘃𝗲𝗿𝘀 𝗵𝗮𝘃𝗲 𝗯𝗲𝗲𝗻 𝗵𝗲𝗹𝗽𝗶𝗻𝗴 𝗺𝗲 𝘁𝗵𝗲 𝗺𝗼𝘀𝘁?
hira siddiquiJoin the discussion
Najmuzzaman Mohammadleft a comment
This is a problem that does not get enough attention yet. Everyone is focused on making agents more capable, but the question of "how do you prove they acted within policy" is going to matter a lot more as agents start touching real workflows at scale. The cryptographic verification angle is interesting. Most governance approaches I have seen are audit logs after the fact. Proving compliance at...

OpenBoxSee, verify, and govern every agent action.
