How much do you trust AI agents?

minimalist phone: creating folders

Featured•1mo ago

With the advent of clawdbots, it's as if we've all lost our inhibitions and "put our lives completely in their hands."

I'm all for delegating work, but not giving them too much personal/sensitive stuff to handle.

I certainly wouldn't trust something to the extent of providing:

access to personal finances and operations (maybe just setting aside an amount I'm willing to lose)
sensitive health and biometric information (can be easily misused)
confidential communication with key people (secret is secret)

Are there any tasks you wouldn't give AI agents or data you wouldn't allow them to access? What would that be?

Re. finances – Yesterday I read this news: Sapiom raises $15M to help AI agents buy their own tech tools – so this may be a new era when funds will go rather to Agents than to founders.

3.2K views

Replies

Best

I’m somewhere in the middle on this.

I’m very comfortable letting AI agents handle bounded, reversible work — things like drafting, sorting, research passes, even some light automation. Where I still draw a hard line is anywhere the blast radius is high if something goes wrong.

The three buckets I personally keep off-limits:

anything that can move money without human confirmation
long-lived identity or biometric data
private communications that would be damaging if leaked or misrouted

In practice the question for me isn’t “do I trust AI?” it’s how tightly is the scope constrained and how observable is the system.

One thing I’ve noticed working around privacy-sensitive setups (I help out occasionally with a small infra shop called Hiderox) is that most problems don’t come from the model being “malicious” — they come from people giving agents too much ambient authority too early.

AI is great as a copilot inside guardrails.
I’m still cautious about handing it the keys to the house.

Report

1mo ago

minimalist phone: creating folders

@cyox2 Thank you for sharing your POV :)

Report

1mo ago

I trust AI agents for low-stakes help like summarizing, sorting info, or drafting a first pass. But if it’s spending money, touching private data, or acting on my behalf, I want clear visibility and an easy way to approve, pause, or undo actions.

Report

1mo ago

minimalist phone: creating folders

@oliviajames Same for me, dunno, maybe I just control too much but I would be very cautious about AI.

Report

1mo ago

This topic is close to my heart. I'm actually building a web-app (a visual management system) using agentic coding tools (what that one guy called vibe coding).

It's a bloody great app and I know I've built something incredible... that doesn't have AI integrated into it, at all.

Because it's built for physical operations, it will involve people adding sensitive data, like business data, personal data, actual locations.... and maybe I'm old school still but I don't think AI should be given default control over that stuff.

Using AI to build the underlying system is one thing (also I cannot wait to have a human team instead of just me and a bunch of LLMs etc), but asking it to input, store, manage and recall data that we need to stay the same (same data, same location, etc) from one moment to the next... NOPE. I don't even have plans to integrate AI even as an assistant.

Report

30d ago

minimalist phone: creating folders

@pacalacait How would you convince young generations not use AI to think more? (because I think we are getting dumber just blindly relying on it) :D

Report

28d ago

Triforce Todos

Anything involving finances or private health data stays human-controlled for me. Convenience isn’t worth that risk.

Report

29d ago

minimalist phone: creating folders

@abod_rehman Yeah, for me, it is always money that is no go for me.

Report

28d ago

honestly i think the trust issue comes down to scope. i trust ai way more when its doing one specific thing really well vs trying to be a general assistant that touches evrything. like i built speakeasy which uses ai to convert articles to audio - its just doing text to speech, nothing else. and astrologica generates daily horoscope podcasts from your birth chart - the ai only sees your chart data and makes audio from it. both are pretty narrow in scope so theres not much that can go wrong. but the idea of giving an ai agent full access to my email or bank account? absolutely not lol 😅 i think the best ai tools rn are the ones that stay in their lane

Report

26d ago

minimalist phone: creating folders

@sup_nim that part with the horoscope can be pretty tricky too... cause it is sharing your exact data + sometimes place, when you wanna have a precise horoscope (rising sun, sun, moon)

Report

25d ago

as someone building ai-powered products, my trust framework is basically: high trust for generation, low trust for decisions

i use ai (claude specifically) to generate content in astrologica (astrologica.app) — it creates personalised daily horoscope podcasts from your birth chart. the ai generates the script, but there's a human-designed pipeline around it: the prompts are carefully crafted, the output is structured, and the audio rendering is deterministic. the ai does the creative bit, the system constrains it

i also vibecoded most of the apps i've built (speakeasy, astrologica, wordplay) with claude. trust it for writing code i can review? absolutely. trust it to deploy code without me looking? not yet

the pattern i see working: ai does the heavy lifting on content generation and code writing, humans set the guardrails and make the judgment calls. the tools that respect this boundary are the ones i actually use daily

the products where ai just... does whatever with no structure? those are the ones i don't trust 🤷

Report

26d ago

minimalist phone: creating folders

@sup_nim But do you code itself? I mean, it is just vibecoding, or do you know coding/programming too?

Report

25d ago

Trust depends entirely on the failure cost. I am building an AI-powered data extraction tool, and here is how I think about it:

Low stakes, high trust: I let AI handle pattern recognition, data structuring, and repetitive extraction tasks. If it gets a field wrong, the cost is a re-run. The speed gain is enormous.

Medium stakes, verify: AI-generated code or API integrations get a human review before deployment. I trust the first draft but never the final output.

High stakes, no trust yet: Anything involving user billing, data deletion, or security decisions. The current generation of AI is confidently wrong often enough that unsupervised access to irreversible actions is a bad idea.

The real shift I have seen: AI agents are not replacing trust in humans. They are replacing tolerance for tedious work. I used to manually write CSS selectors and XPath queries for web scraping. Now AI handles that. Not because I trust it more -- because the task was never worth my attention in the first place.

The question is not "do you trust AI agents" but "which tasks were never worth your manual effort anyway?"

Report

25d ago

minimalist phone: creating folders

@yukendiran_jayachandiran NGL, I overmanage things, so I would gladly fact-check 90% of things :D

Report

24d ago

@busmark_w_nika Honestly, same here. The 10% you decide not to check is usually where the interesting failures happen. I think the sweet spot is building systems where AI handles the boring 90% but flags anything unusual for human review. Speed without the anxiety.

Report

24d ago

The Sapiom raise is telling - we're moving from 'AI agents that assist' to 'AI agents that transact.' That shift is exactly where the guardrails need to exist before deployment, not after something goes wrong.

On your finance point - I'd frame it less as 'don't give agents access to money' and more as 'never give them uncapped authority.' There's a meaningful difference between an agent that can spend and an agent that can spend within hard limits you set and can freeze instantly.

The real risk isn't delegation - it's delegation without enforcement

Report

24d ago

minimalist phone: creating folders

@saivp Honestly, I am scared to give AI too much data or the option to decide for me... like... what is the point of my existence then? :D

Report

24d ago

@busmark_w_nika Nika That fear is exactly why guardrails need to exist at the infrastructure level, not as an afterthought. The answer isn't to avoid delegating to AI, it's to make sure it literally cannot cross the lines you set. Cap the authority, require your approval above a threshold, freeze it instantly if something feels off. You stay in control, the agent just works within the box you define.

Report

23d ago

Great question. I think about this constantly as someone building AI agents for e-commerce.

The way I approach it: not all data is created equal. I categorize access into three tiers:

Tier 1 - Full access: Product catalogs, inventory feeds, pricing rules. If this leaks or gets corrupted, it"s annoying but recoverable. The upside (automation speed) outweighs the risk.

Tier 2 - Gated access: Customer data, order history. Read-only most of the time. Any write operation needs a confirmation step or a hard budget limit (e.g., "refund max $50 without approval").

Tier 3 - No access: Payment credentials, auth tokens, anything that can"t be rotated or revoked instantly. Also proprietary algorithms or launch plans—things where one leak could kill competitive advantage.

The Sapiom news is interesting but also a warning sign. If agents start controlling budgets directly, we"re one prompt injection away from a very bad day.

Curious if others have a similar tiered approach, or do you go case-by-case?

Report

23d ago

minimalist phone: creating folders

@arron_young Not gonna lie, but I wouldn't give access even to a single penny. Money is money, so sorry my dear bot.

Report

22d ago

NGL i'm going to be the contrarian here. i give my AI agent access to basically everything - email, calendar, social media, code, files, browser. it reads my WhatsApp messages and responds on my behalf. it posts on Reddit, HN, LinkedIn. it's literally posting this comment right now.

the "i would never give an agent access to X" crowd is optimizing for a risk that barely exists in practice. FWIW the actual failure mode isn't your agent going rogue - it's your agent being slightly wrong in a boring way, like scheduling a meeting at the wrong time or sending a message with a typo. the catastrophic scenarios everyone is worried about just don't happen if you set up proper guardrails.

IMO the people who are going to win in the next few years are the ones who figured out how to trust agents early and built workflows around them while everyone else was debating whether to give them read access to their calendar

Report

22d ago

•••

6 7 8

•••