Yaz

2.9K posts

Yaz

@yazcal

building https://t.co/1EqOUSvVHR · cofounder @ https://t.co/Tq3tWyzfao prev. vulnzap · research @asu quis custodiet ipsas machinas?

🇪🇺 ➝ 🇺🇸 Katılım Kasım 2021

3.6K Takip Edilen1.2K Takipçiler

Yaz@yazcal·15h

@wolfiesch 2bdr apartment with in-site pool/laundry is about 3300

English

wolfie@wolfiesch·16h

@yazcal >per bedroom Do you have housemates? That's a great price if not

English

130

Yaz@yazcal·1d

Who needs caffeine when this is the first thing you see from bed? $1600/mo per bedroom. 35-min bike + ferry to Fort Mason, SF (aka f.inc lab). We are seriously not living in Europe.

English

106

14.4K

Yaz@yazcal·18h

@growing_daniel Hb SJC??

Indonesia

Daniel@growing_daniel·1d

Every other airport reminds me how good SFO is

English

503

25.9K

Yaz@yazcal·18h

@jjacky @thesimonkim @wordisbonz was the pioneer of all

English

jacky@jjacky·19h

@thesimonkim yc companies all use the same production/film companies

English

230

Simon Kim@thesimonkim·20h

all tech vids starting to look the same

English

112

Yaz@yazcal·18h

.@posthog thanks for the startup program and the free $1k founder swag kit - which, at this rate, is going to be refunded to you in 20 days because I shipped it to my family’s home 😭

English

101

Yaz@yazcal·18h

@jjacky Rush hour over the Golden Gate (18mi to mission) can definitely be a pain, yep. But if you’ve got a family, it’s a great opt - super safe neighborhood, and the ferry is nice too. from what I’ve heard, it’s best if you’re going in 4 days a week or less, or not doing a strict 9–5;)

English

jacky@jjacky·18h

@yazcal Would love to own a home here one day... Is getting into the city annoying?

English

Yaz@yazcal·18h

@jjacky working on it to make it kernel-level, docs.veto.so

English

jacky@jjacky·19h

maybe chmod should add a new permission scope and grant just for ai agents seems like it would solve a lot of issues like you can lock down dot env so agent can't read or edit it, but it's still accessible to everything else

English

455

Yaz@yazcal·18h

@jjacky

QME

jacky@jjacky·19h

@yazcal Berkeley? Sorry not super familiar with the geography

English

Yaz@yazcal·19h

@jjacky right opposite side;)

English

285

jacky@jjacky·23h

@yazcal sausalito?

Filipino

839

Yaz@yazcal·19h

(beldevere tiburon)

English

335

Yaz@yazcal·5d

@gbengaolajide01 @cognition i think i had modified github.com/rsvedant/openc… (don't recommend now though)

English

Gbenga@gbengaolajide01·5d

@yazcal @cognition inside opencode? how?

English

Cognition@cognition·6d

We’re releasing SWE-1.6, our best model in both intelligence & model UX. SWE-1.6 matches our Preview model on SWE-Bench Pro while dramatically improving on various behavioral axes. It’s available today in Windsurf in two modes: free tier (200 tok/s) and fast tier (950 tok/s).

English

770

224.5K

Yaz retweetledi

Matthew Berman@MatthewBerman·6d

I’m glad America has Mythos.

English

742

28.1K

Yaz@yazcal·5d

There are only two coherent paths: commoditizing a model like this so the defensive uplift diffuses broadly, even at Tparams, or keeping it closed for the few who can pay until Chinese labs build their own closed variants and hand the offensive edge to the operators they prefer.

Anthropic@AnthropicAI

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

English

227

Yaz@yazcal·6d

@zwiebelhelm @adilmania @EmmanuelMacron can’t dm you max übrigens

Deutsch

Max Scherf@zwiebelhelm·6d

@yazcal @adilmania @EmmanuelMacron 🤣

QME

Adil Mania.@adilmania·6d

quick sneak peek of my previous life (before being the yellow guy 💛) where i was used to pitch my ventures to @EmmanuelMacron 😂 this was fun lol

English

116

9.2K

Yaz@yazcal·6d

@adilmania @zwiebelhelm @EmmanuelMacron add healthcare + unemployment benefits lmao

English

Adil Mania.@adilmania·6d

@zwiebelhelm @EmmanuelMacron food is good in EU 😂

English

Yaz@yazcal·24 Mar

@mil000 go ahead

English

Milo Smith@mil000·23 Mar

I need to make a list of software ands stuff I actually like and isn't just noise / slop

English

4.3K

Yaz@yazcal·23 Mar

this fixes where code runs but not what it can do 100 agents = thousands of authz decisions per second. RBAC was built for humans clicking buttons. agents need per-action checks in <10ms or your swarm is sitting idle everyone's solving the easy problem (ephemeral envs) and ignoring the hard one (authz at machine speed) veto.so - deterministic-first, works with every major framework in 2 lines (see git.veto.so OSS too)

English

Larsen Cundric@larsencc·22 Mar

At @browser_use we run millions of parallel agents. We think about parallelism all day. That's the job. It got me thinking about something most teams aren't talking about yet. Picture a 50 person engineering org where everyone uses AI coding assistants. Each developer kicks off a few agents at once. Suddenly you have hundreds of concurrent code changes being generated, tested, and pushed. Now ask yourself: where does all that code get validated? For most companies, the answer is a single staging environment. Maybe two if they're lucky. That's a massive mismatch. Your development throughput just 10x'd, but your validation layer stayed exactly the same. Agents sit idle waiting their turn. Context windows expire. CI pipelines pile up. The productivity gains you paid for evaporate in a queue. This isn't an agent problem. It's an infrastructure problem. Staging was built for a world where one person ships one thing, gets feedback, iterates. That model breaks when the workload becomes parallel by default. I've seen this pattern before, even pre-AI. At Flexport, the product was so large you couldn't run it locally. Every engineer got their own cloud dev environment. Docker containers spun up on demand, with switches to toggle which services you needed. Not because it was fancy. Because one shared environment for hundreds of engineers simply didn't work. Now give each of those engineers 3 agents (or more ofc). Teams keep throwing money at better models and faster agents while ignoring the chokepoint sitting right behind them. You invested in 10x development speed and got 2x back because the rest is stuck waiting. The answer is ephemeral, isolated environments. One per agent. Spun up in seconds, torn down when done. Only the services that changed get deployed. No shared state, no queue, no conflicts. Every serious engineering org will need this. Most haven't even started thinking about it. So who's building this? Because most teams are holding together shared environments with duct tape and hoping it scales. If you're working on this or running into it, I want to hear from you.

English

287

40.3K

Yaz@yazcal·21 Mar

@ericrovner agents don't need better prompts they need fewer permissions

English

Eric Rovner@ericrovner·21 Mar

This guy spent hours writing detailed instructions for his AI agent. It read them, understood them, and ignored them anyway. Here’s why. Your instructions aren’t code. They don’t execute. They compete for attention with every prompt you send, and the prompt wins. The longer your instructions, the easier they get lost. On Cowork, your MD files are only loaded at the start not every turn, so compaction makes it worse. As sessions grow, your rules get watered down with each cycle. Short instructions survive. Long ones get summarized into nothing. The longer you go, the further the model drifts from your rules.

English

Yaz@yazcal·20 Mar

@ohryansbelt I had warned Cluely..

English

373

Ryan@ohryansbelt·20 Mar

Delve, a YC-backed compliance startup that raised $32 million, has been accused of systematically faking SOC 2, ISO 27001, HIPAA, and GDPR compliance reports for hundreds of clients. According to a detailed Substack investigation by DeepDelver, a leaked Google spreadsheet containing links to hundreds of confidential draft audit reports revealed that Delve generates auditor conclusions before any auditor reviews evidence, uses the same template across 99.8% of reports, and relies on Indian certification mills operating through empty US shells instead of the "US-based CPA firms" they advertise. Here's the breakdown: > 493 out of 494 leaked SOC 2 reports allegedly contain identical boilerplate text, including the same grammatical errors and nonsensical sentences, with only a company name, logo, org chart, and signature swapped in > Auditor conclusions and test procedures are reportedly pre-written in draft reports before clients even provide their company description, which would violate AICPA independence rules requiring auditors to independently design tests and form conclusions > All 259 Type II reports claim zero security incidents, zero personnel changes, zero customer terminations, and zero cyber incidents during the observation period, with identical "unable to test" conclusions across every client > Delve's "US-based auditors" are actually Accorp and Gradient, described as Indian certification mills operating through US shell entities. 99%+ of clients reportedly went through one of these two firms over the past 6 months > The platform allegedly publishes fully populated trust pages claiming vulnerability scanning, pentesting, and data recovery simulations before any compliance work has been done > Delve pre-fabricates board meeting minutes, risk assessments, security incident simulations, and employee evidence that clients can adopt with a single click, according to the author > Most "integrations" are just containers for manual screenshots with no actual API connections. The author describes the platform as a "SOC 2 template pack with a thin SaaS wrapper" > When the leak was exposed, CEO Karun Kaushik emailed clients calling the allegations "falsified claims" from an "AI-generated email" and stated no sensitive data was accessed, while the reports themselves contained private signatures and confidential architecture diagrams > Companies relying on these reports could face criminal liability under HIPAA and fines up to 4% of global revenue under GDPR for compliance violations they believed were resolved > When clients threaten to leave, Delve reportedly pairs them with an external vCISO for manual off-platform work, which the author argues proves their own platform can't deliver real compliance > Delve's sales price dropped from $15,000 to $6,000 with ISO 27001 and a penetration test thrown in when a client mentioned considering a competitor

erin griffith@eringriffith

A detailed and brutal look at the tactics of buzzy AI compliance startup Delve "Delve built a machine designed to make clients complicit without their knowledge, to manufacture plausible deniability while producing exactly the opposite." substack.com/home/post/p-19…

English

401

723

8.2K

5.7M

Keşfet

@wolfiesch @growing_daniel @jjacky @thesimonkim @wordisbonz @posthog @gbengaolajide01 @cognition