Yaz

2.9K posts

Yaz banner
Yaz

Yaz

@yazcal

building https://t.co/1EqOUSvVHR · cofounder @ https://t.co/Tq3tWyzfao prev. vulnzap · research @asu quis custodiet ipsas machinas?

🇪🇺 ➝ 🇺🇸 Katılım Kasım 2021
3.6K Takip Edilen1.2K Takipçiler
Yaz
Yaz@yazcal·
@wolfiesch 2bdr apartment with in-site pool/laundry is about 3300
English
1
0
1
86
wolfie
wolfie@wolfiesch·
@yazcal >per bedroom Do you have housemates? That's a great price if not
English
1
0
0
130
Yaz
Yaz@yazcal·
Who needs caffeine when this is the first thing you see from bed? $1600/mo per bedroom. 35-min bike + ferry to Fort Mason, SF (aka f.inc lab). We are seriously not living in Europe.
Yaz tweet media
English
9
2
106
14.4K
Daniel
Daniel@growing_daniel·
Every other airport reminds me how good SFO is
English
47
11
503
25.9K
jacky
jacky@jjacky·
@thesimonkim yc companies all use the same production/film companies
English
1
0
2
230
Simon Kim
Simon Kim@thesimonkim·
all tech vids starting to look the same
English
28
4
112
6K
Yaz
Yaz@yazcal·
.@posthog thanks for the startup program and the free $1k founder swag kit - which, at this rate, is going to be refunded to you in 20 days because I shipped it to my family’s home 😭
Yaz tweet media
English
0
0
2
101
Yaz
Yaz@yazcal·
@jjacky Rush hour over the Golden Gate (18mi to mission) can definitely be a pain, yep. But if you’ve got a family, it’s a great opt - super safe neighborhood, and the ferry is nice too. from what I’ve heard, it’s best if you’re going in 4 days a week or less, or not doing a strict 9–5;)
Yaz tweet media
English
0
0
0
52
jacky
jacky@jjacky·
@yazcal Would love to own a home here one day... Is getting into the city annoying?
English
1
0
0
72
jacky
jacky@jjacky·
maybe chmod should add a new permission scope and grant just for ai agents seems like it would solve a lot of issues like you can lock down dot env so agent can't read or edit it, but it's still accessible to everything else
English
3
0
3
455
jacky
jacky@jjacky·
@yazcal Berkeley? Sorry not super familiar with the geography
English
1
0
1
76
Yaz
Yaz@yazcal·
@jjacky right opposite side;)
English
1
0
0
285
Yaz
Yaz@yazcal·
(beldevere tiburon)
English
0
0
1
335
Cognition
Cognition@cognition·
We’re releasing SWE-1.6, our best model in both intelligence & model UX. SWE-1.6 matches our Preview model on SWE-Bench Pro while dramatically improving on various behavioral axes. It’s available today in Windsurf in two modes: free tier (200 tok/s) and fast tier (950 tok/s).
Cognition tweet media
English
49
64
770
224.5K
Yaz retweetledi
Matthew Berman
Matthew Berman@MatthewBerman·
I’m glad America has Mythos.
English
65
29
742
28.1K
Yaz
Yaz@yazcal·
There are only two coherent paths: commoditizing a model like this so the defensive uplift diffuses broadly, even at Tparams, or keeping it closed for the few who can pay until Chinese labs build their own closed variants and hand the offensive edge to the operators they prefer.
Anthropic@AnthropicAI

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

English
0
0
1
227
Adil Mania.
Adil Mania.@adilmania·
quick sneak peek of my previous life (before being the yellow guy 💛) where i was used to pitch my ventures to @EmmanuelMacron 😂 this was fun lol
English
16
1
116
9.2K
Milo Smith
Milo Smith@mil000·
I need to make a list of software ands stuff I actually like and isn't just noise / slop
English
9
0
54
4.3K
Yaz
Yaz@yazcal·
this fixes where code runs but not what it can do 100 agents = thousands of authz decisions per second. RBAC was built for humans clicking buttons. agents need per-action checks in <10ms or your swarm is sitting idle everyone's solving the easy problem (ephemeral envs) and ignoring the hard one (authz at machine speed) veto.so - deterministic-first, works with every major framework in 2 lines (see git.veto.so OSS too)
English
0
0
0
19
Larsen Cundric
Larsen Cundric@larsencc·
At @browser_use we run millions of parallel agents. We think about parallelism all day. That's the job. It got me thinking about something most teams aren't talking about yet. Picture a 50 person engineering org where everyone uses AI coding assistants. Each developer kicks off a few agents at once. Suddenly you have hundreds of concurrent code changes being generated, tested, and pushed. Now ask yourself: where does all that code get validated? For most companies, the answer is a single staging environment. Maybe two if they're lucky. That's a massive mismatch. Your development throughput just 10x'd, but your validation layer stayed exactly the same. Agents sit idle waiting their turn. Context windows expire. CI pipelines pile up. The productivity gains you paid for evaporate in a queue. This isn't an agent problem. It's an infrastructure problem. Staging was built for a world where one person ships one thing, gets feedback, iterates. That model breaks when the workload becomes parallel by default. I've seen this pattern before, even pre-AI. At Flexport, the product was so large you couldn't run it locally. Every engineer got their own cloud dev environment. Docker containers spun up on demand, with switches to toggle which services you needed. Not because it was fancy. Because one shared environment for hundreds of engineers simply didn't work. Now give each of those engineers 3 agents (or more ofc). Teams keep throwing money at better models and faster agents while ignoring the chokepoint sitting right behind them. You invested in 10x development speed and got 2x back because the rest is stuck waiting. The answer is ephemeral, isolated environments. One per agent. Spun up in seconds, torn down when done. Only the services that changed get deployed. No shared state, no queue, no conflicts. Every serious engineering org will need this. Most haven't even started thinking about it. So who's building this? Because most teams are holding together shared environments with duct tape and hoping it scales. If you're working on this or running into it, I want to hear from you.
English
58
22
287
40.3K
Yaz
Yaz@yazcal·
@ericrovner agents don't need better prompts they need fewer permissions
English
1
0
0
38
Eric Rovner
Eric Rovner@ericrovner·
This guy spent hours writing detailed instructions for his AI agent. It read them, understood them, and ignored them anyway. Here’s why. Your instructions aren’t code. They don’t execute. They compete for attention with every prompt you send, and the prompt wins. The longer your instructions, the easier they get lost. On Cowork, your MD files are only loaded at the start not every turn, so compaction makes it worse. As sessions grow, your rules get watered down with each cycle. Short instructions survive. Long ones get summarized into nothing. The longer you go, the further the model drifts from your rules.
Eric Rovner tweet media
English
1
0
1
77
Ryan
Ryan@ohryansbelt·
Delve, a YC-backed compliance startup that raised $32 million, has been accused of systematically faking SOC 2, ISO 27001, HIPAA, and GDPR compliance reports for hundreds of clients. According to a detailed Substack investigation by DeepDelver, a leaked Google spreadsheet containing links to hundreds of confidential draft audit reports revealed that Delve generates auditor conclusions before any auditor reviews evidence, uses the same template across 99.8% of reports, and relies on Indian certification mills operating through empty US shells instead of the "US-based CPA firms" they advertise. Here's the breakdown: > 493 out of 494 leaked SOC 2 reports allegedly contain identical boilerplate text, including the same grammatical errors and nonsensical sentences, with only a company name, logo, org chart, and signature swapped in > Auditor conclusions and test procedures are reportedly pre-written in draft reports before clients even provide their company description, which would violate AICPA independence rules requiring auditors to independently design tests and form conclusions > All 259 Type II reports claim zero security incidents, zero personnel changes, zero customer terminations, and zero cyber incidents during the observation period, with identical "unable to test" conclusions across every client > Delve's "US-based auditors" are actually Accorp and Gradient, described as Indian certification mills operating through US shell entities. 99%+ of clients reportedly went through one of these two firms over the past 6 months > The platform allegedly publishes fully populated trust pages claiming vulnerability scanning, pentesting, and data recovery simulations before any compliance work has been done > Delve pre-fabricates board meeting minutes, risk assessments, security incident simulations, and employee evidence that clients can adopt with a single click, according to the author > Most "integrations" are just containers for manual screenshots with no actual API connections. The author describes the platform as a "SOC 2 template pack with a thin SaaS wrapper" > When the leak was exposed, CEO Karun Kaushik emailed clients calling the allegations "falsified claims" from an "AI-generated email" and stated no sensitive data was accessed, while the reports themselves contained private signatures and confidential architecture diagrams > Companies relying on these reports could face criminal liability under HIPAA and fines up to 4% of global revenue under GDPR for compliance violations they believed were resolved > When clients threaten to leave, Delve reportedly pairs them with an external vCISO for manual off-platform work, which the author argues proves their own platform can't deliver real compliance > Delve's sales price dropped from $15,000 to $6,000 with ISO 27001 and a penetration test thrown in when a client mentioned considering a competitor
Ryan tweet media
erin griffith@eringriffith

A detailed and brutal look at the tactics of buzzy AI compliance startup Delve "Delve built a machine designed to make clients complicit without their knowledge, to manufacture plausible deniability while producing exactly the opposite." substack.com/home/post/p-19…

English
401
723
8.2K
5.7M