Guy Podjarny

5.4K posts

Guy Podjarny banner
Guy Podjarny

Guy Podjarny

@guypod

Founder @tessl_io, shaping AI Native Dev, Host of The AI Native Dev podcast & angel investor. Previously founder @snyksec, CTO @Akamai, founder @Blaze.io

London, UK Katılım Haziran 2008
441 Takip Edilen8K Takipçiler
Sabitlenmiş Tweet
Guy Podjarny
Guy Podjarny@guypod·
I'm excited to announce Tessl's 𝐩𝐚𝐜𝐤𝐚𝐠𝐞 𝐦𝐚𝐧𝐚𝐠𝐞𝐫 𝐟𝐨𝐫 𝐚𝐠𝐞𝐧𝐭 𝐬𝐤𝐢𝐥𝐥𝐬! Install any skill, browse over 2,000 evaluated skills on the Tessl Registry, and be setup to manage those skills over time! Find out more on our blog (link in the comments), or get started with: npm i -g @ tessl/cli && tessl skill search Agent skills are powerful, and need professional dev tools to match. Don't copy skills to your repo - install them as a dependency you can update. Don't blindly use a skill and hope it works - evaluate its quality and efficacy. Don't hope someone notices you updated your skill - version it properly. Tessl enables all of this and more for skills, as well as other forms of context. Super excited to be launching this, and for the many more enhancements in the queue! Explore Skills in Tessl Registry: tessl.co/dzm
English
1
5
17
3.6K
Guy Podjarny retweetledi
Simon Maple
Simon Maple@sjmaple·
Writing a SKILL.md is without testing it, is writing it blind. Ultimately, you don't know if the agent follows it, if parts of the skill are redundant, or if it even makes things worse. We wrote a skill called skill-optimizer which solves this problem through running structured evals, comparing agent performance with and without the skill, and giving a clear score delta. It combines two approaches: a static review of the skill instructions, and real task-based evaluations that simulate realistic scenarios and grade outcomes. I used a real Fastify skill example, which @matteocollina wrote and identified regressions, diagnoses issues, applies fixes, and verifies improvements automatically, turning a 67% success rate into 94%. PR on it's way, Matteo! tessl.io/blog/stop-gues…
English
0
3
7
1K
Guy Podjarny retweetledi
AI Native Dev
AI Native Dev@ainativedev·
.@chadfowler replaced 70% of his codebase in 3 months and cut costs by 75%. His #1 rule: don't write a service longer than a page. The shorter it is, the easier it is to replace.
AI Native Dev@ainativedev

"The code that we have is a liability. The system is the asset we're building." @chadfowler, VC at Blue Yard Capital (@blueyard) and former CTO at Wunderlist, sits down with @guypod to discuss the Phoenix Architecture: software designed to be replaced rather than maintained. In this episode: • why was the code written by Chad never longer than a page • how he replaced 70% of a codebase in 3 months and cut costs by 75% • shipping AI code no human ever reviewed, and how to make it safe • the shadow specs your agents are making without you • why your system should work with the worst LLM, not just the best If you're still thinking about your codebase the old way, this one will change that. (0:00) Trailer (1:07) AI DevCon (2:01) Introduction (3:41) Origin story: euthanising legacy systems (5:45) Immutable infrastructure as inspiration (6:48) Disposable software and immutable code (9:00) Cattle versus pets for code (10:03) Making disposable code feasible at Wunderlist (12:31) Phoenix Architecture (15:16) Extreme programming lesson: do hard things constantly (17:04) What level of detail should specs have? (19:15) Pace layers and stable regeneration (22:37) New programming languages versus patterns (29:47) Compiling to system architectures (30:45) Training the programmer versus defining the system (35:03) Personalised and malleable software (37:48) Local first and shared data models (45:08) Evaluations as the real codebase (49:36) Testing the agent versus testing the system (55:38) Path of adoption (01:00:48) Wrap-up

English
1
3
17
3.8K
Guy Podjarny retweetledi
Macey Baker
Macey Baker@macebake·
My main takeaway is that skills are software, and the same rules apply. The things that make a bad skill also make a bad software component, eg. being badly scoped. "Should we factor this out" has become "Should we make a skill for this". Same stuff, different form factor
Thariq@trq212

x.com/i/article/2033…

English
1
4
2
872
Guy Podjarny retweetledi
fmerian/launch
fmerian/launch@fmerian·
Snyk founder is working on something new 👀
Guy Podjarny@guypod

Agent skills help agents use your products, build in your codebase and enforce your policies. They're the new unit of software for devs - but most are still treated like simple markdown files copied between repos with no versioning, no quality signal, no updates. Without AI evaluations, you can’t tell if a skill helps, provides minimal uplift or even degrades functionality. You spend your time course-correcting agents instead of shipping. @tessl_io is a development platform and package manager for agent skills. Today, I’m excited to launch on Product Hunt and announce that you can evaluate your skill and optimize them on Tessl. This means you can stop debugging agent output and start shipping quality code, faster. Real example: we've helped ElevenLabs ship skills that double agent success in using their APIs. If you're building a personal project, maintaining an OSS library, or developing with AI at work, you can now evaluate your skill and optimize it to help agents use it properly. Check us out on Product Hunt. If it’s useful, we’d love your upvote - and even more, your feedback in the comments: producthunt.com/products/tessl…

English
1
1
4
697
Guy Podjarny
Guy Podjarny@guypod·
Agent skills help agents use your products, build in your codebase and enforce your policies. They're the new unit of software for devs - but most are still treated like simple markdown files copied between repos with no versioning, no quality signal, no updates. Without AI evaluations, you can’t tell if a skill helps, provides minimal uplift or even degrades functionality. You spend your time course-correcting agents instead of shipping. @tessl_io is a development platform and package manager for agent skills. Today, I’m excited to launch on Product Hunt and announce that you can evaluate your skill and optimize them on Tessl. This means you can stop debugging agent output and start shipping quality code, faster. Real example: we've helped ElevenLabs ship skills that double agent success in using their APIs. If you're building a personal project, maintaining an OSS library, or developing with AI at work, you can now evaluate your skill and optimize it to help agents use it properly. Check us out on Product Hunt. If it’s useful, we’d love your upvote - and even more, your feedback in the comments: producthunt.com/products/tessl…
Guy Podjarny tweet media
English
1
0
10
1.2K
Tudor Barbu
Tudor Barbu@dirtyculture·
@sentientt_media how's the quality of the skills tho? are they battle-tested or just random uploads??
English
3
0
21
4.2K
Sentient
Sentient@sentient_agency·
Holy shit... someone just built an App Store for Claude Code. It's called SkillsMP and there are 200,000+ agent skills that teach your AI exactly how to write PPTX files, review PRs, deploy to cloud, analyze data, and more. No complex prompting. No building from scratch. No wasted tokens. 100% Opensource.
Sentient tweet media
English
94
211
1.8K
166K
Guy Podjarny retweetledi
Ed Sim
Ed Sim@edsim·
If agentic development is the future, then skills are the atomic unit. But how do you move from experimental to production-grade agents? You need @tessl_io, the dev-grade package manager for skills. It’s your registry for evaluated skills + platform to manage their full lifecycle. Congrats @guypod & the team! 🚀
Guy Podjarny@guypod

Agent skills help agents use your products, build in your codebase and enforce your policies. They’re not just words - they are what the unit of software for agentic devs, and need powerful dev tools to match. That is what @tessl_io offers. Tessl is the package manager and development platform for skills. It offers a full dev lifecycle, helping you generate, evaluate, distribute and observe skills & context, developing them to the professional grade they warrant. Today, I’m excited to announce the general availability of our task evals, which help you understand how good your skills are. Such insight is critical to making your skills great, avoiding regression, and applying learnings from their real world usage. For example: @Cisco's software-security skill shows a 1.8X improvement in securing coding in its benchmark, and @ElevenLabs's agents skill boosts success by almost 3X! However, not to name names, we often see skills that provide minimal uplift while consuming context window space, or even degrade functionality. As Spencer Kimball, CEO of Cockroach Labs, put it when we shared early versions of this: evaluation is what makes agentic coding outcomes converge instead of drifting. Task evals are joining a long list of powerful context development tools, such as: * Review skills against quality best practices * Generate and maintain skills and docs for using your libraries & platform * Distribute versioned skills to your dev team and ecosystem * Consume skills easily and safely, and keep them up-to-date Skills are a central part of software development. If you’re serious about making agentic dev successful in your org, or helping your customers’s agents use your products, you need to invest in them. We hope Tessl can help. Check out links in the thread to get started!

English
3
3
7
1.6K
Guy Podjarny retweetledi
scott belsky
scott belsky@scottbelsky·
new product to help agents gain new skills, and evaluate their skills...from the @guypod and @tessl_io team...
Guy Podjarny@guypod

Agent skills help agents use your products, build in your codebase and enforce your policies. They’re not just words - they are what the unit of software for agentic devs, and need powerful dev tools to match. That is what @tessl_io offers. Tessl is the package manager and development platform for skills. It offers a full dev lifecycle, helping you generate, evaluate, distribute and observe skills & context, developing them to the professional grade they warrant. Today, I’m excited to announce the general availability of our task evals, which help you understand how good your skills are. Such insight is critical to making your skills great, avoiding regression, and applying learnings from their real world usage. For example: @Cisco's software-security skill shows a 1.8X improvement in securing coding in its benchmark, and @ElevenLabs's agents skill boosts success by almost 3X! However, not to name names, we often see skills that provide minimal uplift while consuming context window space, or even degrade functionality. As Spencer Kimball, CEO of Cockroach Labs, put it when we shared early versions of this: evaluation is what makes agentic coding outcomes converge instead of drifting. Task evals are joining a long list of powerful context development tools, such as: * Review skills against quality best practices * Generate and maintain skills and docs for using your libraries & platform * Distribute versioned skills to your dev team and ecosystem * Consume skills easily and safely, and keep them up-to-date Skills are a central part of software development. If you’re serious about making agentic dev successful in your org, or helping your customers’s agents use your products, you need to invest in them. We hope Tessl can help. Check out links in the thread to get started!

English
1
1
7
6K
Guy Podjarny
Guy Podjarny@guypod·
Tessl lets you install rules too, with the same ease. That said, skills are flawed but they’ll improve, and either way context is still your way to steer agents. Context is what you will build as an AI Native developer. A few more details: - the right skill can be many times better in both activation and effectiveness when use - Vercel’s post talked about implicit activation, you can also invoke skills explicitly - Evals (and soon observability) are your way to know if you’re successful Tessl helps with all of these and more
English
2
0
0
49
Guy Podjarny
Guy Podjarny@guypod·
Agent skills help agents use your products, build in your codebase and enforce your policies. They’re not just words - they are what the unit of software for agentic devs, and need powerful dev tools to match. That is what @tessl_io offers. Tessl is the package manager and development platform for skills. It offers a full dev lifecycle, helping you generate, evaluate, distribute and observe skills & context, developing them to the professional grade they warrant. Today, I’m excited to announce the general availability of our task evals, which help you understand how good your skills are. Such insight is critical to making your skills great, avoiding regression, and applying learnings from their real world usage. For example: @Cisco's software-security skill shows a 1.8X improvement in securing coding in its benchmark, and @ElevenLabs's agents skill boosts success by almost 3X! However, not to name names, we often see skills that provide minimal uplift while consuming context window space, or even degrade functionality. As Spencer Kimball, CEO of Cockroach Labs, put it when we shared early versions of this: evaluation is what makes agentic coding outcomes converge instead of drifting. Task evals are joining a long list of powerful context development tools, such as: * Review skills against quality best practices * Generate and maintain skills and docs for using your libraries & platform * Distribute versioned skills to your dev team and ecosystem * Consume skills easily and safely, and keep them up-to-date Skills are a central part of software development. If you’re serious about making agentic dev successful in your org, or helping your customers’s agents use your products, you need to invest in them. We hope Tessl can help. Check out links in the thread to get started!
English
3
9
28
19.7K
Guy Podjarny retweetledi
David Singleton
David Singleton@dps·
Jobs called computers "bicycles for the mind" -- tools we could shape to our will. But they never were. Until now. Every morning an agent preps me for my day -- calendar, news, last 24hrs of Slack -- in a personal podcast. I made it by asking. Same for hundreds of other things. Launching @dreamer in beta today. That 🧠 bicycle, finally. dreamer.com
Dreamer@dreamer

Introducing Dreamer. A place to discover, build, and enjoy agentic apps. It’s your home for personal intelligence. Now in beta. Sign up👇

English
86
72
427
207.4K
Guy Podjarny
Guy Podjarny@guypod·
What does an Observability solution built for agents look like? Do they need dashboards? Do they care about log formats? And how does such a product interact with humans? I had a fascinating conversation about that and more with @mirko_novakovic on the @ainativedev . We discussed how : - LLMs natively understand OpenTelemetry - Humans like dashboards but agents like text - Context is key to making agents work - Use cases that work today with agents Throughout, we had an open conversation challenging what is an observability product if you standardize the format, rely on 3rd party LLM analysis and take away the dashboard. Spoiler alert - there's much value to deliver, but it’s different! Mirko founded @dash0hq, an AI Native O11y company, and previously built an 011y company called Instana, and sold it to IBM. He knows the space :) Great conversation with a superb guest - a must listen! Full episode here: tessl.co/swe
English
0
1
6
545
Guy Podjarny
Guy Podjarny@guypod·
@HashiCorp just released Agent Skills for Terraform and Packer. What’s interesting here isn’t the idea of Skills itself, but the focus on making agent behaviour repeatable and evaluable. These aren’t prompts, they’re structured, testable bundles of domain knowledge. We worked with HashiCorp to evaluate the Skills using real task runs and evals in Tessl, looking at where they actually improve outcomes and where they don’t. Feels like a solid step toward more reliable, AI-native infrastructure workflows. Less guesswork, more verification. Check it out here: tessl.co/nsh
English
0
8
50
3.5K
Guy Podjarny retweetledi
Tessl
Tessl@tessl_io·
Agent skills are getting harder to manage. Most teams still treat skills as static artifacts: markdown files, created or copied from repo to repo. This quickly leads to debt, with skills growing stale and copies falling out of sync. That's where Tessl comes in! Today, we launched agent skills on Tessl 🎉 Tessl lets you treat skills like software, not snippets: - Discover evaluated skills in the Tessl Registry - Install and evaluate skills via CLI or from GitHub - See how skills perform across agents and models - Version, update, and evolve skills safely over time Now you can discover evaluated skills in the Tessl Registry or install and test any skill from GitHub. 💻 npm i -g @tessl/cli && tessl skill search 🔗 Explore Tessl Registry: tessl.co/cds #agentskills #devtools #aidevtools
Tessl tweet media
English
0
2
10
1K
Guy Podjarny retweetledi
Victor Riparbelli
Victor Riparbelli@vriparbelli·
Today marks the next chapter for @synthesiaIO as we announce our $200M Series E at $4B led by GV. It’s been quite a ride since 2017 when we set out to transform how people make video. Now, 8 years later, so much of that vision has come to fruition as evident everywhere around us. It’s truly incredible how good AI video has become in the last two years. Up until now it’s been mostly about making video as we know it with AI. This is often referred to as the ‘bridge-period’ where new technology copies old formats, like when the first films were just recorded theater or early GPS was just a digital map with no navigation. Now, it’s time to figure out what AI-native video truly looks like - when we let go of the priors and rethink video in the context of language models, real-time video generation, smartphones and so much more. With this new round, it’s all about helping people work better – both with AI video and a bunch of new real-time products, like Skills. We’ll be sharing much more in the coming months! Grateful to all of our amazing customers, Synthesian’s and investors!
English
10
10
59
13.2K
Guy Podjarny retweetledi
Maria Gorinova
Maria Gorinova@migorinova·
Super excited to share what we've been doing at @tessl_io to improve the quality of code generated by AI agents! We introduce a new way to measure abstraction adherence and show how Tessl 's usage specs significantly boost it Check out the full article! tessl.io/blog/proposed-…
Maria Gorinova tweet mediaMaria Gorinova tweet media
English
1
2
4
840
Guy Podjarny retweetledi
Frugal
Frugal@frugalaico·
Cloud costs are eating software margins alive. Today we're announcing Frugal's 5M seed round. Frugal is the first Application Cost Engineering (ACE) platform: fixing inefficient code before it hits production. frugal.co #finops #ai #CloudCostOptimization
Frugal tweet media
English
0
1
3
597
Guy Podjarny retweetledi
Ahmed Men ⌘
Ahmed Men ⌘@demtzu·
Introducing Capture to Notion. One ⌘ shortcut to capture anything straight to Notion. Your new w̶e̶b̶ universal clipper.
English
26
42
375
53K