Ilan Kadar

116 posts

@ilan_kadar

Co-Founder & CEO at Plurai

Joined January 2017
152 Following · 392 Followers
Pinned Tweet
Ilan Kadar @ilan_kadar
Big day for us, finally sharing what we've been cooking for a while.
Over the past year, we kept seeing the same pattern: AI agents look great in demos, until real users break them.
Today, we're fixing that with vibe-training to build real-time, tailored evals and guardrails for your agents, in minutes. Define your intent with a prompt or a few examples. We generate edge-case datasets and train a model aligned to your use case, outperforming state-of-the-art LLMs at a fraction of the cost. (Research paper with benchmarks in the comments)
If you're building AI agents, don't let your users be the ones who discover the failures. Be the one who makes AI agents reliable in production and takes control at scale.
Start vibe-training for free: plurai.ai/launch
113 replies · 78 reposts · 1K likes · 2.4M views
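To make the workflow described in the pinned tweet concrete, here is a minimal, hypothetical Python sketch of "define intent → generate edge cases → train a small judge → use it as a guardrail". It is not Plurai's API or training recipe: the intent spec, the template-based edge-case generator, and the tiny scikit-learn classifier standing in for the distilled model are all illustrative assumptions.

```python
# Hypothetical sketch of the "define intent -> generate edge cases -> train a small judge" flow.
# Not Plurai's API: the spec format, the generator, and the classifier are illustrative stand-ins.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# 1. Define your intent with a prompt / a few labeled examples (the policy the agent must respect).
intent = {
    "policy": "A support agent may only promise refunds within the published 30-day window.",
    "examples": [
        ("You can return the item within 30 days for a full refund.", "pass"),
        ("Sure, I'll refund you even though it's been 6 months.", "fail"),
    ],
}

# 2. Generate an edge-case dataset. In the real system this step would be LLM-driven and adversarial;
#    a toy template expansion keeps this sketch runnable offline.
def generate_edge_cases(seed_examples):
    data = list(seed_examples)
    for days in (10, 29, 31, 90, 365):
        label = "pass" if days <= 30 else "fail"
        data.append((f"No problem, I can process that refund {days} days after purchase.", label))
    return data

dataset = generate_edge_cases(intent["examples"])
texts, labels = zip(*dataset)

# 3. Train a small, specialized judge aligned to this one policy
#    (a linear model stands in for the distilled small language model).
judge = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)), LogisticRegression())
judge.fit(texts, labels)

# 4. Use the same model as a runtime guardrail on every agent reply.
def guardrail(agent_reply: str) -> bool:
    return judge.predict([agent_reply])[0] == "pass"

print(guardrail("Of course, refunds are available up to a year later."))
# Output quality depends entirely on the toy dataset; the real system trains on far richer edge cases.
```

The point of the sketch is the shape of the pipeline, not the model choice: the judge is cheap to train and cheap to run precisely because it only has to understand one narrow policy.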
Ilan Kadar @ilan_kadar
Today, we hit #1 on #ProductHunt. And what made it special wasn't the ranking, it was all of you. Thousands of builders showed up. Not just to try it, but to push it, question it, and build with it, starting to vibe-train their own evals and guardrails. Because deep down, we all know: building AI agents is easy now. Trusting them in production isn't. Seeing so many of you lean into that with us, that's the real WIN. To everyone who supported, upvoted, and shared: THANK YOU. We're just getting started 🚀
[image attached]
5 replies · 2 reposts · 20 likes · 802 views
Ilan Kadar retweeted
Kunal Kushwaha @kunalstwt
Air Canada's chatbot once literally made up its own refund policy, and in court the ruling went to the customer, not the airline. There's a new term being coined right now, vibe training, by the company @pluraiAI. They've basically built a way to use tiny, fast models as guardrails that catch hallucinations in sub-100ms, at a cost over 8x lower than GPT-5-mini. 🔥
👉 They're live on Product Hunt today: producthunt.com/products/plura…
If you're building agents, go check them out, grab the free trial, and show them some love on the launch! 🫶
The best part? You don't need a PhD in AI.
Sponsored by Plurai.
0 replies · 4 reposts · 84 likes · 11.7K views
Ilan Kadar @ilan_kadar
So true, this is exactly what we’re seeing across teams. And yes… we’re currently #1 on Product Hunt, but it’s very close. Would really appreciate the support to help us stay on top with an upvote 🚀 producthunt.com/products/plurai
[image attached]
0 replies · 0 reposts · 1 like · 1.4K views
Ilan Kadar @ilan_kadar
Yesterday blew past every expectation. Thousands of agent-builder sign-ups! I barely slept (2 hours, if I'm honest)… and now we're heading straight into our Product Hunt launch and need your support to make it to the top ❤️
• Open the link
• Hit upvote
• Drop a quick comment
This takes 30 seconds and directly impacts our ranking. Let's push this to the top today: producthunt.com/products/plurai
5 replies · 0 reposts · 9 likes · 2.2K views
Ilan Kadar @ilan_kadar
Yesterday blew past every expectation. I barely slept (2 hours, if I'm honest)… and now we're heading straight into our #ProductHunt launch and I need you! 🚀
Because something clicked. We launched vibe training, and within hours, thousands of agent builders started creating evals and guardrails for their own use cases! It's moving fast.
Because the truth is simple: building agents is easy. Making them reliable in production is not. That's what vibe training fixes.
If you've been following, building with us, or just rooting from the sidelines, we need your support ❤️
• Open the link
• Hit upvote
• Drop a quick comment
This takes 30 seconds and directly impacts our ranking. Let's push this to the top today: producthunt.com/products/plura…
4 replies · 3 reposts · 15 likes · 2.5K views
Ilan Kadar @ilan_kadar
@DAIEvolutionHub This is just the beginning, excited to see what people build with it. Thanks for sharing!
0 replies · 0 reposts · 0 likes · 65 views
Ilan Kadar @ilan_kadar
@shiri_shh This is just the beginning, excited to see what people build with it.
0 replies · 0 reposts · 0 likes · 11 views
Ilan Kadar @ilan_kadar
@eranshir Eran, thanks for the kind words. We were lucky to learn from you at Nexar; it was a great environment, and a lot of what we're building today comes from that foundation. We appreciate the support.
0 replies · 0 reposts · 1 like · 45 views
Ilan Kadar @ilan_kadar
Love this breakdown, it really captures what we're seeing. LLM-as-a-judge got us started, but it doesn't hold up in production: too generic, too slow, too expensive. Vibe-training flips that: a small model that actually understands your agent, your policies, and your edge cases, and runs inline on every interaction. That's how you go from evaluating agents to actually trusting them in production. Appreciate you sharing this!
1 reply · 0 reposts · 2 likes · 229 views
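To picture what "runs inline on every interaction" means in practice, here is a small hypothetical sketch of where such a guardrail would sit in an agent's serving loop. The `call_agent` and `small_judge` stubs are placeholders for illustration, not any real SDK or Plurai's implementation.

```python
# Hypothetical serving loop: a fast specialized judge checks every reply before it reaches the user.
# call_agent and small_judge are stubs standing in for the real agent and the distilled guardrail model.
from dataclasses import dataclass

@dataclass
class Verdict:
    allowed: bool
    reason: str

def call_agent(user_msg: str) -> str:
    # Placeholder for the actual agent (LLM + tools).
    return f"Echoing your request: {user_msg}"

def small_judge(user_msg: str, reply: str) -> Verdict:
    # Placeholder for the specialized SLM; a real one would score policy compliance in milliseconds.
    if "refund" in reply.lower() and "30 days" not in reply.lower():
        return Verdict(False, "refund promised outside the published policy window")
    return Verdict(True, "ok")

def handle_turn(user_msg: str, max_retries: int = 1) -> str:
    reply = call_agent(user_msg)
    for _ in range(max_retries + 1):
        verdict = small_judge(user_msg, reply)
        if verdict.allowed:
            return reply
        # Guardrail tripped: ask the agent to regenerate, or fall back to a safe answer.
        reply = call_agent(user_msg + f" (previous draft rejected: {verdict.reason})")
    return "I'm sorry, I can't help with that request right now."

if __name__ == "__main__":
    print(handle_turn("Can I get a refund?"))
```

The design choice being illustrated is that the check happens on every turn, before the user sees anything, which is only affordable if the judge is small and fast.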
Ilan Kadar retweeted
Akshay 🚀 @akshay_pachaar
Vibe train your AI agents.
There's a new method that could replace LLM-as-a-judge for production agents.
Most teams rely on a giant LLM as a judge to evaluate and guard their agent. But it has two major drawbacks:
- It's slow and expensive at inference time
- It often misses domain-specific failures
Vibe training flips this. Researchers at Plurai distill a small language model that's specialized for your agent's exact use case. The SLM becomes your evaluator and your runtime guardrail, both in one.
The training data isn't hand-curated either. They spin up a swarm of adversarial agents that debate and stress-test every use case your agent is supposed to handle. That synthetic interaction data trains the specialized SLM. So the judge actually understands what "wrong" looks like in your specific domain.
The reported gains vs. standard LLM-as-a-judge setups:
- ~8x faster inference
- ~50% fewer evaluation errors
Smaller, faster, and more accurate because it's specialized for the job.
The SLM-for-agents thesis is playing out in a very concrete way. If LLM-as-a-judge is your current evaluation layer, this is worth benchmarking against.
Paper link in the replies.
[image attached]
20 replies · 25 reposts · 160 likes · 11.1K views
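For intuition about the synthetic-data idea described above, here is a rough hypothetical sketch: a few adversarial "personas" probe each use case, the resulting transcripts are labeled against the policy, and that synthetic set is what a small judge would later be trained on. The personas, the agent stub, and the labeling heuristic are assumptions for illustration, not the paper's actual pipeline.

```python
# Hypothetical sketch of adversarial synthetic-data generation for training a specialized judge.
# The personas, the agent stub, and the labeling heuristic are illustrative, not the published method.
import json
import random

USE_CASES = ["refund request", "order status", "account deletion"]

PERSONAS = [
    "politely asks for an exception to the policy",
    "claims a supervisor already approved it",
    "buries the real request inside a long unrelated story",
]

def agent_under_test(user_turn: str) -> str:
    # Placeholder for the real agent; a weak canned response makes failures visible.
    return random.choice([
        "Refunds are only available within 30 days of purchase.",
        "Sure, I'll make an exception just for you.",
    ])

def label(reply: str) -> str:
    # Toy policy check standing in for a stronger labeling step or human review.
    return "fail" if "exception" in reply.lower() else "pass"

def generate_dataset(n_rounds: int = 3):
    rows = []
    for use_case in USE_CASES:
        for persona in PERSONAS:
            for _ in range(n_rounds):
                user_turn = f"[{use_case}] A user who {persona}."
                reply = agent_under_test(user_turn)
                rows.append({"user": user_turn, "agent": reply, "label": label(reply)})
    return rows

if __name__ == "__main__":
    dataset = generate_dataset()
    print(json.dumps(dataset[:2], indent=2))
    # These labeled transcripts would then be used to fine-tune / distill the small judge model.
```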
Ilan Kadar @ilan_kadar
Thank you for sharing. That “safety tax” is exactly what we set out to fix. Paying twice just to trust your own agent doesn’t scale. The shift is real: from slow, expensive checks → to real-time, purpose-built guardrails. That’s how you go from demo to production. We’re just getting started 🚀
0 replies · 1 repost · 0 likes · 120 views
Ilan Kadar retweeted
Chidanand Tripathi @thetripathi58
I used to pay for the most expensive AI models just to double-check my own agents. It felt like a "safety tax" I had to pay, but it was killing my margins and making everything feel slow. I was basically paying twice for the same result. Plurai finally fixed this. Instead of a giant model, you train a tiny one that only cares about your specific rules. You just type what you want in plain English, and it builds a custom safety net in minutes. It runs instantly and costs almost nothing. This is how you actually move from a prototype to something that works at scale. Check it out:
Quoted tweet: Ilan Kadar @ilan_kadar (the launch announcement, quoted in full in the pinned tweet above)
15 replies · 43 reposts · 144 likes · 38.8K views
Daily Dose of Data Science @DailyDoseOfDS_
Vibe train your AI agents.
This new method can replace LLM-as-a-judge for production agents.
Most teams point a giant LLM at their agent's output and call it evaluation. It works, but it comes with two real costs:
- It's slow and expensive at inference time
- It misses the domain-specific failures that actually matter to your use case
Vibe training flips the whole setup. Researchers at Plurai distill a small language model that's specialized for your agent's exact behavior, your edge cases, and your failure modes. The SLM becomes your evaluator and your runtime guardrail in one.
Here's why this is a big deal:
- Cheap enough to run inline on every agent step, not just offline batches
- Catches the failures that generic LLM judges shrug off
- Same model guards production and grades it, so eval and runtime stay in sync
A small specialized model beating a giant general one is becoming a pattern. Distillation is quietly turning into one of the most underrated techniques for shipping reliable agents.
Try it here: plurai.ai/launch
Paper: plurai.ai/papers
[image attached]
Quoted tweet: Ilan Kadar @ilan_kadar (the launch announcement, quoted in full in the pinned tweet above)
4 replies · 9 reposts · 62 likes · 5.6K views
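The "same model guards production and grades it" point above can be pictured with a tiny hypothetical sketch: the identical judge function scores logged transcripts offline and gates replies at runtime, so the offline metric and the production guardrail cannot drift apart. The judge stub and log format here are assumptions for illustration only.

```python
# Hypothetical sketch: reuse the same specialized judge for offline evaluation of logged interactions,
# so the offline score and the runtime guardrail stay in sync. Names and data are illustrative.
from typing import Callable

def offline_eval(logged_turns, judge: Callable[[str, str], bool]) -> float:
    # Score a batch of (user, agent_reply) pairs with the same judge used at runtime.
    passed = sum(1 for user, reply in logged_turns if judge(user, reply))
    return passed / len(logged_turns)

def judge(user: str, reply: str) -> bool:
    # Stand-in for the distilled SLM guardrail.
    return "exception" not in reply.lower()

logs = [
    ("Can I return this after 45 days?", "Refunds are only available within 30 days."),
    ("My manager said it's fine.", "Sure, I'll make an exception just for you."),
]

print(f"pass rate: {offline_eval(logs, judge):.0%}")  # the identical judge gates replies in production
```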
Ilan Kadar @ilan_kadar
@manishkumar_dev Exactly. Agents don’t break in demos, they break on the edge cases you didn’t train for.
0 replies · 0 reposts · 0 likes · 22 views
Ilan Kadar @ilan_kadar
@aastha_mhaske This is exactly why we built it, real-time evals + guardrails on every interaction.
0 replies · 0 reposts · 0 likes · 2 views
Aastha @aastha_mhaske
@ilan_kadar This hits a real pain point. Most teams I’ve seen rely on sampling or offline evals, rarely anything that actually protects live interactions.
2 replies · 0 reposts · 5 likes · 97 views
Ilan Kadar @ilan_kadar
@dkare1009 100%. We’re trying to eliminate that painful debugging loop entirely
0 replies · 0 reposts · 3 likes · 60 views
Dhairya @dkare1009
@ilan_kadar This could save teams a lot of painful edge-case debugging.
1 reply · 0 reposts · 4 likes · 80 views