Vaibhav Gupta

753 posts

@vaibcode

Making a programming language - BAML. @boundaryml, 🦄 aitw podcast: https://t.co/g4CJWgOsrj prev YC, google, msft, deshaw, and other things

Seattle, WA · Joined September 2012
440 Following · 1.6K Followers
Pinned Tweet
Vaibhav Gupta @vaibcode
I don’t enjoy the syntax and tooling we use to build AI pipelines. Langchain/ai sdk/et al. have never really felt “right”. No type safety, single language support, prompt is hidden until I buy some observability suite. So we made a thing youtu.be/wD3zieaV0Yc?si…
Vaibhav Gupta @vaibcode
@pranavcmadhukar Yep. Equivalent of saying “haha you’re dumb cause you can’t read hieroglyphics” I may still be dumb, but it’s not because of hieroglyphics abilities 😂
Pranav @pranavcmadhukar
@vaibcode this paper is a great filter for those who understand what llms do well and can't do. i suppose it was somewhat helpful to illustrate the extent of generalization LLMs are capable of but it was a really high ceiling to test
Vaibhav Gupta @vaibcode
This is so dumb. No shit LLMs can’t write brainf*ck or a language that uses whitespace as syntax. There is virtually no training data that says whitespace is semantic. Why would someone expect this? This is completely different from transfer learning python -> java. It would’ve been surprising if LM said this, and that would’ve been a very interesting research paper, but you proved nothing here
Lossfunk @lossfunk

🚨 Shocking: Frontier LLMs score 85-95% on standard coding benchmarks. We gave them equivalent problems in languages they couldn't have memorized. They collapsed to 0-11%. Presenting EsoLang-Bench. Accepted to the Logical Reasoning and ICBINB workshops at ICLR 2026 🧵

Vaibhav Gupta @vaibcode
new ci check for slop incoming →
Vaibhav Gupta tweet media
Alonso Silva @alonsosilva
@vaibcode So where is BAML going? OpenAI or Anthropic? 😉
Vaibhav Gupta @vaibcode
seems like the models can't build everything and sometimes you just need damn good engineering. uv and bun are clear standouts here. both are incredibly amazing engineering teams that produced beautiful systems.
Charlie Marsh @charliermarsh

We've entered into an agreement to join OpenAI as part of the Codex team. I'm incredibly proud of the work we've done so far, incredibly grateful to everyone that's supported us, and incredibly excited to keep building tools that make programming feel different.

geoff @GeoffreyHuntley
this is my favourite prompt of all time: “how could this be better?” reply with yours and why it rocks!
Vaibhav Gupta retweeted
Anish Palakurthi @anishpalakurT
Looking for designers who have Blender experience! Will pay ~$500 for a single asset
Boundary @boundaryML
What's your favorite language feature and why is it match?
geoff @GeoffreyHuntley
tbh. i’m starting to fall in love with ocaml again
Vaibhav Gupta @vaibcode
@AndersonAndrue @aaronvi we have a bunch of other docs around correctness (typesafe exceptions, match, type-systems etc). this is purely about various syntactical forms when writing code.
Storm @AndersonAndrue
@vaibcode @aaronvi I don't see code correctness listed in the concerns?
Vaibhav Gupta @vaibcode
syntax makes a huge difference to how good coding agents are, and languages should be rethought. some great learnings from rust and go! great post by @aaronvi
Vaibhav Gupta tweet media
Vaibhav Gupta @vaibcode
@zeeg the most useful thing the best engineers do is delete code and remove complexity. folks on the everything app just don't get this yet. 😂
David Cramer @zeeg
so many people on this everything app trying to tell me what im doing wrong with LLMs as if I dont ship 100x more code than them