
Our interactive function-calling benchmark results are live! boundaryml.com/blog/sota-func…
Boundary
117 posts

@boundaryML
build unbreakable agents with BAML 🧱. What TypeScript did for JS, BAML does for AI. 🧪 Try it: https://t.co/9bobLt1ObL

Our interactive function-calling benchmark results are live! boundaryml.com/blog/sota-func…

i wonder about people who are claudemaxxing/codexmaxxing 18 hours a day, like what are these folks building exactly?

@Jonathan_Blow 5/x: and then use @boundaryML 's BAML for chunking things out and testing / iterating quickly. fuck llms and prompts etc but if you have to work with them BAML is like the only thing that makes it feel marginally manageable.

Introducing Zero The programming language for agents. I wanted a systems language that was faster, smaller, and easier for agents to use and repair. Explicit capabilities. JSON diagnostics. Typed safe fixes. Made for agents on day zero.




"Refactoring is so much easier now" Not if you're generating 100x more code to refactor












I think it must be a very interesting time to be in programming languages and formal methods because LLMs change the whole constraints landscape of software completely. Hints of this can already be seen, e.g. in the rising momentum behind porting C to Rust or the growing interest in upgrading legacy code bases in COBOL or etc. In particular, LLMs are *especially* good at translation compared to de-novo generation because 1) the original code base acts as a kind of highly detailed prompt, and 2) as a reference to write concrete tests with respect to. That said, even Rust is nowhere near optimal for LLMs as a target language. What kind of language is optimal? What concessions (if any) are still carved out for humans? Incredibly interesting new questions and opportunities. It feels likely that we'll end up re-writing large fractions of all software ever written many times over.




Seattle, this one's for you. 🫶 We've added @lenadroid to our already awesome speaker lineup. Come spend an evening with us, hear from Lena and @radgendervibes from Zed, @vaibcode from @boundaryML, and @matsonj from @motherduck go on a much needed rant on what AI gets wrong (and sometimes gets right). Link to rsvp in the thread. 🧵
