Christian Adib

220 posts

Christian Adib banner
Christian Adib

Christian Adib

@AdibChristian

@MIT engineer building Layer10

Cambridge, MA เข้าร่วม Şubat 2018
4.8K กำลังติดตาม12.9K ผู้ติดตาม
Viv
Viv@Vtrivedy10·
own your agent harness 🤝 own your intelligence 🤝 own your evals 🤝 open ecosystems this doesn’t mean start from scratch, build on an open base harness where you can edit/extend existing primitives like bash, fs-ops, permissions, skills support, etc for your task harnesses today (for better or worse) deeply affect agent performance owning this layer makes sure your agent is optimized to the very last detail for tasks you and your customers care about instead of praying that labs or closed harnesses this for you and align with your goals the world’s builders should be able to openly observe and control their intelligence stack reach out if we can help 🚀
Jo Kristian Bergum@jobergum

Don't outsource your agent harness

English
4
5
52
6.6K
Christian Adib
Christian Adib@AdibChristian·
@hwchase17 Doesn’t post-training partially specialize models to certain harnesses? e.g., Opus may perform better in Claude Agent SDK than in deepagents, OpenCode, or pi
English
0
0
0
506
Christian Adib
Christian Adib@AdibChristian·
@sandhya Skills that are honed via hill-climbing using a real life eval set are a particularly interesting idea. Somewhat of a DSPy approach maybe. I’ve heard the term “Skill Harvesting” being thrown around. Definitely a moat if done at scale and maintained imo
English
0
0
1
41
Sandhya
Sandhya@sandhya·
@AdibChristian Agree and as I said in the post, vertical models still only make sense for some problems, not all. For others it might be a custom vertical database that offers better performance. Both are opportunities for a moat. Having neither makes a startup less technically defensible.
English
1
0
1
479
Christian Adib
Christian Adib@AdibChristian·
ألا سيف من الإيمان يبرى السيف مسنونا يجلى عن سما الأوطان هذا الذل والهونا يقود إلى جنون المجد أبطالاً مجانينا بقلب يحمل الآمال والآلام والدينا يهز القوم بالذكرى وقد ينسى الفتى حينا إذا أعطيت وعد الحر كان الوعد مأمونا
العربية
1
0
13
868
Steve Hanke
Steve Hanke@steve_hanke·
In the last week alone, the Pentagon estimates that the US spent $11 BILLION on the Iran war. WARNING: Pentagon estimates are always fiction and way below the real cost. WAR = COSTS AMERICANS AN ARM AND A LEG.
Steve Hanke tweet media
English
13
66
171
17K
Joseph Raymond
Joseph Raymond@iam_balviin·
@AdibChristian Hi there, Christian! I'm interested in the role and would like to make an application, but I can't DM you.
English
1
0
0
50
Christian Adib
Christian Adib@AdibChristian·
Layer10 is hiring an Executive Assistant based in Lebanon. Role includes managing calendars, handling general EA responsibilities, and potentially supporting some marketing work. If you’re organized, proactive, and interested in working with a fast moving team, reach out! layer10.ai
English
1
2
9
1.5K
Christian Adib
Christian Adib@AdibChristian·
@bassamkaram المكتب الإعلامي لجوي تاسيديس يصدر بيان استنكار عنوانه: "تعبنا"
العربية
1
0
5
1.4K
Fida 🦊
Fida 🦊@FidaKfida·
@AdibChristian We have some of the best talent. Are you going to let SF hog all of them?
English
1
0
0
69
Christian Adib
Christian Adib@AdibChristian·
@FidaKfida I got my car stolen in Montréal a few years ago and I’ve been holding a grudge
English
1
0
1
63
Christian Adib
Christian Adib@AdibChristian·
Anyone know any cracked IIT engineers that want to join an exciting AI startup working on automating processes with a lot of edge cases? Please introduce me!
English
1
0
4
2K
Christian Adib
Christian Adib@AdibChristian·
@curious_vii Greek Orthodox iconography? Gotta come see this stuff in Beirut
English
1
0
4
507
christian
christian@curious_vii·
Mood
christian tweet mediachristian tweet mediachristian tweet mediachristian tweet media
English
1
0
2
1.2K
christian
christian@curious_vii·
officially switched to codex 5.2 xhigh etc as my primary driver claude code has nice quality of life advantages: - running background processes, e.g., dev servers - plan mode (although I'm not sure it matters as much if the model is just... better?) - a few others I can't remember atm... but — and IDK how many times we need to re-learn this — but the model is (at least most of) the product, and this is another example...
christian@curious_vii

thinking about switching to codex as my orchestrator... claude keeps giving feedback that codex's investigations and changes are more robust now: clearly, you get returns to scope from multi-agent (and multi-model, more specifically), when you run a sort of super–Ralph loop w/ tons of semi-automated testing, whereby escalation to human review (or, even auto-merge—I'm sure we'll get there soon...) is gated by multiple models reviewing each other's plans, work product, and so on; BUT, it's not in the Labs' respective interests to make this easy (I don't think?), so either there'll always be a meaningful gap between power users who are setting up multi-model systems and enjoying the fruits of higher-quality, more reliable automation and more casual users relying on one model at a time OR there's an oppportunity to develop a "product" that generalizes now and in the future as the models' latent space footprints expand (and differentiate?) obviously @droid et al (e.g., @AmpCode and @DevinAI to a lesser extent, I think) are working to do something like this, where the model choices are abstracted away from the user by default, but, for some reason, I'm skeptical that they can all keep pace, given the fundamental architectural changes that might end up conflicting with the design choices they've made (and need to support thus far); see: Cursor as a prime example — although, maybe what would otherwise be premature optimization doesn't matter if you get enough distribution... fascinating space, and, in the end, I think the individual user/maker (not 'consumer!') captures most of the economic surplus!

English
2
0
3
740