Prithivi Da


Jürgen Schmidhuber@SchmidhuberAI·1d

@Arjunjain @goodfellow_ian Glad that you finally liked my reply :-) Now please convert the remaining disbelievers.


Arjun Jain | Fast Code AI@Arjunjain·2d

No, PM Is Not a GAN. Stop! @SchmidhuberAI's Predictability Minimization (1992) and Ian Goodfellow's GANs (2014) both use adversarial objectives. So does every zero-sum game since von Neumann. That's where the similarity ends. Goodfellow's generator never sees real data. It maps noise to samples and learns the data distribution purely through the discriminator's gradients. That's the whole trick. That's the invention. Schmidhuber's PM does the opposite - both players sit on top of the same real data, competing to learn independent features. It's representation learning. Nothing is generated. No noise is mapped anywhere. No distribution is learned. Calling PM a GAN because both use minimax is like calling chess a war because both have strategy. PM was a smart idea about feature independence. GANs were a breakthrough in implicit generative modeling. These are not the same insight, and retroactively collapsing the distance between them doesn't honor prior work - it misrepresents both.
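For reference, the GAN side of this comparison is usually written as the minimax objective below. The notation (G, D, p_data, p_z) follows the 2014 GAN paper and is not part of the post itself. It matches the post's point: the generator G only ever receives noise z, and real data x enters the objective only through the discriminator D.

```latex
\[
\min_{G}\,\max_{D}\; V(D, G) =
\mathbb{E}_{x \sim p_{\text{data}}(x)}\big[\log D(x)\big]
+ \mathbb{E}_{z \sim p_{z}(z)}\big[\log\big(1 - D(G(z))\big)\big]
\]
```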


Prithivi Da@prithivida·2d

How good is the surfer 2 agent with Hcompany models? Anyone?


Prithivi Da retweeted

Bo Wang@BoWang87·2d

Postdoc vs first-year PhD in the lab


Prithivi Da@prithivida·2d

@jobergum :)



Jo Kristian Bergum@jobergum·2d

It’s either just fancy scaffolding around the model or everyone should build their own harness, which way ai engineer?


Prithivi Da@prithivida·3d

@Austen Which use cases in particular did you try?


Austen Allred@Austen·4d

Hermes Agent is the Linux of OpenClaw. It really is an almost perfect analogy. Use it for a couple hours and you’ll get exactly what that means.

Peter Yang@petergyang

Ok I’ll bite - wtf is Hermes agent? Is that like the luxury bag version of OpenClaw


Prithivi Da@prithivida·3d

Agents: a plausible projection, and 5 vectors to consider before adopting.

A claim making the rounds is that Claude agents will end up in the graveyard, alongside the OpenAI Agents SDK and Google’s earlier efforts. That dismisses all the nuances.

1. The first distinction to keep straight is this: agent stack ≠ agent runtime ≠ agents themselves. Many people collapse these into one category, and that creates confusion.

2. No single agent stack will fit every need. Consumer-grade agents need to behave like appliances with minimal config fuss. Enterprise-grade agents need governance, controls, and integration depth. Hacker-grade systems such as OpenClaw optimize for flexibility and experimentation. Agent stacks are analogous to programming languages or model families: broad substrates, not universal answers.

3. For consumer-grade agents to pick up adoption, most of the following must be true (in other words, this is why hacker-grade agents won’t magically become consumer-grade). Separate agent builders from agent users. Over time, the market likely needs something closer to an agent store: a place where free and paid agents can be discovered, distributed, and monetized. Like an app store, that ecosystem would work best when paired with both an SDK and a runtime.

4. Agents, in practice, are bounded systems. Most high-value agents cannot support an unbounded accumulation of skills. That’s where directions like holaboss as a workspace-centric stack make sense.

5. From that angle, Claude agents are an opinionated, MCP-heavy stack. OpenClaw is closer to a runtime philosophy that treats agents as extensible collections of skills. The distinction matters. MCP is powerful, but also polarizing. It can be verbose, token-expensive, and unpopular with some developers. At the same time, teams that have already invested in MCP endpoints may find the Claude ecosystem attractive, especially if the surrounding tool layer makes orchestration easier.

So Claude agents may not go to the graveyard as most predict; if anything, they have revived MCP with managed agents. If you want an agent system with bespoke guardrails, recovery logic, safety controls, persona management, and trace-driven learning as first-class features, you will probably have to build it yourself.


nice demo but i'm calling it now: this will end up dead like openai's agent builder x.com/claudeai/statu…


Prithivi Da@prithivida·5d

Agreed!

Elvis@elvissun


Prithivi Da retweeted

Noah Smith 🐇🇺🇸🇺🇦🇹🇼@Noahpinion·Apr 7

AI political compass, tag urself


Prithivi Da retweeted

Carlos E. Perez@IntuitMachine·6d

On Claude Mythos
0:00 - We haven't trained it specifically to be good at cyber, we trained it to be good at code. But as a side effect of being good at code, it's also good at cyber.
0:20 - The model that we're experimenting with is by and large as good as a professional human at identifying bugs.
0:34 - this model is able to create exploits out of three, four, sometimes five vulnerabilities that in sequence give you some kind of very sophisticated end outcome.
0:51 - Obviously, capabilities in a model like this could do harm if in the wrong hands, and so we won't be releasing this model widely.
2:02 - I've found more bugs in the last couple of weeks than I've found in the rest of my life combined.
3:07 - We've spoken to officials across the US government and we've offered to work with them and collaborate to assess the risks of these models and to help defend against the risks of these models.
3:37 - It is essential that we come together and work together across industry to help build better defensive capabilities. No single organization sees the whole picture and can tackle this on their own.


Prithivi Da@prithivida·6d

@antoine_chaffin @bo_wangbo I guess it also depends heavily on the languages we want to bring together. For instance, these guys trained one model per language. Not sure what other things they tried. arxiv.org/abs/2312.09508


Antoine Chaffin@antoine_chaffin·6d

@prithivida @bo_wangbo Definitely! Adding some multilingual data surely helps, but the performance of a ColBERT model with a multilingual backbone trained on English data is really good arxiv.org/abs/2209.01335


Bo@bo_wangbo·6d

I'm honestly confused, is gte-modern-colbert even multilingual? Isn't jina-colbert-v2 or colbert-xm a better target to compare with? liquid.ai/blog/lfm2-colb…


Prithivi Da@prithivida·6d

@antoine_chaffin @bo_wangbo So cross-language generalisation in ColBERT is viable and a good investment?


Antoine Chaffin@antoine_chaffin·6d

@bo_wangbo It isn’t indeed. That being said, LFM-ColBERT is trained on English-only data; the multilingual performance comes from generalisation thanks to the multilingual backbone. Also FWIW we should release a multilingual ColBERT soon-ish


Prithivi Da@prithivida·Apr 3

@trychroma ❤️


Chroma@trychroma·Apr 3

Chroma supports multiple lexical search strategies for keyword-style retrieval. FTS. BM25. SPLADE. Walks through how they differ, and when each one wins.


Prithivi Da@prithivida·Apr 3

@NielsRogge @badlogicgames @opencode Wonder why plan modes get such a bad rap? Is it because of longer plans?


Niels Rogge@NielsRogge·Apr 3

Damn, really cool talk by @badlogicgames appeared on my YouTube feed! Lots of alpha regarding building agent harnesses, and why Anthropic cut off access to @opencode and the like


Prithivi Da@prithivida·Apr 3

@antoine_chaffin Hahaha :) it does look like an NPC.


Raphaël Sourty@raphaelsrty


Antoine Chaffin@antoine_chaffin·Apr 3

I do not always have an NPC pose I swear, it was mostly because I was a bit stressed at first

I'm at @antoine_chaffin's talk at ECIR 2026, presenting OSS done at @LightOnIO. As soon as the talk is done I will run to the LI workshop with Antoine and @AmelieTabatta


Prithivi Da@prithivida·Apr 3

@antoine_chaffin Hmm yes, hardly mainstream in the small-to-mini model community.


Antoine Chaffin@antoine_chaffin·Apr 3

It actually did a bit, less than it should imho (because this is probably one of my favourite pieces of work), but I actually saw a lot of usage and people very happy about it! Unfortunately the release back then went less viral than usual on Twitter, which did not help, but it still seems to be getting widely used, especially the edge models!


Antoine Chaffin@antoine_chaffin


Antoine Chaffin@antoine_chaffin·Apr 2

I want to deeply thank everyone that attended. It is absolutely insane how many people there were and also how cracked and friendly everyone was. On a more personal note, I was so happy to see all of the work that was enabled by all of our efforts (ModernBERT, Ettin, PyLate). It is the whole reason we are doing this, so I got emotional. Thank you so much

There are actually so many insane people attending the workshop that I would not even dare start the list. It's going to be super cool!!


Prithivi Da@prithivida·Apr 3

@andersonbcdefg @bryancsk Haha :) naming them again would make them even more famous. You are famous too, buddy!


Ben (no treats)@andersonbcdefg·Apr 3

@bryancsk multiple


Ben (no treats)@andersonbcdefg·Apr 3

the weirdest part of getting old is that one day you wake up and all the people you know from college are suddenly famous


Prithivi Da@prithivida·Apr 1

@lateinteraction @TheSeriousProg Correct, but they are part of the ecosystem and they have the users, so at some level we depend on them unless we band together and create a new optimised DB for multi-vec support, no?


Omar Khattab@lateinteraction·Apr 1

@TheSeriousProg Ah many of these companies offer bad services because they don’t want to use the custom stack that multi-vector models need. They try to reuse their single-vector stack. Horrible decision and indeed it leads to extreme costs. But it’s just self-inflicted.


On Strengths and Limitations of Single-Vector Embeddings Microsoft shows that dimensionality alone cannot explain poor retrieval performance of single-vector embeddings, identifying domain shift and the "drowning in documents" paradox as key factors. 📝 arxiv.org/abs/2603.29519


Omar Khattab@lateinteraction·Apr 1

overwhelming evidence for late interaction / multi-vector models yet again :-) > even after finetuning, single-vector models lag far behind multi-vector embeddings, which achieve significant performance gains and exhibit greater robustness to catastrophic forgetting.

Sumit@_reachsumit
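To make the single-vector vs multi-vector contrast in this thread concrete, here is a minimal sketch of the two scoring schemes. It assumes ColBERT-style MaxSim for late interaction (for each query token, take the best-matching document token and sum over query tokens) and uses mean pooling as a stand-in for a single-vector model. The toy 2-d vectors and all function names are hypothetical and chosen only for illustration.

```python
from typing import List

Vector = List[float]


def dot(a: Vector, b: Vector) -> float:
    """Plain dot product between two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def mean_pool(tokens: List[Vector]) -> Vector:
    """Collapse token vectors into one vector by averaging each dimension.

    Assumption: mean pooling stands in for a single-vector embedding model.
    """
    return [sum(col) / len(tokens) for col in zip(*tokens)]


def single_vector_score(query_tokens: List[Vector], doc_tokens: List[Vector]) -> float:
    """Score with one pooled vector per side (single-vector retrieval)."""
    return dot(mean_pool(query_tokens), mean_pool(doc_tokens))


def maxsim_score(query_tokens: List[Vector], doc_tokens: List[Vector]) -> float:
    """Late-interaction score: sum over query tokens of the max similarity
    against any document token (ColBERT-style MaxSim)."""
    return sum(max(dot(q, d) for d in doc_tokens) for q in query_tokens)


# Toy data: doc_a matches each query token exactly; doc_b is a blurred average.
query = [[1.0, 0.0], [0.0, 1.0]]
doc_a = [[1.0, 0.0], [0.0, 1.0]]
doc_b = [[0.5, 0.5], [0.5, 0.5]]

if __name__ == "__main__":
    for name, doc in (("doc_a", doc_a), ("doc_b", doc_b)):
        print(name, single_vector_score(query, doc), maxsim_score(query, doc))
```

In this toy case both documents pool to the same single vector, so the single-vector score cannot tell them apart, while MaxSim ranks the exact token-level match higher. Real systems differ in many ways (learned pooling, normalization, indexing), so this only illustrates the scoring idea.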


Prithivi Da@prithivida·Apr 1

@antoine_chaffin Speaking of pains, what do you think of this?

Prithivi Da@prithivida

x.com/i/article/2038…




Antoine Chaffin@antoine_chaffin·Apr 1

It’s funny because it’s highlighting a few key points of our discussions lately. It’s not *only* about the dimension, and yes, it’s always very much possible to fine-tune your model to become better at the task… But having to do it for every task is a pain, especially if your model forgets the others every time


Antoine Chaffin@antoine_chaffin·Apr 1

Someday people will understand

Sumit@_reachsumit
