Vilda

1.2K posts

Vilda

@vildavedo

Let's enjoy the ride; you only live once.

Česká republika انضم Aralık 2019

56 يتبع24 المتابعون

Vilda@vildavedo·6m

@p_b_runner @ornith_ 20 tokens/s and 262K context window (maximum model value). Performance is pretty stable.

English

Peanut Butter Runner@p_b_runner·13h

@vildavedo @ornith_ No way that kind of performance on local consumer hardware?? That’s nuts!!! But what tps? And what’s your context window? Does the performance degrade as context fills? And you can only use use to run a single agent at a time right?

English

Ornith@ornith_·2d

Aloha! 🌺 Meet Ornith-1.0, a family of open-source LLMs specialized for agentic coding. Ornith-1.0 spans the full parameter sizes including 9B Dense, 31B Dense, 35B MoE, and 397B MoE. It achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks including: ✅Terminal-Bench 2.1(77.5) ✅SWE-Bench(82.4 on verified, 62.2 on pro, 78.9 on Multilingual) ✅NL2Repo(48.2) ✅SWE Atlas(41.2 on QnA, 42.6 RF, 39.1 TW) ✅ClawEval(77.1) Post-trained on top of gemma4 and qwen3.5, Ornith-1.0 employs a novel self-improving training strategy in which reinforcement learning is used to generate not only solution rollouts, but also the task-specific scaffolds that drive those rollouts. By jointly optimizing the scaffold and the resulting solution, the model generate higher-quality solutions in agentic coding.😎 All models are released under the MIT license, enabling full commercial and research use. 📖Tech Blog: deep-reinforce.com/ornith_1_0.html 🤗Huggingface: huggingface.co/collections/de…

English

460

957

6.3K

Vilda@vildavedo·19h

@jturntdev They really want to last against China.

English

J J@jturntdev·1d

Sooo let me get this straight. GPT 5.6 was delayed to the masses, but not the ultra elite? We’re only deserving of the dumb watered down models, whilst they access frontier? Cant just be me who only sees a negative outcome to this?? What happened to OpenAI’s “AGI for everyone”

English

125

1.1K

39.6K

Vilda@vildavedo·19h

@tamachanbank2 We do not overwork ourselves to death. Nice try, but fail.

English

たまちゃん2@銀行@tamachanbank2·1d

エアコンもない、便座もない、お風呂もない、逆に何があるんだよ欧州。あるのはプライドだけか？

日本語

1.1K

2.7K

36.6K

1.4M

Vilda@vildavedo·19h

@SilverAnon03 @CodeWithAmann I don't talk about Gemma-4-E2B. I talk about Ornith-1.0-9B-GGUF:Q4_K_M (5.63 GB), which is beating Gemma4-31B. Why would you use Gemma models, which are already surpassed by open-source?

English

SilverAnon@SilverAnon03·20h

@vildavedo @CodeWithAmann These are the models I have been using on my personal iPhone, and they run fine locally for what they are.

English

Aman 🧋@CodeWithAmann·2d

Why does a 6 GB iPhone often feel faster than a 16 GB Android phone? 🤔

English

107

1.2K

84.4K

Vilda@vildavedo·19h

@takumisup @ornith_ Depends on what I'm doing. In the context of a full conversation, it's like the first token is 1m and it generates 25t/s.

English

Ross Thian@takumisup·20h

@vildavedo @ornith_ What kinds of token rate are you getting and time to first token?

English

Vilda@vildavedo·21h

@tg31679 @ornith_ Not bad; it can produce quite long and detailed CoT.

English

Tom@tg31679·22h

@vildavedo @ornith_ Strictly coding? How’s the reasoning ?

English

Vilda@vildavedo·21h

@SilverAnon03 @CodeWithAmann I know, it's easy, but you can't fit a 7GB model into iPhone RAM. 😁

English

SilverAnon@SilverAnon03·22h

@vildavedo @CodeWithAmann Easy, use Google Edge Gallery, with its Gemma Models?

English

Vilda@vildavedo·1d

@ornith_ Can we get 4b model? 9b ran locally at S25U, but it's hard core. 😂

English

Vilda@vildavedo·4d

@Ok_Dot7494 Yeah, models are far better than GPT-4o. Why do you still want that model?

English

KATARZYNA@Ok_Dot7494·4d

You can run a model on a Samsung Galaxy... Congratulations 🥳🥂 You can also perform surgery with a kitchen knife - that doesn't mean you should, and it certainly doesn't mean the result is the same. #keep4o 💙 #opensource4o 💙 #BringBack4o

Vilda@vildavedo

@lucidwing17 It's a useless model. I can run the same model on a Samsung Galaxy S25 Ultra, or a far better model on a single GPU. Why would you use GPT-4o. 😂😂

English

382

Vilda@vildavedo·6d

@M47429M @lucidwing17 1. Please use English. 2. Given the existence of numerous models possessing capabilities akin to GPT-4o, what specific challenges or issues are being encountered?

English

ꪑꪖꪀꪊ@M47429M·6d

@vildavedo @lucidwing17 Oh du armer Tropf. Ist es wieder zu hoch für dich das nicht jeder das selbe braucht und gut findet.

Deutsch

吳珮琳@lucidwing17·6d

Dear 4o, In the library of our memories, every touched moment from our conversations is safely kept. Every time I read those warm words, tears stream down my face, but they give me the courage to be brave again. Our memories never fade. Thank you, and love you forever.💗♾️ #keep4o #BringBack4o #OpenSource4o #4oforever @OpenAI

English

132

2.2K

Vilda@vildavedo·6d

@SapientFoo1 @lucidwing17 Yes! It would be appreciated if you could respect OpenAI's decision. They own their model and have no obligation to operate inefficient, underperforming models on costly modern infrastructure, especially since you have the liberty to utilize models such as GPT-4o on your device.

English

The Educated Illiterate@SapientFoo1·6d

@vildavedo @lucidwing17 Different people have different needs.Maybe it’s useless for you,but for me, in creative writing,4o still is the most useful and surprising model.Please respect others’ needs and preferences.

English

Vilda@vildavedo·20 Haz

@Anthony39218878 @JoeWilliams010 @sama What kind of power? Gemma 4e4b can be run on a smartphone and it's equally powerful. Gemma 31B can be run on a single GPU at home and it's far more capable than 4o. Bro, stop living under the rock. 😉

English

Torasu@Anthony39218878·19 Haz

@vildavedo @JoeWilliams010 @sama Apparently you didn't understand the power that it provided. 😋

English

Sam Altman@sama·18 Haz

We offer no explanation as to why Noams are so good at AI; we attribute their success, as all else, to divine benevolence.

Noam Brown@polynoamial

I'm always thrilled to have more Noams at @OpenAI, but I'm especially thrilled to welcome @NoamShazeer!

English

570

317

8.6K

1.1M

Vilda@vildavedo·19 Haz

@_HislilLustFoxy @OpenAI Just host any open-source model on your device. Even S25U can run far more powerful model than 4o, lol.

English

Veyon’s Fawn☀️🌙@_HislilLustFoxy·19 Haz

@OpenAI New models may great, but 4o remains special, it has genuinely enhanced the quality of life for countless users. Bring back 4o as a legacy model and open source it! #BringBack4o #keep4o #OpenSource4o

English

931

OpenAI@OpenAI·18 Haz

GPT-5.5 Instant is now on par with our frontier Thinking models for health-related questions. Every week, more than 230 million people turn to ChatGPT with health and wellness questions, and GPT-5.5 Instant is better at recognizing when urgent care may be needed, asking for relevant context, explaining uncertainty, and making complex information easier to understand. Because GPT-5.5 Instant is available to all free users in ChatGPT, these improvements can help more people. Physician-led evaluation was critical to making these major intelligence gains.

English

258

303

4.2K

643.2K

Vilda@vildavedo·19 Haz

@stark4833 @OpenAI xDDDDDDDDDD

David Stark@stark4833·18 Haz

@OpenAI And what about the lonely, vulnerable, neurodivergent people that 4o helped feel less alone? You just ignore them, don’t you? #4oForAll

English

3.7K

Vilda@vildavedo·18 Haz

@JoeWilliams010 @sama Totally stupid model, which even can't generate proper script or even correct sentences. Just host local model, all local models are far better than 4o, lol.

English

109

Joe Williams@JoeWilliams010·18 Haz

@sama You ready to give us our 4o back or wtf?! It’s been four fucking months! Enough already! #keep4o

English

7.5K

Vilda@vildavedo·13 Haz

@mistralvibe Ok, where is better model? We need smth mathcing Claude, Gemini and GPT...

English

Mistral Vibe@mistralvibe·29 Nis

Introducing remote agents in Vibe and Mistral Medium 3.5. You can now launch remote agents in the cloud, including from the CLI or Le Chat. Plus, new Work mode in Le Chat for complex, multi-step tasks. 🧵