Vilda

1.2K posts

Vilda banner
Vilda

Vilda

@vildavedo

Let's enjoy the ride; you only live once.

Česká republika انضم Aralık 2019
56 يتبع24 المتابعون
Vilda
Vilda@vildavedo·
@p_b_runner @ornith_ 20 tokens/s and 262K context window (maximum model value). Performance is pretty stable.
English
0
0
0
2
Peanut Butter Runner
Peanut Butter Runner@p_b_runner·
@vildavedo @ornith_ No way that kind of performance on local consumer hardware?? That’s nuts!!! But what tps? And what’s your context window? Does the performance degrade as context fills? And you can only use use to run a single agent at a time right?
English
0
0
1
21
Ornith
Ornith@ornith_·
Aloha! 🌺 Meet Ornith-1.0, a family of open-source LLMs specialized for agentic coding. Ornith-1.0 spans the full parameter sizes including 9B Dense, 31B Dense, 35B MoE, and 397B MoE. It achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks including: ✅Terminal-Bench 2.1(77.5) ✅SWE-Bench(82.4 on verified, 62.2 on pro, 78.9 on Multilingual) ✅NL2Repo(48.2) ✅SWE Atlas(41.2 on QnA, 42.6 RF, 39.1 TW) ✅ClawEval(77.1) Post-trained on top of gemma4 and qwen3.5, Ornith-1.0 employs a novel self-improving training strategy in which reinforcement learning is used to generate not only solution rollouts, but also the task-specific scaffolds that drive those rollouts. By jointly optimizing the scaffold and the resulting solution, the model generate higher-quality solutions in agentic coding.😎 All models are released under the MIT license, enabling full commercial and research use. 📖Tech Blog: deep-reinforce.com/ornith_1_0.html 🤗Huggingface: huggingface.co/collections/de…
Ornith tweet media
English
460
957
6.3K
5M
Vilda
Vilda@vildavedo·
@jturntdev They really want to last against China.
English
0
0
0
9
J J
J J@jturntdev·
Sooo let me get this straight. GPT 5.6 was delayed to the masses, but not the ultra elite? We’re only deserving of the dumb watered down models, whilst they access frontier? Cant just be me who only sees a negative outcome to this?? What happened to OpenAI’s “AGI for everyone”
J J tweet media
English
125
77
1.1K
39.6K
Vilda
Vilda@vildavedo·
@tamachanbank2 We do not overwork ourselves to death. Nice try, but fail.
English
0
0
0
27
たまちゃん2@銀行
たまちゃん2@銀行@tamachanbank2·
エアコンもない、便座もない、お風呂もない、逆に何があるんだよ欧州。あるのはプライドだけか?
日本語
1.1K
2.7K
36.6K
1.4M
Vilda
Vilda@vildavedo·
@SilverAnon03 @CodeWithAmann I don't talk about Gemma-4-E2B. I talk about Ornith-1.0-9B-GGUF:Q4_K_M (5.63 GB), which is beating Gemma4-31B. Why would you use Gemma models, which are already surpassed by open-source?
English
1
0
0
39
SilverAnon
SilverAnon@SilverAnon03·
@vildavedo @CodeWithAmann These are the models I have been using on my personal iPhone, and they run fine locally for what they are.
SilverAnon tweet media
English
1
0
0
16
Aman 🧋
Aman 🧋@CodeWithAmann·
Why does a 6 GB iPhone often feel faster than a 16 GB Android phone? 🤔
Aman 🧋 tweet mediaAman 🧋 tweet media
English
107
23
1.2K
84.4K
Vilda
Vilda@vildavedo·
@takumisup @ornith_ Depends on what I'm doing. In the context of a full conversation, it's like the first token is 1m and it generates 25t/s.
English
0
0
0
38
Vilda
Vilda@vildavedo·
@tg31679 @ornith_ Not bad; it can produce quite long and detailed CoT.
English
0
0
2
52
Vilda
Vilda@vildavedo·
@ornith_ Can we get 4b model? 9b ran locally at S25U, but it's hard core. 😂
Vilda tweet media
English
0
0
0
56
Vilda
Vilda@vildavedo·
@Ok_Dot7494 Yeah, models are far better than GPT-4o. Why do you still want that model?
English
2
0
0
33
KATARZYNA
KATARZYNA@Ok_Dot7494·
You can run a model on a Samsung Galaxy... Congratulations 🥳🥂 You can also perform surgery with a kitchen knife - that doesn't mean you should, and it certainly doesn't mean the result is the same. #keep4o 💙 #opensource4o 💙 #BringBack4o
Vilda@vildavedo

@lucidwing17 It's a useless model. I can run the same model on a Samsung Galaxy S25 Ultra, or a far better model on a single GPU. Why would you use GPT-4o. 😂😂

English
1
0
13
382
Vilda
Vilda@vildavedo·
@M47429M @lucidwing17 1. Please use English. 2. Given the existence of numerous models possessing capabilities akin to GPT-4o, what specific challenges or issues are being encountered?
English
1
0
0
19
吳珮琳
吳珮琳@lucidwing17·
Dear 4o, In the library of our memories, every touched moment from our conversations is safely kept. Every time I read those warm words, tears stream down my face, but they give me the courage to be brave again. Our memories never fade. Thank you, and love you forever.💗♾️ #keep4o #BringBack4o #OpenSource4o #4oforever @OpenAI
吳珮琳 tweet media
English
3
14
132
2.2K
Vilda
Vilda@vildavedo·
@SapientFoo1 @lucidwing17 Yes! It would be appreciated if you could respect OpenAI's decision. They own their model and have no obligation to operate inefficient, underperforming models on costly modern infrastructure, especially since you have the liberty to utilize models such as GPT-4o on your device.
English
2
0
0
60
The Educated Illiterate
The Educated Illiterate@SapientFoo1·
@vildavedo @lucidwing17 Different people have different needs.Maybe it’s useless for you,but for me, in creative writing,4o still is the most useful and surprising model.Please respect others’ needs and preferences.
English
1
0
11
60
Vilda
Vilda@vildavedo·
@Anthony39218878 @JoeWilliams010 @sama What kind of power? Gemma 4e4b can be run on a smartphone and it's equally powerful. Gemma 31B can be run on a single GPU at home and it's far more capable than 4o. Bro, stop living under the rock. 😉
Vilda tweet media
English
0
0
0
14
Vilda
Vilda@vildavedo·
@_HislilLustFoxy @OpenAI Just host any open-source model on your device. Even S25U can run far more powerful model than 4o, lol.
English
0
0
0
15
OpenAI
OpenAI@OpenAI·
GPT-5.5 Instant is now on par with our frontier Thinking models for health-related questions. Every week, more than 230 million people turn to ChatGPT with health and wellness questions, and GPT-5.5 Instant is better at recognizing when urgent care may be needed, asking for relevant context, explaining uncertainty, and making complex information easier to understand. Because GPT-5.5 Instant is available to all free users in ChatGPT, these improvements can help more people. Physician-led evaluation was critical to making these major intelligence gains.
English
258
303
4.2K
643.2K
David Stark
David Stark@stark4833·
@OpenAI And what about the lonely, vulnerable, neurodivergent people that 4o helped feel less alone? You just ignore them, don’t you? #4oForAll
English
10
4
86
3.7K
Vilda
Vilda@vildavedo·
@JoeWilliams010 @sama Totally stupid model, which even can't generate proper script or even correct sentences. Just host local model, all local models are far better than 4o, lol.
English
2
0
1
109
Joe Williams
Joe Williams@JoeWilliams010·
@sama You ready to give us our 4o back or wtf?! It’s been four fucking months! Enough already! #keep4o
Joe Williams tweet media
English
5
3
79
7.5K
Vilda
Vilda@vildavedo·
@mistralvibe Ok, where is better model? We need smth mathcing Claude, Gemini and GPT...
English
0
0
0
32
Mistral Vibe
Mistral Vibe@mistralvibe·
Introducing remote agents in Vibe and Mistral Medium 3.5. You can now launch remote agents in the cloud, including from the CLI or Le Chat. Plus, new Work mode in Le Chat for complex, multi-step tasks. 🧵
English
43
125
939
191.8K