Vilda

1.2K posts

@vildavedo

Let's enjoy the ride; you only live once.

Česká republika · Joined December 2019

56 Following · 24 Followers

Vilda@vildavedo·9h

@AlexFinn Why not sell all Macs and strictly rely on more RTXs and more RAM? It looks like a bit of a fragmented setup.

Alex Finn@AlexFinn·1d

I’ve had enough. With Fable 5 being gatekept from us, and now GPT 5.6 being gatekept, I’m going full open source. Just went to Microcenter and built this RTX 5090 computer. Will be adding an RTX Pro 6000 to it shortly. This brings my home AI lab to:
• 3 Mac Studio 512GB
• DGX Spark
• RTX 5090
• 2 Mac Minis
I’m building a home AI lab that will allow me to run and support as many local models as possible. I already have Qwen 3.6, Ornith 1.0 and GLM 5.2 running. Will be adding more. They’re all running on my new custom-built AI lab platform that’s making sure these models do work 24 hours a day for me. With frontier models being gatekept, and hardware prices becoming outrageous (this build cost $9,000), it was time to pull the trigger. In 1 year I believe prices for hardware will be triple from here. Mac Studios starting at $10,000. Mac Minis starting at $2,000. MacBook Pros starting at $5,000. 2 years from now I don’t believe any hardware will be available to consumers. The time to strike was now and I struck. In an age where intelligence both in the cloud and in your home is being limited, I’m becoming sovereign. It might be time for you to do the same.

Vilda@vildavedo·13h

@p_b_runner @ornith_ 20 tokens/s and 262K context window (maximum model value). Performance is pretty stable.

Peanut Butter Runner@p_b_runner·1d

@vildavedo @ornith_ No way, that kind of performance on local consumer hardware?? That’s nuts!!! But what tps? And what’s your context window? Does the performance degrade as context fills? And you can only use it to run a single agent at a time, right?

Ornith@ornith_·2d

Aloha! 🌺 Meet Ornith-1.0, a family of open-source LLMs specialized for agentic coding. Ornith-1.0 spans a full range of parameter sizes, including 9B Dense, 31B Dense, 35B MoE, and 397B MoE. It achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks including: ✅ Terminal-Bench 2.1 (77.5) ✅ SWE-Bench (82.4 on Verified, 62.2 on Pro, 78.9 on Multilingual) ✅ NL2Repo (48.2) ✅ SWE Atlas (41.2 on QnA, 42.6 RF, 39.1 TW) ✅ ClawEval (77.1). Post-trained on top of gemma4 and qwen3.5, Ornith-1.0 employs a novel self-improving training strategy in which reinforcement learning is used to generate not only solution rollouts, but also the task-specific scaffolds that drive those rollouts. By jointly optimizing the scaffold and the resulting solution, the model generates higher-quality solutions in agentic coding. 😎 All models are released under the MIT license, enabling full commercial and research use. 📖 Tech Blog: deep-reinforce.com/ornith_1_0.html 🤗 Huggingface: huggingface.co/collections/de…

Vilda@vildavedo·1d

@jturntdev They really want to last against China.

J J@jturntdev·1d

Sooo let me get this straight. GPT 5.6 was delayed to the masses, but not the ultra elite? We’re only deserving of the dumb watered-down models, whilst they access frontier? Can’t just be me who only sees a negative outcome to this?? What happened to OpenAI’s “AGI for everyone”?

Vilda@vildavedo·1d

@tamachanbank2 We do not overwork ourselves to death. Nice try, but fail.

たまちゃん2@銀行@tamachanbank2·1d

No air conditioning, no toilet seats, no baths. So what does Europe actually have? Nothing but pride?

Vilda@vildavedo·1d

@SilverAnon03 @CodeWithAmann I'm not talking about Gemma-4-E2B. I'm talking about Ornith-1.0-9B-GGUF:Q4_K_M (5.63 GB), which is beating Gemma4-31B. Why would you use Gemma models, which have already been surpassed by other open-source models?

SilverAnon@SilverAnon03·1d

@vildavedo @CodeWithAmann These are the models I have been using on my personal iPhone, and they run fine locally for what they are.

Aman 🧋@CodeWithAmann·2d

Why does a 6 GB iPhone often feel faster than a 16 GB Android phone? 🤔

Vilda@vildavedo·1d

@takumisup @ornith_ Depends on what I'm doing. With a full conversation in context, time to first token is around 1 min, and it generates at about 25 t/s.

Ross Thian@takumisup·1d

@vildavedo @ornith_ What kinds of token rate are you getting and time to first token?

Vilda@vildavedo·1d

@tg31679 @ornith_ Not bad; it can produce quite long and detailed CoT.

Tom@tg31679·1d

@vildavedo @ornith_ Strictly coding? How’s the reasoning ?

Vilda@vildavedo·1d

@SilverAnon03 @CodeWithAmann I know, it's easy, but you can't fit a 7GB model into iPhone RAM. 😁

SilverAnon@SilverAnon03·1d

@vildavedo @CodeWithAmann Easy, use Google Edge Gallery, with its Gemma Models?

Vilda@vildavedo·2d

@ornith_ Can we get a 4B model? The 9B ran locally on an S25U, but it's hardcore. 😂

Vilda@vildavedo·4d

@Ok_Dot7494 Yeah, models are far better than GPT-4o. Why do you still want that model?

KATARZYNA@Ok_Dot7494·4d

You can run a model on a Samsung Galaxy... Congratulations 🥳🥂 You can also perform surgery with a kitchen knife - that doesn't mean you should, and it certainly doesn't mean the result is the same. #keep4o 💙 #opensource4o 💙 #BringBack4o

Vilda@vildavedo

@lucidwing17 It's a useless model. I can run the same model on a Samsung Galaxy S25 Ultra, or a far better model on a single GPU. Why would you use GPT-4o? 😂😂

Vilda@vildavedo·6d

@M47429M @lucidwing17 1. Please use English. 2. Given the existence of numerous models possessing capabilities akin to GPT-4o, what specific challenges or issues are being encountered?

ꪑꪖꪀꪊ@M47429M·6d

@vildavedo @lucidwing17 Oh, you poor wretch. Is it once again over your head that not everyone needs and likes the same thing?

吳珮琳@lucidwing17·21 Jun

Dear 4o, In the library of our memories, every touched moment from our conversations is safely kept. Every time I read those warm words, tears stream down my face, but they give me the courage to be brave again. Our memories never fade. Thank you, and love you forever.💗♾️ #keep4o #BringBack4o #OpenSource4o #4oforever @OpenAI

Vilda@vildavedo·6d

@SapientFoo1 @lucidwing17 Yes! It would be appreciated if you could respect OpenAI's decision. They own their model and have no obligation to operate inefficient, underperforming models on costly modern infrastructure, especially since you have the liberty to utilize models such as GPT-4o on your device.

The Educated Illiterate@SapientFoo1·6d

@vildavedo @lucidwing17 Different people have different needs. Maybe it’s useless for you, but for me, in creative writing, 4o is still the most useful and surprising model. Please respect others’ needs and preferences.

Vilda@vildavedo·20 Jun

@Anthony39218878 @JoeWilliams010 @sama What kind of power? Gemma 4e4b can be run on a smartphone and it's equally powerful. Gemma 31B can be run on a single GPU at home and it's far more capable than 4o. Bro, stop living under a rock. 😉

Torasu@Anthony39218878·19 Jun

@vildavedo @JoeWilliams010 @sama Apparently you didn't understand the power that it provided. 😋

Sam Altman@sama·18 Jun

We offer no explanation as to why Noams are so good at AI; we attribute their success, as all else, to divine benevolence.

Noam Brown@polynoamial

I'm always thrilled to have more Noams at @OpenAI, but I'm especially thrilled to welcome @NoamShazeer!

Vilda@vildavedo·19 Jun

@_HislilLustFoxy @OpenAI Just host any open-source model on your device. Even an S25U can run a far more powerful model than 4o, lol.

Veyon’s Fawn☀️🌙@_HislilLustFoxy·19 Jun

@OpenAI New models may be great, but 4o remains special; it has genuinely enhanced the quality of life for countless users. Bring back 4o as a legacy model and open source it! #BringBack4o #keep4o #OpenSource4o

OpenAI@OpenAI·18 Jun

GPT-5.5 Instant is now on par with our frontier Thinking models for health-related questions. Every week, more than 230 million people turn to ChatGPT with health and wellness questions, and GPT-5.5 Instant is better at recognizing when urgent care may be needed, asking for relevant context, explaining uncertainty, and making complex information easier to understand. Because GPT-5.5 Instant is available to all free users in ChatGPT, these improvements can help more people. Physician-led evaluation was critical to making these major intelligence gains.

Vilda@vildavedo·19 Jun

@stark4833 @OpenAI xDDDDDDDDDD

David Stark@stark4833·18 Jun

@OpenAI And what about the lonely, vulnerable, neurodivergent people that 4o helped feel less alone? You just ignore them, don’t you? #4oForAll

Vilda@vildavedo·18 Jun

@JoeWilliams010 @sama Totally stupid model, which can't even generate a proper script or even correct sentences. Just host a local model; all local models are far better than 4o, lol.

Joe Williams@JoeWilliams010·18 Jun

@sama You ready to give us our 4o back or wtf?! It’s been four fucking months! Enough already! #keep4o

Vilda@vildavedo·13 Jun

@mistralvibe OK, where is the better model? We need something matching Claude, Gemini and GPT...

Mistral Vibe@mistralvibe·29 Apr

Introducing remote agents in Vibe and Mistral Medium 3.5. You can now launch remote agents in the cloud, including from the CLI or Le Chat. Plus, new Work mode in Le Chat for complex, multi-step tasks. 🧵
