tCosta

34.3K posts

@uglesado

For Karine, the world.

Joined April 2011
96 Following · 216 Followers
tCosta reposted
Wanderley Majeski@WanderleyMajes1·
🤣🤣🤣🤣
tCosta@uglesado·
@wesleysimplic @kazzkiq Sorry for the hasty judgment. I had understood it to be a post-trained model. Apparently it's a CLI that normalizes prompt output; I like the initiative, my friend! I can see it being useful with Nemotron Nano, which is crap but has a 1M-token context. Normalizing the output could be good!!
Wesley Simplicio@wesleysimplic·
@uglesado @kazzkiq Try it out and give me feedback with reports, man. Let's clone it, test it, and see with those pretty eyes God gave you.
Claudio@kazzkiq·
If someone made a chip with an embedded Opus 4.6-level model running at 2000 tok/s, but costing R$30k, would you buy it or keep paying for Claude?
tCosta@uglesado·
@kazzkiq Are you talking about the Taalas board? If it were something like 700 reais for a Qwen3.6 27B, definitely. Any more than that isn't worth it compared with API costs: models change every 2 months, and within a year it would be absurdly outdated.
tCosta@uglesado·
@wesleysimplic @kazzkiq Comparisons against Qwen2.5? I dare say benchmaxxing or overfitting. With all due respect, my friend...
Wesley Simplicio@wesleysimplic·
@kazzkiq Man, try it out, clone it. I had an epiphany yesterday, it came from heaven. Look at the new tests.
tCosta@uglesado·
@elielAGI @0xSero Honestly wouldn't be a bad idea if your work relies heavily on coding.
Eliel@elielAGI·
@uglesado @0xSero yeah the cache hits are awesome. tbh could just fully switch to v4 flash and pro in my hermes
0xSero@0xSero·
Deepseek-v4-Flash beats Sonnet and Opus-4.5 (no thinking), and basically matches GPT-5.2 medium. Tomorrow I will have a compression of Flash that'll make it fit well on 1x Spark, at hopefully better quality than alternatives. Join tomorrow: luma.com/reap
tCosta@uglesado·
@elielAGI @0xSero Ngl, it's absurd. I used qwen 3.5 9b as a worker model following scripts and skills made by deepseek v4 pro in hermes. 10 USD lasted me a whole month with light usage.
Eliel@elielAGI·
@0xSero flash>pro? but for the price deepseek might be the most worth it deal in all of ai rn
tCosta@uglesado·
@LottoLabs It's an excellent worker agent, but not smart enough to do things from scratch. My go-to is to have deepseek create scripts and skills, and just make the 9b follow them. Made 10 USD of deepseek last a whole month this way.
Lotto@LottoLabs·
Qwen 9b is a Swiss Army knife. Very good skill following, runs on anything, low resources to finetune, multimodal, fast. A perfect testing ground for the big brother 27b. Can drop down to 4b for edge deployments.
tCosta@uglesado·
@LottoLabs Still, they closed that gap with the 27b! Good news at least!
Lotto@LottoLabs·
@uglesado Definitely will get better results in llama.cpp than LMstudio
tCosta@uglesado·
@codecovenant @lmstudiodevs My point being: I've been using LM Studio for over a year now. The first time I complained about it versus llama.cpp, they released it lol. Had good results with qwen3.6 27b, on par with llama.cpp. But 35b, not so much.
Code and Covenant🇨🇦@codecovenant·
@uglesado @lmstudiodevs There have been a few of these pivots lately, like dflash. Gotta test what even works best. LM Studio is following llama in boosting mtp. I was waiting to see what they choose because it lets me avoid doing the homework. Easy to use their expertise.
LM Studio Developers@lmstudiodevs·
LM Studio with Multi Token Prediction (MTP) is now in beta.
1. Update to 0.4.14+3 in-app
2. Make sure your llama.cpp engine is 2.15.0
3. Turn on MTP when loading a model
Use a model that supports it, like Qwen3.6-35B-A3B-MTP-GGUF or Qwen3.6-27B-MTP-GGUF
tCosta@uglesado·
@brammy999 @siegnant @teortaxesTex Basically the same; the adaptations matured long ago. With the earlier adaptations in the early 2000s, they lasted a tiny bit less than mono fuel. That's not an issue anymore.
tCosta@uglesado·
@sumoto_iitoko It's very common for Brazilians to root for foreign national teams, except Argentina, Portugal, and Germany, of course...
tCosta@uglesado·
@NousResearch @Teknium Embedded calendar skill? So the agent reminds the user with to-dos, reminders, and personalised notes?
Nous Research@NousResearch·
Hermes Agent v0.13.0 - “The Tenacity Release”
Uzi@uzairansar·
First look 👀
AI✖️Satoshi⏩️@AiXsatoshi·
My M5 Max MacBook Pro arrived, so for now I've installed codex, claude, and LM Studio and am downloading LLMs. It's hard to decide which quantization to go with.