Andrey Baksalyar

253 posts

Andrey Baksalyar

Andrey Baksalyar

@baksalyar

Telegram — @baksalyar, Instagram — @baksalyar

Ayrshire, UK Katılım Haziran 2011
722 Takip Edilen28 Takipçiler
Command Code
Command Code@CommandCodeAI·
The next Command Code deal drops. Worth the wait. - Monday June 15, 2026 - 10 AM PT You'll want to be online for this.
Command Code tweet media
English
15
11
129
4.5K
Ahmad Awais
Ahmad Awais@MrAhmadAwais·
stripe still has no dark mode on the dashboard in 2026 dev first company btw opened it at 12am and got absolutely flashbanged retinas are cooked lmao fix your shit @stripe
English
7
1
49
3.3K
Andrey Baksalyar
Andrey Baksalyar@baksalyar·
@ArtificialAnlys How long do you think it'll take to run all the models through the Deep SWE Bench again? I'm really curious to see how things shake out in the second tier once the Chinese open-source models enter the picture.
English
0
0
3
4.4K
Artificial Analysis
Artificial Analysis@ArtificialAnlys·
Today is the first time our Intelligence Frontier chart has moved backward.
Artificial Analysis tweet media
English
41
135
2.3K
256.6K
Andrey Baksalyar
Andrey Baksalyar@baksalyar·
@CommandCodeAI I asked it to find bugs in the codebase — no results in the end (exploration got interrupted) and -$5 drained my account credits 😅 I know it's not real money in cmd, but still. It doesn't make any sense if it's no better than GPT 5.5.
English
1
0
0
235
Command Code
Command Code@CommandCodeAI·
Kimi K2.7 Code is now in available in Command Code. 10x free credits in Go. Our new #1 open mode in internal benchmarks. cmd update to v0.37.0 select via /model • 256K context 🍃 • 30% lower reasoning tokens than K2.6 ✅ • Open weights 1T-parameter MoE - 32B active ⚡
Command Code tweet media
English
15
13
224
16.9K
Andrey Baksalyar
Andrey Baksalyar@baksalyar·
@MrAhmadAwais I asked it to find bugs in the codebase — no results in the end (exploration got interrupted) and -$5 drained my account credits 😅 I know it's not real money in cmd, but still. It doesn't make any sense if it's no better than GPT 5.5.
English
1
0
0
183
Ahmad Awais
Ahmad Awais@MrAhmadAwais·
ok kimi k2.7 code seems like a real deal. it's already dethroned k2.6 in every test i've thrown at it. still benchmarking, but my early money is on this being the new top open model.
Command Code@CommandCodeAI

Kimi K2.7 Code is now in available in Command Code. 10x free credits in Go. Our new #1 open mode in internal benchmarks. cmd update to v0.37.0 select via /model • 256K context 🍃 • 30% lower reasoning tokens than K2.6 ✅ • Open weights 1T-parameter MoE - 32B active ⚡

English
8
3
154
8.1K
OpenAI
OpenAI@OpenAI·
We heard you wanted to use Codex rate limit resets on your own time. Starting today, we’re rolling out the ability to save rate limit resets to use later. We’re starting Go, Plus, Pro, and Business users with one free reset:
English
1.2K
1.7K
21.4K
4.2M
Abdur Rahim
Abdur Rahim@_ARahim_·
I kept spotting typos in my Claude Code prompts after I'd sent them. So I built tuipo — Grammarly for your terminal It underlines typos as you type in any TUI — CC, Codex, Aider, vim, your shell — and never touches the app it wraps. github.com/ARahim3/tuipo @steipete @bcherny
English
86
11
407
123.9K
Karthik
Karthik@rachamka·
@theo its better but not as crazy as marketed
English
1
0
5
6.7K
Theo - t3.gg
Theo - t3.gg@theo·
How are you guys feeling about Fable so far?
English
839
15
2K
476.3K
Andrey Baksalyar
Andrey Baksalyar@baksalyar·
@harisamjed @MrAhmadAwais The same experience. It's good for big codebase scans though — searching for bugs, security holes, and issues with extensive reports, which are later rechecked by smarter models — ideal tasks to leverage its speed.
English
0
0
1
3
Haris Amjed
Haris Amjed@harisamjed·
@MrAhmadAwais Well for some reason nemotron never worked for me. Yesterday I fell for Ultra version hype and gave it a complex task. It burned $4 of credits and messed up several files. I interrupted it mid task and change model to dsv4flash which fixed everything.
English
1
0
1
59
Ahmad Awais
Ahmad Awais@MrAhmadAwais·
what's your favorite open model for code now?
English
86
1
76
15.5K
Andrey Baksalyar
Andrey Baksalyar@baksalyar·
@ArtificialAnlys Where is MiniMax M3? Why did you ignore it? Are you pretending you didn't notice the release of one of the top Chinese LLMs? 😄
English
1
0
1
190
Artificial Analysis
Artificial Analysis@ArtificialAnlys·
Alibaba's Fun-Realtime-TTS takes the #1 spot on the Artificial Analysis Speech Arena Leaderboard, surpassing Google's Gemini 3.1 Flash TTS and Inworld's Realtime TTS-2 Research Preview Competition at the top of the TTS Arena is tighter than ever, with just 24 Elo points separating the top five models. Fun-Realtime-TTS takes the top spot with the highest Elo score on the leaderboard. @Ali_TongyiLab @AlibabaGroup's previous Fun-Realtime-TTS-Preview reached #7 on the leaderboard, making this Alibaba's first #1 model in the Artificial Analysis Speech Arena. Fun-Realtime-TTS is available via Alibaba Cloud with API access for developers. Key takeaways: ➤ Quality: Fun-Realtime-TTS has an Elo score of 1,219 (+16/-16) based on 962 arena appearances, placing it ahead of Gemini 3.1 Flash TTS at 1,214, Inworld Realtime TTS-2 Research Preview at 1,209, and Cartesia Sonic 3.5 at 1,203 ➤ Pricing: Fun-Realtime-TTS is priced at $27.59/1M characters, positioning it between Gemini 3.1 Flash TTS at $18.3/1M characters and Inworld Realtime TTS 1.5 Max at $35/1M characters, while remaining below Sonic 3.5 at $39/1M characters. ➤ Features: Fun-Realtime-TTS supports real-time speech generation with voice cloning, voice design, multilingual output, and support for regional accents and dialects. See more details and listen to samples below 🧵
Artificial Analysis tweet media
English
12
17
169
27.8K
Andrey Baksalyar
Andrey Baksalyar@baksalyar·
@OxxoTweets So what does that even mean? 38T of garbage tokens and synthetic pretraining slop just to inflate the numbers and pretend the model has some "unique level of intelligence"? I tried it, and it couldn't even write a simple Python script — it got stuck on f-string formatting, lol.
English
0
0
0
17
Andrey Baksalyar
Andrey Baksalyar@baksalyar·
@NiP73 What does it change? This thing can’t even produce a simple Python script after multiple failed attempts and can’t use tool calls, lol. Are you all insane? Wake up—it’s junk.
English
0
0
0
7
Andrey Baksalyar
Andrey Baksalyar@baksalyar·
@Codedigipt Yeah, sure, “great performance”. It's a trash, man, it can't code it can't use tools, it hallucinates AF.
English
0
0
0
7
Andrey Baksalyar
Andrey Baksalyar@baksalyar·
@NeoAIForecast @liquidai @AMD Man, it can't even make a simple, working python script after multiple attempts. What “much capability” are you talking about? It's just crap.
English
1
0
1
114
Neo
Neo@NeoAIForecast·
Benchmarking the new LFM2.5-8B-A1B MoE by @liquidai on an @AMD RX 7800 XT using ROCm and llama.cpp. A few interesting results: >Q5_K_M delivered the fastest generation speed at ~200 tok/s >Q8_0 pushed over 6,000 tok/s prompt processing >Even a 16GB card can run these smaller MoE models comfortably >Shows just how much capability is available without needing flagship hardware Running more tests on the AMD stack and seeing how far ROCm has come
Neo tweet media
English
13
6
108
7.7K