Chris Scott

292 posts

Chris Scott banner
Chris Scott

Chris Scott

@greatscottdev

Software Engineer | LLM hobbyist | Problem Solver

Florida Katılım Ekim 2023
153 Takip Edilen82 Takipçiler
Chris Scott
Chris Scott@greatscottdev·
@Alibaba_Qwen @arena Yes if you actually going to release weights for 397b variant! This is what I am still running at home but stuck on 3.5
English
0
0
0
2.8K
Qwen
Qwen@Alibaba_Qwen·
🚀🚀Qwen3.7 Preview lands on Arena ! Here come Qwen3.7-Max-Preview & Qwen3.7-Plus-Preview. Alibaba now #6 lab in Text, #5 in Vision.⚡️⚡️ Can't wait to release Qwen3.7 series models!Stay tuned! @arena
Arena.ai@arena

Qwen3.7 Preview By @Alibaba_Qwen lands on Arena for Text and Vision. In Text Arena, Qwen3.7 Max Preview ranks #13 overall. Alibaba is now the #6 lab in this arena. - #7 Math - #9 Expert - #9 Software & IT - #10 Coding In Vision Arena: Qwen3.7 Plus Preview ranks #16 overall, making Alibaba the #5 lab. Congrats to the @Alibaba_Qwen team on the latest progress!

English
198
378
3.4K
597.6K
James Grugett
James Grugett@jahooma·
DeepSeek v4 **Flash** is absolutely insane. It costs almost nothing (~1/300th Opus), and yet performs among the best open source models. On our coding benchmark Flash does better(!) than Pro
James Grugett tweet media
English
84
42
743
53.8K
Chris Scott
Chris Scott@greatscottdev·
@Baidu_Inc Where are the weights? Can’t drop without hugging face links??
GIF
English
0
0
2
290
Baidu Inc.
Baidu Inc.@Baidu_Inc·
ERNIE 5.1 just dropped. Built on ERNIE 5.0's pre-training foundation, our latest foundation model upgrades search, reasoning, knowledge Q&A, creative writing, and agentic capabilities, while using only around 6% of the pre-training cost of comparable models. More in the thread 🧵
Baidu Inc. tweet media
English
51
99
756
116.3K
Chris Scott retweetledi
James Long
James Long@jlongster·
shipped another worktree improvement: opencode will show all worktrees available from git you don't have to create them through opencode anymore! use whatever tool you want to name and organize them
English
14
17
354
42.8K
Chris Scott
Chris Scott@greatscottdev·
@mattpocockuk @techgirl1908 I think so for now. All the memory implementations I have used end up causing more problems than helping with the exception of a single loop to complete a single outcome.
English
0
0
1
10
Matt Pocock
Matt Pocock@mattpocockuk·
@techgirl1908 I am currently intrigued but sceptical about memory, specifically about engineering. Is it not better to optimise for the most common, predictable, cheapest state - no memory?
English
9
0
14
3.2K
Angie Jones
Angie Jones@techgirl1908·
The more I work with agents, the more I'm convinced that "just give it more context" can't be the whole answer. I'm not seeing enough discourse about memory. More specifically, memory design... like what gets stored, what gets retrieved, what gets summarized, what triggers the agent to look things up again. I'll be spending time with @oracledevelopers soon, getting hands-on with agentic memory patterns. Very excited to get into the weeds!
English
25
5
111
13.4K
vLLM
vLLM@vllm_project·
🚀 Excited to be the exclusive day-0 launch partner for @lightseekorg's Tokenspeed project! We've integrated Tokenspeed's MLA library, optimized specifically for agentic workloads with long context and multi-turn, purpose-built for Kimi 2.5/2.6 and DeepSeek R1 on NVIDIA Blackwell hardware! Try it out today with our preview image - nightly support coming soon!
vLLM tweet media
LightSeek Foundation@lightseekorg

Introducing TokenSpeed, a speed-of-light LLM inference engine. > TensorRT LLM level performance > vLLM level usability > Built by a lean and mission-driven team in two months > MIT license, open-source github.com/lightseekorg/t… lightseek.org/blog/lightseek…

English
9
27
199
32K
Chris Scott
Chris Scott@greatscottdev·
This sounds very interesting
English
0
0
1
29
Chris Scott
Chris Scott@greatscottdev·
@csw868 probably don't clean up in many other areas.
English
1
0
1
17
Christian Wilson
Christian Wilson@csw868·
people who don’t clean the community grill after use, are the lowest of the low
English
2
0
3
92
Chris Scott
Chris Scott@greatscottdev·
@zUnEm01 for me it come to Qwen 3.5 397b. It is what I currently run locally. I would love for the 3.6 to hit open weights. @Alibaba_Qwen I thought was all about OSS?? It offers the best in option for me in speed / Code / Instruction following out of other options I can self host.
English
0
0
1
1.5K
zUn
zUn@zUnEm01·
GLM 5 could be better but it is very unreliable asf! Kimi k2.6 could be better but it just has issues with understanding and following instructions, it over does stuffs and destroys my repo. Deepseek is a winner here because it understands and follows instructions with 1m context it's a plus for me. The only problem with Deepseek is this: it doesn't have vision.
Kasif@md_kasif_uddin

Be honest, which is the best open source AI Model?

English
43
15
506
58.8K
Chris Scott
Chris Scott@greatscottdev·
@0xSero I did give this a quick test. What did you run it on? Sliding window attention does not have the best support for sm120.
English
0
0
0
71
0xSero
0xSero@0xSero·
MiMo-V2.5 Supports: vision + video + audio Speeds: - 128 tok/s decode - 367 tok/s prefill & 30K cache - FP8 weights FP16 cache - 65% on terminal-2 - 58% on swe-pro - 62% on claw-eval - 376gb VRAM - 256k context Tying or closely trailing Kimi-K2.6 Excellent work by Xiaomi
0xSero tweet media
English
12
17
264
9.6K
Rijndael
Rijndael@rot13maxi·
you know what would be awesome? qwen3.6-122b-a10b
English
41
10
297
23.4K
Chris Scott
Chris Scott@greatscottdev·
What?? This is interesting. I do use warp, not for any of the agent features. But I was one of the ones pushing to allow for custom local / endpoints. (github issues)
Warp@warpdotdev

Warp is now open-source.

English
1
0
3
206
simeonGriggs
simeonGriggs@simeonGriggs·
The React Miami effect.
simeonGriggs tweet media
English
6
0
29
614