Chris Scott

292 posts

@greatscottdev

Software Engineer | LLM hobbyist | Problem Solver

Florida · Joined October 2023

153 Following · 82 Followers

Chris Scott@greatscottdev·2d

@Alibaba_Qwen @arena Yes, if you are actually going to release weights for the 397b variant! That is what I am still running at home, but I am stuck on 3.5.

2.8K

Qwen@Alibaba_Qwen·2d

🚀🚀 Qwen3.7 Preview lands on Arena! Here come Qwen3.7-Max-Preview & Qwen3.7-Plus-Preview. Alibaba now #6 lab in Text, #5 in Vision. ⚡️⚡️ Can't wait to release Qwen3.7 series models! Stay tuned! @arena

Arena.ai@arena

Qwen3.7 Preview by @Alibaba_Qwen lands on Arena for Text and Vision.

In Text Arena, Qwen3.7 Max Preview ranks #13 overall. Alibaba is now the #6 lab in this arena.
- #7 Math
- #9 Expert
- #9 Software & IT
- #10 Coding

In Vision Arena, Qwen3.7 Plus Preview ranks #16 overall, making Alibaba the #5 lab.

Congrats to the @Alibaba_Qwen team on the latest progress!

198

378

3.4K

597.6K

Chris Scott@greatscottdev·6d

Who’s going?? @jeffdunham

Chris Scott@greatscottdev·14 May

@jahooma Missing @XiaomiMiMo. Is that to keep it in the lead?

108

James Grugett@jahooma·13 May

DeepSeek v4 **Flash** is absolutely insane. It costs almost nothing (~1/300th Opus), and yet performs among the best open source models. On our coding benchmark Flash does better(!) than Pro

743

53.8K

Chris Scott@greatscottdev·13 May

@code_barbarian @joinhello_miami @remix_run @mongoosejs Enjoyed the talk, learned some things I didn’t know about both @mongoosejs and @remix_run

Valeri Karpov@code_barbarian·12 May

Check out my talk on @remix_run + @mongoosejs in wynwood tonight 🌴

Remix Miami@remix_miami

Join us next week as @code_barbarian teaches us how to work with data in @remix_run 3 and what really matters! Join me at Working with Data in Remix 3: What Actually Matters | Remix Miami meetu.ps/e/Q1D3G/LpYmM/i luma.com/u6a3zdv3

Chris Scott@greatscottdev·10 May

@Baidu_Inc Where are the weights? Can’t drop without Hugging Face links??

290

Baidu Inc.@Baidu_Inc·9 May

ERNIE 5.1 just dropped. Built on ERNIE 5.0's pre-training foundation, our latest foundation model upgrades search, reasoning, knowledge Q&A, creative writing, and agentic capabilities, while using only around 6% of the pre-training cost of comparable models. More in the thread 🧵

756

116.3K

Chris Scott retweeted

James Long@jlongster·8 May

shipped another worktree improvement: opencode will show all worktrees available from git, so you don't have to create them through opencode anymore! use whatever tool you want to name and organize them

354

42.8K

Chris Scott@greatscottdev·8 May

@philipkiely @TheAhmadOsman Yes more gpus!!

Philip Kiely@philipkiely·8 May

Talked inference with @TheAhmadOsman Our conclusion: buy a LOT of GPUs.

277

24.9K

Chris Scott@greatscottdev·8 May

@mattpocockuk @techgirl1908 I think so, for now. All the memory implementations I have used end up causing more problems than they solve, with the exception of a single loop to complete a single outcome.

Matt Pocock@mattpocockuk·7 May

@techgirl1908 I am currently intrigued but sceptical about memory, specifically about engineering. Is it not better to optimise for the most common, predictable, cheapest state - no memory?

3.2K

Angie Jones@techgirl1908·6 May

The more I work with agents, the more I'm convinced that "just give it more context" can't be the whole answer. I'm not seeing enough discourse about memory. More specifically, memory design... like what gets stored, what gets retrieved, what gets summarized, what triggers the agent to look things up again. I'll be spending time with @oracledevelopers soon, getting hands-on with agentic memory patterns. Very excited to get into the weeds!

111

13.4K

Chris Scott@greatscottdev·7 May

@vllm_project @lightseekorg sm120 support? Or only enterprise Blackwell?

108

vLLM@vllm_project·6 May

🚀 Excited to be the exclusive day-0 launch partner for @lightseekorg's TokenSpeed project! We've integrated TokenSpeed's MLA library, optimized specifically for agentic workloads with long context and multi-turn, purpose-built for Kimi 2.5/2.6 and DeepSeek R1 on NVIDIA Blackwell hardware! Try it out today with our preview image - nightly support coming soon!

LightSeek Foundation@lightseekorg

Introducing TokenSpeed, a speed-of-light LLM inference engine.
> TensorRT LLM level performance
> vLLM level usability
> Built by a lean and mission-driven team in two months
> MIT license, open-source
github.com/lightseekorg/t… lightseek.org/blog/lightseek…

199

32K

Chris Scott@greatscottdev·5 May

This sounds very interesting

Chris Scott retweeted

Christian Wilson@csw868·5 May

@code_barbarian is going to give us insight into handling data in @remix_run 3! Tuesday May 12th at @remix_miami located at the Dock Wynwood. Come join us!! Sign-ups below

1.9K

Chris Scott@greatscottdev·3 May

@csw868 They probably don't clean up in many other areas either.

Christian Wilson@csw868·3 May

people who don’t clean the community grill after use are the lowest of the low

Chris Scott@greatscottdev·1 May

@zUnEm01 For me it comes down to Qwen 3.5 397b. It is what I currently run locally. I would love for 3.6 to get an open-weights release. I thought @Alibaba_Qwen was all about OSS?? It offers the best combination of speed / code / instruction following of the options I can self-host.

1.5K

zUn@zUnEm01·1 May

GLM 5 could be better, but it is unreliable asf! Kimi k2.6 could be better, but it has issues with understanding and following instructions; it overdoes things and destroys my repo. Deepseek is the winner here because it understands and follows instructions, and the 1m context is a plus for me. The only problem with Deepseek is that it doesn't have vision.

Kasif@md_kasif_uddin

Be honest, which is the best open source AI Model?

506

58.8K

Chris Scott@greatscottdev·1 May

@0xSero I did give this a quick test. What did you run it on? Sliding window attention does not have the best support for sm120.

0xSero@0xSero·30 Apr

MiMo-V2.5
Supports: vision + video + audio
Speeds:
- 128 tok/s decode
- 367 tok/s prefill & 30K cache
- FP8 weights FP16 cache
- 65% on terminal-2
- 58% on swe-pro
- 62% on claw-eval
- 376gb VRAM
- 256k context
Tying or closely trailing Kimi-K2.6
Excellent work by Xiaomi

264

9.6K

Chris Scott@greatscottdev·1 May

@keegabit @felipevalr Great time at @AIEMiami. I was going through pictures and had forgotten about this one.

234

Chris Scott@greatscottdev·30 Apr

@rot13maxi @QuixiAI Not really, 397 is the most useful option.

854

Rijndael@rot13maxi·30 Apr

you know what would be awesome? qwen3.6-122b-a10b

297

23.4K

Chris Scott@greatscottdev·28 Apr

What?? This is interesting. I do use Warp, though not for any of the agent features. But I was one of the ones pushing (in GitHub issues) to allow custom local / endpoints.

Warp@warpdotdev

Warp is now open-source.

206

Chris Scott@greatscottdev·27 Apr

@simeonGriggs Had a very similar experience
