modpotato

455 posts

@modpotatos

modpotato (i have no idea where my other accounts are)

Joined May 2025

52 Following · 18 Followers

modpotato@modpotatos·11h

yyyyyyyikes

Rhys@RhysSullivan

so i'm actually going exponential now


modpotato@modpotatos·14h

mooncake by the kimi guys has done this for ages now

NVIDIA AI Developer@NVIDIAAIDev

If VRAM isn’t eaten by weights, it can go to KV cache and batch size. FlexTensor’s planned tensor offload displaces weight storage into host RAM, so inference stacks like vLLM can scale context and throughput on fixed hardware instead of immediately jumping to multiple GPUs.


modpotato@modpotatos·14h

what happened to this

Skyler Miao@SkylerMiao7

M2.7 open weights coming in ~2 weeks. still actively iterating just updated a new version on yesterday — noticeably better on OpenClaw.


modpotato@modpotatos·16h

the question is does sonnet call opus enough to get cache hits or not

Claude@claudeai

We're bringing the advisor strategy to the Claude Platform. Pair Opus as an advisor with Sonnet or Haiku as an executor, and get near Opus-level intelligence in your agents at a fraction of the cost.


modpotato@modpotatos·16h

@SolRuck yet it misses the verified check and cant work for users because it has the APP tag


Sol Ruck@SolRuck·1d

You shouldn’t trust Discord screenshots blindly btw :) Took me like 2 hours to make this lol


modpotato@modpotatos·1d

MSA seems to have proven that model level attention works, so why dont we let the model weight its attention as a tag-based attention map and use the prompt as an attention cache?


modpotato@modpotatos·1d

WHERE DO YOU EXFILTRATE 10 PETABYTES OF DATA TO??

Polymarket@Polymarket

JUST IN: Hacker allegedly breaches Chinese state-run supercomputer & steals over 10 petabytes of sensitive data, including "highly classified defense documents & missile schematics"


modpotato@modpotatos·1d

@Holden0Day sorry, 20tb (at fp16) inference weights? fp8 10TB mxfp4/nvfp4/int4 5TB


Michael@Holden0Day·1d

@modpotatos model weights are often stored in lower precision


modpotato@modpotatos·1d

i dont even know HOW they would leak what, 10T params, fp32.. solid 40TB of weights 😭

David Ondrej@DavidOndrej1

if you work at Anthropic and leak Mythos weights you will go down in history.
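
The size estimates in this thread (40TB at fp32, 20TB at fp16, 10TB at fp8, 5TB at 4-bit) can be checked with a small sketch. It assumes the 10T parameter count guessed in the post, which is not a confirmed figure, and uses decimal terabytes. The function name `weight_size_tb` is hypothetical, and the 4-bit formats are treated as exactly half a byte per parameter, ignoring the small per-block scale overhead that formats like mxfp4 and nvfp4 add.

```python
# Bytes per parameter for each storage format mentioned in the thread.
# Assumption: 4-bit formats (mxfp4/nvfp4/int4) count as exactly 0.5 bytes;
# per-block scale factors are ignored.
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "fp8": 1.0, "4-bit": 0.5}


def weight_size_tb(num_params: float, fmt: str) -> float:
    """Return raw weight storage in decimal terabytes (1 TB = 1e12 bytes)."""
    return num_params * BYTES_PER_PARAM[fmt] / 1e12


if __name__ == "__main__":
    params = 10e12  # hypothetical 10T parameters, the guess used in the thread
    for fmt in BYTES_PER_PARAM:
        print(f"{fmt}: {weight_size_tb(params, fmt):.0f} TB")
```

Under these assumptions the numbers in both posts line up: halving the precision halves the storage.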


modpotato@modpotatos·2d

@cqcqcqdx georg who


RossRadio@cqcqcqdx·3d

70s era circuitboards were a magnificent era. Inside the Pulsar Calculator watch from 1975.


modpotato@modpotatos·2d

it fits perfectly


modpotato reposted

i2cjak@i2cjak·2d

ts about to be in every coffee maker for all time

adafruit industries@adafruit

Upcoming ESP32-S31 dual-core RISC-V MCU offers Gigabit Ethernet, WiFi, Bluetooth, and 802.15.4 connectivity @cnxsoft blog.adafruit.com/2026/04/06/upc…


modpotato@modpotatos·2d

i dont remember if i even signed up for garryslist. ps: unsubscribe link opens the login page


modpotato@modpotatos·2d

@Space_SIGINT @bubbleboi i think they mean the compute running the llm


Space Oddity🇺🇸⚓⚡🪶⚛️@Space_SIGINT·2d

@bubbleboi You realize that the LLM does all the "compute power" not your workstation, eh?


bubble boi@bubbleboi·2d

I’ve decided to throw all the compute power I have at getting Mythos to provide a solution to the Navier Stokes existence and smoothness problem. I will let it run until tomorrow morning and update you guys with what it produces.


modpotato reposted