Arkin Terli

341 posts

Arkin Terli banner
Arkin Terli

Arkin Terli

@arkinterli

Technologist. Generalist. Believer that true art lies in ultimate simplification. Personal opinions. Formerly at ArgoAI, Apple, IMG, AMD, and Microsoft 🇺🇸🗽

San Francisco, CA Se unió Şubat 2010
475 Siguiendo185 Seguidores
Simplifying AI
Simplifying AI@simplifyinAI·
Microsoft just solved the context window problem. Right now, every AI suffers from a fatal flaw: the "context window problem." When an AI reasons through a complex problem, it generates a massive chain-of-thought. But there is a catch. It has to keep every single token of that thought in its active memory. The technical term is the "KV Cache." The longer the AI thinks, the heavier it gets. It slows down. It gets expensive. Eventually, it runs out of space. We thought the only fix was renting bigger, more expensive cloud GPUs to hold all that context. Microsoft just proved us wrong. They published a paper called "MEMENTO." Instead of giving the AI a bigger memory, they taught it how to forget. Here is how it works: Instead of generating one endless stream of consciousness, a Memento-trained model breaks its reasoning into small blocks. After it finishes a block, it writes a dense, highly compressed summary of its own logic—a "memento." Then, it does something unprecedented. It physically deletes the entire previous reasoning block from its memory cache. It only carries the memento forward. The model reasons, extracts the core logic, and instantly drops the dead weight. The results rewrite the economics of running AI. • Context length compressed by 6x. • Active memory usage (KV cache) reduced by 2.5x. • Zero loss in math, science, or coding accuracy. And here is the real implication. Big tech has been charging you by the token for massive context windows you don't actually need. With this architecture, small businesses and solo operators can run complex, multi-step autonomous agents entirely locally. You don't need an enterprise cloud setup. A standard machine running an open-source model can now reason indefinitely without overflowing its memory. No API fees. Complete privacy. We spent the last two years trying to give AI an infinite memory. It turns out, the secret to smarter AI isn't remembering everything. It's knowing exactly what to forget.
Simplifying AI tweet media
English
17
65
263
15.2K
Arkin Terli
Arkin Terli@arkinterli·
@Zai_org Great model, but it breaks down above 100k context.
English
1
0
3
4.7K
⭕ Brock Pierson
⭕ Brock Pierson@brockpierson·
Last night I played 4 hours of the original Command & Conquer Red Alert with a friend. I absolutely loved this game. One of the first PC games I truly obsessed over. Released in 1996. Still an incredible game. DId you play it?
English
850
238
7.3K
540.6K
Curiosity
Curiosity@CuriosityonX·
NEWS🚨: Western Australia’s sky turned an eerie shade of red as dust filled the air ahead of Tropical Cyclone Narelle.
English
40
175
1.3K
82.1K
Awni Hannun
Awni Hannun@awnihannun·
According to benchmarks Qwen3.5 4B is as good as GPT 4o. GPT 4o came out ~2 years ago (May 2024). Qwen 3.5 4B runs easily on modern mobile devices. So the gap between frontier intelligence in a datacenter and running a model of equal quality on your iPhone could be 2-3 years. (Probably closer to 3 assuming Qwen3.5 4B is more benchmaxxed than 4o) I don't expect the trend of increasing intelligence-per-watt to change. So in 2-3 years it's plausible we will be running GPT 5.x quality models on an iPhone. Pretty wild.
English
123
147
2K
198.9K
Xor
Xor@XorDev·
@wookash_podcast It seems to me that LLMs actually raise the bar for excellence. The generalized work is not as important, but the advanced knowledge, concepts and nuance are even more important now. There's no shortcuts to those high level skills. It requires starting from the bottom
English
3
1
59
2.5K
Łukasz | Wookash Podcast
Łukasz | Wookash Podcast@wookash_podcast·
I don't get "knowledge is worth nothing now" LLM crowd. Have you ever tried to build something? If you want to build something there are thousands of small decisions, assumptions, tests to make, validate and run. There is a reason why even though you have a calculator, everyone learns in school what's 2+3. You don't want (or can't) in real world reach for calculator for 2+3. So how are you going to make something with zero knowledge? "Hey I want to go to Mars, how do I do that?" "Here are five easy steps to get to Mars in no time!"
English
29
30
423
17.8K
Xor
Xor@XorDev·
Orchard vec3 p,v=normalize(FC.rgb*2.-r.xyx),c=v/v.y;c.z+=.5*t;for(float z,i,b,g,m;i++<5e1;z+=.8*max(b=length((p.y-m)/1e2/(abs(sin(c.xz/.1))-.05/v.y)),min(4.-m,g=length(sin(p.xz)+1.-.1*(1.+sin(p.y-p.zx*.5))*m))-b),o.rgb+=(.7-v)/(g+b))p=z*v+1.,p.z-=t,m=abs(++p.y);o=tanh(o/5e2);
7
41
491
12.3K
Arkin Terli
Arkin Terli@arkinterli·
30+ years in the 21–24 BMI range. Current: 6’2” | 179 lbs | BMI 23 Secret: Eat healthy.
English
1
0
0
95
Scott Adams
Scott Adams@ScottAdamsSays·
A Final Message From Scott Adams
Scott Adams tweet mediaScott Adams tweet media
English
13.2K
32K
191.6K
43M
Arkin Terli
Arkin Terli@arkinterli·
❄️🎄 Merry Christmas 🎄❄️
Eesti
0
0
0
77
Arkin Terli
Arkin Terli@arkinterli·
@XorDev You are about to get into the demoscene world.
English
1
0
1
102
Xor
Xor@XorDev·
"Twist" for(float i,z,d,s;i++<1e2;){vec3 c=vec3(1,3,5)+s/50.,p=z*normalize(FC.rgb*2.-r.xyy),a=normalize(cos(c));p.z+=30.;a=a*dot(a,p)-cross(a,p),a.xy*=mat2(cos(t+vec4(0,33,11,0))),a=abs(a);z+=d=.05+.1*abs(abs(a.z-20.)-cos(s=a.x+a.y));o.rgb+=(cos(.1*i-c)+1.)/d/d;}o=tanh(o/6e3);
Čeština
6
19
240
7.3K
Florian Berger
Florian Berger@flockaroo·
vec3 q=vec3(0,0,1e4),v=FC.gbr-r.yxx*.3,p;for(float i,s,d;i++<53.;d=.2*t){for(p=q,s=6e3;4.<s;p=p.zxy-s*sin(p/s*6.3)*.05,s*=d=.8)p.yz*=rotate2D(d);d=length(vec2(length(p.yz)-1e4,p.x))-3e3;v.x+=v.x*step(7e2,q.x)*(1.1-mod(FC.y,2.)*2.);q+=sin(i)+v/r.x*d;o+=exp(-d*d/vec4(3,2,1,1))/i;}
41
155
1.9K
57.6K
Arkin Terli
Arkin Terli@arkinterli·
@_trish_xD C++ can be as easy as any other language if you have enough experience. It’s your experience, not the language itself, that makes things complicated.
English
0
0
0
75
trish
trish@_trish_xD·
The Happy Periodic Family of Programming Languages! It’s not just chemistry that has a periodic family anymore — now developers have one too!
trish tweet media
English
14
16
218
11.6K
Xor
Xor@XorDev·
Allman indentation is the only acceptable style
English
23
4
61
9K
@levelsio
@levelsio@levelsio·
Power still down in Spain and Portugal All internet is gone too cause the 4G masts ran out of battery power, at least in Portugal, don't know Spain Phone calls don't even work anymore! Also I heard many gas stations in Portugal can't pump gasoline cause their pumps work on electricity Only internet we have is in the Continente supermarket which I'm typing this on 😂 We went to buy an analog battery radio at Radio Popular and they're all sold out already! 📻
@levelsio tweet media
@levelsio@levelsio

Power is STILL down That makes it I think the worst blackout in Europe since 2006 and maybe longer We're in Portugal and went to supermarket to get water and food in case power doesn't come back (small chance but already hours now) Absolute anarchy ala COVID 2020 there Internet progressively went from 5G to 4G then 3G then 2G then nothing at all progressively, it seems the telecom masts have a battery for just a few hours Anyway everyone's buying water, food and toilet paper ATMs are all empty of cash money Funny we drove past some guys working on an electricity mast, maybe they didn't get the news it's entire Spain and Portugal? 😂

English
259
99
2.5K
772.2K
LaurieWired
LaurieWired@lauriewired·
Major new QEMU update released. The coolest part? Paravirutalized Apple GPUs. You can now spin up disposable macOS VMs *with* hardware acceleration. macOS guests now expose a thin vGPU (apple-gfx-mmio). very useful for CI, reverse engineering, gfx research, etc
LaurieWired tweet mediaLaurieWired tweet media
English
53
431
4.4K
226.1K