Jack

416 posts

@itsjack

Puts a lot of effort into being lazy 🤖 Making https://t.co/9WY5jNIPMe 🎙️ Building products @MaketheProduct | ex-Product @ Vinted, The Next Web, Just Eat 🇪🇺

Amsterdam, The Netherlands · Joined August 2019
180 Following · 198 Followers
Jack @itsjack
@StijnSmits i'm also doubtful, but secretly hoping for a real-life pied piper
1
0
1
15
Jack @itsjack
has a 12M token context window
benches at frontier level for coding
uses 1000x less compute

if this is real then it is a genuine breakthrough and this company is about to make a lot of money very fast

Alexander Whedon @alex_whedon
Introducing SubQ - a major breakthrough in LLM intelligence.

It is the first model built on a fully sub-quadratic sparse-attention architecture (SSA), and the first frontier model with a 12 million token context window which is:
- 52x faster than FlashAttention at 1MM tokens
- Less than 5% the cost of Opus

Transformer-based LLMs waste compute by processing every possible relationship between words (standard attention). Only a small fraction actually matter. @subquadratic finds and focuses only on the ones that do.

That's nearly 1,000x less compute and a new way for LLMs to scale.
1
0
1
119
Jack @itsjack
I've used PostHog for several years at this point, still do across my active work, huge advocate

But I hardly ever log in. Agents present me the data I want, raise issues, improvements, and even PRs with me in the loop

So this is probably the right move for PostHog in all this, and they make a killer product so I am curious

PostHog @posthog
Introducing PostHog Code, the product editor that:
- Understands your product
- Identifies usage patterns
- Triages bugs and errors for you
- Creates PRs to fix them
- Continuously monitors and improves your product

Join the waitlist: posthog.com/code
0
0
1
51
Jack @itsjack
@redtachyon that's fair if you couldn't be there for other reasons :)
0
0
0
70
Ariel @redtachyon
@itsjack More that I'm on vacation and already very jetlagged, this would be many timezones away and I'm not giving up the vacation for a nerd party
1
0
3
299
Ariel @redtachyon
Wait, so you're telling me I should have signed up for the 5/5 codex party even if I knew there's no way I'd attend (geographically)? Fml, lesson learned
13
2
190
8.2K
Jack @itsjack
@techikansh i live 5k miles away and i still applied 🥸
0
0
1
129
Jack @itsjack
@championswimmer
1. Composer 2 is only using 1/3 Kimi 2.5, combined with Cursor's own proprietary data / investment
2. Composer 2 is going under RL in real-time, so the model gets better over time
3. Composer 2 is cheaper than 2.6

So no, they are not "the same thing".
0
0
11
556
Arnav Gupta @championswimmer
Cursor Composer 2 is fine-tuned Kimi K2.5

Kimi K2.6 can also be basically defined as the same thing? (Essentially sharing the same pre-trained base model lineage + new post training)

Cursor hosts the model on Fireworks. So if you use Kimi K2.6 directly on Fireworks yourself, then you are sorted.

What is the Cursor moat? Just that it has sold subscriptions and has some captive audience?

And the benchmarks below are a lesson that it is hard/impossible to beat the model maker themselves at fine tuning their own model (unsurprising).
Arnav Gupta tweet media
33
10
278
26.4K
Jack @itsjack
@pa1ar @Jaytel fair if you need it, but i find myself not really needing it
0
0
0
12
Jaytel @Jaytel
4.7 is completely unusable
437
170
5.8K
965.4K
Jack @itsjack
so i think i basically have unlimited 5.5 now then 🤔 what /goal do i set?
Jack tweet media
2
0
3
104
Jack @itsjack
@pa1ar @Jaytel even better! 1m context works less well for me than the 200k
1
0
0
48
Jack @itsjack
@_rain_miao_ @Jaytel i am doing that more and more tbh, just need to keep fighting the bento-boxes and overly wordy meta text it places everywhere 😅
0
0
1
28
Jack @itsjack
@RefinerApp @Jaytel @Ydj79 i found the 200k context 4.6 to be working well, opus 4.7 has been super inefficient for me
0
0
0
442
Reality King @RealityPolice1
@Jaytel @Ydj79 @itsjack 4.7 one shotted a CadQuery class that builds the mating surface required for the edges of two panels connecting at an angle. No other model has been able to do that. Skill issue.
1
0
0
30
Jack @itsjack
@jerzydejm @Jaytel yeah can work well but i haven't found luck with that for sites with existing designs. great for zero to one though
0
0
0
25
Jack @itsjack
@satory_ua @Jaytel it can for zero to 1 imo, where image gen works for creating new concepts. haven't got it to work reliably for existing sites yet though
0
0
0
24
Jack @itsjack
@JustinGorya @Jaytel yeah i think so too for new concepts, but found it harder to work using image to code gen on existing sites. do you find that too or am i missing something?
1
0
1
112
Klaas @forgebitz
token spend is like lines of code

it's a very impressive sounding metric if you don't know what you are doing

lots of companies are going to get burned with this "target"
18
0
35
2.1K