Jack

416 posts

Jack

@itsjack

Puts a lot of effort into being lazy 🤖 Making https://t.co/9WY5jNIPMe 🎙️ Building products @MaketheProduct | ex-Product @ Vinted, The Next Web, Just Eat 🇪🇺

Amsterdam, The Netherlands Katılım Ağustos 2019

180 Takip Edilen198 Takipçiler

Jack@itsjack·7h

@StijnSmits i'm also doubtful, but secretly hoping for a real-life pied piper

GIF

English

Stijn@StijnSmits·7h

@itsjack

QME

Jack@itsjack·8h

has a 12M token context window benches at frontier level for coding uses 1000x less compute if this is real then it is a genuine breakthrough and this company is about to make a lot of money very fast

Alexander Whedon@alex_whedon

Introducing SubQ - a major breakthrough in LLM intelligence. It is the first model built on a fully sub-quadratic sparse-attention architecture (SSA), And the first frontier model with a 12 million token context window which is: - 52x faster than FlashAttention at 1MM tokens - Less than 5% the cost of Opus Transformer-based LLMs waste compute by processing every possible relationship between words (standard attention). Only a small fraction actually matter. @subquadratic finds and focuses only on the ones that do. That's nearly 1,000x less compute and a new way for LLMs to scale.

English

119

Jack@itsjack·7h

I've used Posthog for several years at this point, still do across my active work, huge advocate But I hardly ever log in. Agents present me the data I want, raises issues, improvements, and even PRs with me in the loop So this is probably the right move for PostHog in all this, and they make a killer product so I am curious

PostHog@posthog

Introducing PostHog Code, the product editor that: - Understands your product - Identifies usage patterns - Triages bugs and errors for you - Creates PRs to fix them - Continuously monitors and improves your product Join the waitlist: posthog.com/code

English

Jack@itsjack·11h

@redtachyon that's fair if you couldn't be there for other reasons :)

English

Ariel@redtachyon·11h

@itsjack More that I'm on vacation and already very jetlagged, this would be many timezones away and I'm not giving up the vacation for a nerd party

English

299

Ariel@redtachyon·21h

Wait, so you're telling me I should have signed up for the 5/5 codex party even if I knew there's no way I'd attend (geographically)? Fml, lesson learned

English

190

8.2K

Jack@itsjack·13h

@techikansh i live 5k miles away and i still applied 🥸

English

129

Techikansh@techikansh·13h

bruhhh, this is so stupid... you don't register when you live 1000s of miles away... and now you dont even have the limit bump...

Gianluca Andretta@GianlucaNDRT

If it’s a dream, don’t wake me up.

English

187

25.5K

Jack@itsjack·13h

@championswimmer 1. Composer 2 is only using 1/3 Kimi 2.5, combined with Cursor's own proprietary data / investment 2. Composer 2 is going under RL in real-time, so the model gets better over time 3. Composer 2 is cheaper than 2.6 So no, they are not "the same thing".

English

556

Arnav Gupta@championswimmer·22h

Cursor Composer 2 is fine-tuned Kimi K2.5 Kimi K2.6 can also be basically defined as the same thing? (Essentially sharing the same pre-trained base model lineage + new post training) Cursor hosts the model on Fireworks. So if you use Kimi K2.6 directly on Fireworks yourself, then you are sorted. What is the Cursor moat? Just that it has sold subscriptions and has some captive audience? And benchmarks below is a lesson that it is hard/impossible to beat the model maker themselves at fine tuning their own model (unsurprising).

English

278

26.4K

Jack@itsjack·15h

@pa1ar @Jaytel fair if you need it, but i find myself not really needing it

English

Pavel Larionov@pa1ar·15h

@itsjack @Jaytel sometimes it is pleasant to not have that 200k restriction

English

Jaytel@Jaytel·1d

4.7 is completely unusable

English

437

170

5.8K

965.4K

Jack@itsjack·19h

so i think i basically have unlimited 5.5 now then 🤔 what /goal do i set?

English

104

Jack@itsjack·20h

@pa1ar @Jaytel even better! 1m context works less well for me than the 200k

English

Pavel Larionov@pa1ar·1d

@itsjack @Jaytel but you lose the 1M context by downgrading, right?

English

Jack@itsjack·20h

@_rain_miao_ @Jaytel i am doing that more and more tbh, just need to keep fighting the bento-boxes and overly wordy meta text it places everywhere 😅

English

Runkun Miao@_rain_miao_·22h

@itsjack @Jaytel or just yolo everything on 5.5

English

Jack@itsjack·20h

@ryans_dad @Jaytel maybe another one for me to try!

English

pistonsIn6@ryans_dad·22h

@Jaytel @itsjack 4.5 is better

English

Jack@itsjack·20h

@RefinerApp @Jaytel @Ydj79 i found the 200k context 4.6 to be working well, opus 4.7 has been super inefficient for me

English

442

Refiner@RefinerApp·1d

@Jaytel @Ydj79 @itsjack yeah 4.6 isn't much better.

English

463

Jack@itsjack·20h

@RealityPolice1 @Jaytel @Ydj79 cool use case!

English

Reality King@RealityPolice1·1d

@Jaytel @Ydj79 @itsjack 4.7 one shotted a CadQuery class that builds the mating surface required for the edges of two panels connecting at an angle. No other model have been able to do that. Skill issue.

English

Jack@itsjack·20h

@jerzydejm @Jaytel yeah can work well but i havent found luck with that for sites with existing designs. great for zero to one though

English

λthugg-huh?@jerzydejm·1d

@itsjack @Jaytel tbh no longer the case imo, just use gpt image

English

Jack@itsjack·20h

@intrater @Jaytel im using it through t3 code

English

John Intrater@intrater·1d

@Jaytel @itsjack CLI or desktop app?

Indonesia

Jack@itsjack·20h

@satory_ua @Jaytel it can for zero to 1 imo where image gen works for creating new concepts havent it got it to work reliably for existing sites yet though

English

🇺🇦🍉 Geopolitics expert 🍉🇺🇦@satory_ua·1d

@itsjack @Jaytel 5.5 can do cool frontend too

English

Jack@itsjack·1d

@JustinGorya @Jaytel yeah i think so too for new concepts, but found it harder to work using image to code gen on existing sites. do you find that too or am i missing something?

English

112

Justin@JustinGorya·1d

@itsjack @Jaytel 5.5 is also great at UI: x.com/JustinGorya/st…

Justin@JustinGorya

codex is not bad at UI. its just a layer 8 problem. GPT-5.5 literally just cooked me a very nice Landingpage. combined with GPT-image-2. openAI has some UI issues yes. But if you know how to work you are getting very good results already. @sama great work. congrats to whole team 🤝

English

311

Jack@itsjack·1d

@forgebitz /goal spend tokens

English

Klaas@forgebitz·1d

token spend is like lines of code it's a very impressive sounding metric if you don't know what you are doing lots of companies are going to get burned with this "target"

English

2.1K

Keşfet

@StijnSmits @redtachyon @techikansh @championswimmer @pa1ar @Jaytel @_rain_miao_ @ryans_dad