_jerieljan/

3.8K posts

_jerieljan/ banner
_jerieljan/

_jerieljan/

@jerieljan

I write code and build automations, along with various internet and otaku-related commentary. More on: https://t.co/WebNTUD1qJ https://t.co/r0GjJfwhFK

Philippines 参加日 Ocak 2009
373 フォロー中100 フォロワー
_jerieljan/
_jerieljan/@jerieljan·
Probably unpopular take: if I was YouTube, I'd shadowban/derank any video that has a thumbnail that indicates it's LIVE when it's not. Hate seeing these fake indicators. If anything, they're unnecessary and redundant.
English
0
0
0
10
_jerieljan/
_jerieljan/@jerieljan·
Pixel Launcher still having this annoying, unremovable search bar at the bottom in 2026 is a joke in itself, especially when Google search is abhorrent shit.
English
0
0
0
5
_jerieljan/
_jerieljan/@jerieljan·
@simonw @GroqInc @cerebras Unfortunately, Groq stopped trying since Kimi K2.5 and I'm not expecting anything from them at all. Maybe Cerebras. But still, idk. I'd go Fireworks since their serverless inference has been both reliable and fast, and when they do fast(er) versions, it's really snappy.
English
1
0
5
400
Simon Willison
Simon Willison@simonw·
Really looking forward to one of the super-fast custom silicon inference providers like @GroqInc or @cerebras getting GLM 5.2 running Cerebras has GLM-4.7, Groq is still mostly Llama 3.x and gpt-oss
Jeremy Howard@jeremyphoward

Wow. @Zai_org GLM 5.2 is a marvel! It is *at least* as good as Opus 4.8 and GPT 5.5. It's super fast, inexpensive, and not too verbose. It responds with nuance and judgement, & handles long context VERY well. I've never experienced an open weights model like this before.

English
72
41
1K
102.6K
_jerieljan/
_jerieljan/@jerieljan·
@davis7 Wtf happened on June 17...? Damn, that's a lot of tokens.
English
0
0
0
428
_jerieljan/
_jerieljan/@jerieljan·
@theo Haven't watched this yet but if that's $20k n API calls, I smell the possibility of a subscription tier meant for loops. Kinda terrifying to think of, ngl. I don't want another 10-100x of costs thats trying to be normalized.
English
0
0
1
256
Theo - t3.gg
Theo - t3.gg@theo·
If you're curious how I managed to do over $20,000 in inference on the last 48 hours, here's a video all about it. Spoiler: loops are really powerful
English
119
61
2.2K
262.9K
_jerieljan/
_jerieljan/@jerieljan·
@Larfait Is the surprise here the dollar cost or that it's not a percent-based fee or pricing tiers? Depending on who's working and what they're known for, they can command both high prices and percentages.
English
0
0
2
1.6K
Mitchell Hashimoto
Mitchell Hashimoto@mitchellh·
The problem with the "if it works who cares what the code looks like" mindset for agentic work is that it assumes the agent has a perfect understanding of "works." Realistically, things are underspecified, agents make bad assumptions, etc. To be fair, agents are pretty good at unit test coverage. They're pretty bad at designing human experiences (API, CLI flags, etc.), especially cohesive ones for future roadmap plans they may not have visibility into (unless your backlog is perfect and vision fully laid out, which I doubt). They're bad at knowing where performance matters and what type (CPU vs memory tradeoffs). They're bad at where compatibility matters and where it doesn't (and tend to err on the side of preserving it without further guidance). Etc. Unless you have this ALL specified, you can't possibly claim "it works" without taking a look and thinking about it.
English
129
310
3.3K
191.4K
_jerieljan/
_jerieljan/@jerieljan·
"oh nice, a shipping notification for a package I expected last month, that took a while" > shows it WAS shipped last month, had a timeline already and appears stuck somewhere in last-mile... fifteen days ago ...what the fu
English
0
0
0
15
_jerieljan/
_jerieljan/@jerieljan·
Funny thing about the fake Mistral announcements is how obviously fake but also plausible they are. Mistral tends to love their Word '95 wordart, if not their orange and pixelblock aesthetic. But... the fat cat and fake benchmarks looks nicer in some ways and it's amusing.
English
0
0
1
22
_jerieljan/
_jerieljan/@jerieljan·
@mitsuhiko I use Kimi K2.6 a lot before and now both trying out Kimi K2.7 and MiniMax M3. They're both promising, but I'm also not going Opus / GPT-5.5 level type of tasks yet but for the APIs I've built with them, they manage to implement things quite well.
English
0
0
0
315
Armin Ronacher ⇌
Armin Ronacher ⇌@mitsuhiko·
Is anyone doing productive engineering work on open weights models (hosted or other) today? If yes, what's the model you use and why? How often do you pair it up with SOTA models.
English
99
23
349
47K
_jerieljan/
_jerieljan/@jerieljan·
Been trying Kimi K2.7 and MiniMax M3 So far, so good. K2.7 is an overall improvement to K2.6 so far and it's more eager to test stuff out. MiniMax M3 is also quite good. Never used the models before so I'm pleased it performs well in my cases.
English
0
0
0
63
Guillermo Rauch
Guillermo Rauch@rauchg·
My flight to London is Starlink-enabled 😭 The greatest advancement to air travel since the Wright brothers. God bless America
Guillermo Rauch tweet media
English
80
21
1.8K
112.4K
_jerieljan/
_jerieljan/@jerieljan·
I won't be surprised these AI labs will someday use gacha company mentality and energy mechanics. To me, they already feel that way. They already obfuscate costs with the obscurity of tokens and a vague HP bar for limits.
OpenAI@OpenAI

We heard you wanted to use Codex rate limit resets on your own time. Starting today, we’re rolling out the ability to save rate limit resets to use later. We’re starting Go, Plus, Pro, and Business users with one free reset:

English
0
0
0
18
_jerieljan/
_jerieljan/@jerieljan·
@thdxr > gangprompting ...new jargon unlocked. I've never heard of that before.
English
0
0
0
170
dax
dax@thdxr·
i am having more fun than ever collaborating with people get an agent in the mix, we all bounce ideas off each other. prompt it, get more ideas. narrow in on something stop dooming about ai and your job and start gangprompting
English
83
47
1.3K
105.1K
_jerieljan/
_jerieljan/@jerieljan·
Bricks and Minifigs basically went for the mutually assured destruction button, eh? Literally going nuclear. The courts are going to be ugly on all sides no matter what imho.
English
0
0
1
126
_jerieljan/
_jerieljan/@jerieljan·
@theo It takes three months on top of the original wait before any doctor gets involved? That's nuts. And wtf, why are appointments tied to the employee inbox instead of the institution's triage system or even the doctor's setup.
English
0
0
1
2.7K
Theo - t3.gg
Theo - t3.gg@theo·
If you're in the Bay Area and you're looking for a concierge doctor/medical service, do not work with Private Medical SF. I've never experienced such a level of incompetence in my life. We scheduled a call over two months ago for today at 2pm. I tried to join and the zoom link was dead. They had terminated the employee who scheduled it. Best they could do is schedule a new call in SEPTEMBER. Here's a brief list of things they failed to do after terminating that employee: - Invite the actual doctor to the meetings she scheduled - Close her Google Workspace account - Close her Gmail inbox (instead they left it open unmonitored) - Audit her work to find anything she dropped - Reschedule any conflicting meetings she booked - Remove her from the website They did spend a lot of time blaming her for it, even though the failure seems to squarely be on them. I was supposed to start care with them in the next couple weeks. They said this will delay me to September at earliest. They cited a "massive influx of interest from new potential patients". I hope this post helps reduce that influx.
English
45
16
1.5K
181.4K
_jerieljan/
_jerieljan/@jerieljan·
This is one of the annoying trends in software lately. Update nags and pimples that take inspiration from Windows updates. Mac apps figured the Sparkle approach ages ago and somehow they're poorly reinventing them with their own popups and call to action chips. Hate it so much.
@levelsio@levelsio

@TermiusHQ this popup every single day is super annoying, it's great you ship a lot but I am working and I don't want to every day close and re-open all my tabs!

English
0
0
0
38