sven2401

3.5K posts

sven2401

sven2401

@sven2401

Katılım Mart 2016
120 Takip Edilen28 Takipçiler
sven2401
sven2401@sven2401·
@TheAhmadOsman Please Tell us what do I really need to service 5-12 people at minimum?
English
0
0
0
15
Ahmad
Ahmad@TheAhmadOsman·
No, you won’t be able to “service 5-12” people with a bunch MacBooks hooked to each other That larper didn’t even know what an MoE is 4 months ago and now he’s acting like an expert and spreading misinformation When did lying become so normalized in this space ffs
English
51
7
211
10.9K
dax
dax@thdxr·
something is going wrong with gpt 5.5 caching doesn't look like much on this chart but this it's now using 2.5x as many input tokens as a week ago and dropping
dax tweet media
English
66
14
713
61K
sven2401
sven2401@sven2401·
@Sentdex Correct but if the anthropic employees say that it’s fine and then you put effort in and get rug pulled that’s frustrating. It wouldn’t be so bad if it would be like that from the beginning. If they cleared all that up in January after they banned opencode usage
English
0
0
0
8
sven2401
sven2401@sven2401·
@lucasmeijer Not sure how implements slash commands, but skills invoked with a command have the additional feature that they are not only a prompt file but can have additional support files in that skill folder the agent can discover if needed.
English
0
0
0
154
Lucas Meijer
Lucas Meijer@lucasmeijer·
i never use skills. almost everything is a pi slash command instead.
English
16
2
113
17.8K
sven2401
sven2401@sven2401·
@jayair Would love to see more stats of what the community is using model wise.
English
0
0
0
133
sven2401
sven2401@sven2401·
@thdxr Do you use manual compaction? When do you compact or simply if the context overflows?
English
0
0
0
24
dax
dax@thdxr·
since compaction is good now i've been keeping a session pinned per PR im working on and reusing it until it merges what's nice about this is i can see the cost of the session and understand what the feature cost me (pinning sessions is under an experimental flag)
dax tweet media
English
52
5
487
32.1K
King Le Breton
King Le Breton@KingBreton78876·
@thdxr What do you mean "Compaction is good now"???? Where's the news, that OpenCode has a compaction that does not cause lobotomy??
English
2
0
4
529
sven2401
sven2401@sven2401·
@bukitsorrento @Howaboua The requests is only an estimate it’s not request based, afaik. You run through „dollar“ too.
English
0
0
1
10
Howaboua
Howaboua@Howaboua·
Clean Opencode uses 13k for "Hi". Pi loaded with skills and extension tools uses 4k. Been a while since I last checked this.
Howaboua tweet mediaHowaboua tweet media
English
9
2
172
21.4K
sven2401
sven2401@sven2401·
@mattpocockuk I added the specific call to it in the prompt for use with GitHub copilot. I can make a grilling session with gpt 5.4 xhigh for just 1 request with the cost equivalent of couple of cents. Need to squeeze as much as possible until end of month.
English
0
0
0
77
sven2401
sven2401@sven2401·
@LottoLabs With API cost one small task can already cost you 1$ as it wastes thinking tokens like crazy. 4h work for 6$ seems cheap in comparison
English
0
0
1
37
Lotto
Lotto@LottoLabs·
Opencode Go update $10 subscription Usage limits are definitely lower than ollama cloud and that makes sense for half the cost but I dug (slightly) deeper tonight I did 4 hours of work today with k2.6 and billed $6.59 that’s out of $30 weekly limit and $60 monthly limit One issue is finding the pricing of the model $/M tokens not listed, and it isn’t API costs. You can see examples of input tokens, output tokens in the screenshot and they average around $35-50/M tokens, this is obviously way above the $3.5-4 for api. At $35-50/m tokens and the limits in place you can get roughly 1-2M tokens out in a month. If you just used API that would cost $3.5-7 a month anyways. Obviously you get access to other models and it’s hosted fairly by opencode etc. but it’s not as crazy a deal as people make it out to be. I still recommend it, especially the first month because you actually are net ahead at $5 minus getting rate limited. Unless I’m missing something with input output costs you basically get $10 equivalent in api anyways.
Lotto tweet mediaLotto tweet mediaLotto tweet mediaLotto tweet media
English
57
12
521
99.4K
0xSero
0xSero@0xSero·
@davis7 Really happy you found it useful I respect your opinions on things.
English
1
0
13
617
Ben Davis
Ben Davis@davis7·
@0xSero helped me setup local models properly and I uh, had no idea these things had gotten this good Are they frontier level? No, but considering this is running on just my 5090 it's remarkably capable First tests on a couple of programming tasks and the qwen 3.6-27b model with no reasoning feels about on par with something like sonnet 4-ish, probably better it's really impressive But also setting up local models isn't easy, I don't know nearly enough to talk much about it yet other than you need to know what you're doing to have a good experience. The out of the box stuff is not nearly as good as setting it up correctly
English
22
5
161
14.4K
sven2401
sven2401@sven2401·
@mattpocockuk And what if you don’t have a ui in the general sense (not programmable at least)? 🤔
English
2
0
0
4.7K
Matt Pocock
Matt Pocock@mattpocockuk·
The more I replace plans with prototypes, the better the outputs Who'd have thought that low fidelity prototypes were better than walls of spec Oh yeah, the entire industry for 20 years Stop going against decades of knowledge because someone in SF shipped it as a 'mode'
dax@thdxr

i never make plans i hate looking at markdown i don't wanna read markdown files i just plan by having it make changes to the code then i look at the code to see what sucks then i prompt again

English
123
115
2K
326.3K
Matt Pocock
Matt Pocock@mattpocockuk·
Mini-realization this morning Whenever you're QA-ing code produced by an AFK agent, you're not just QA-ing the code... ...you're also QA-ing the agent itself. So, fixes must land in both in the same commit.
English
19
12
227
20.3K
sven2401
sven2401@sven2401·
@mattpocockuk Maybe yes as a free course it might even boost your other courses.
English
0
0
0
10
Matt Pocock
Matt Pocock@mattpocockuk·
Sounds mad, but maybe I should just make a course about writing great skills? I.e. for actual life/work productivity, not just dev. Breaking down daily tasks into skills. Turning HITL tasks into AFK ones. Creating a working language with the agent. Feels pretty deep
English
103
23
1.3K
39.7K
sven2401
sven2401@sven2401·
@MrAhmadAwais But that was different. Kimi K2.5 was used hugely by them after its release in January
English
0
0
0
7
sven2401
sven2401@sven2401·
@alikonkret @2000WU6 @sparbuchfeinde Ich weiß nicht wie das in dem Fall ist aber Arbeitszeit und Arbeitszeit. Reisezeit ist nur dann Arbeitszeit für das Arbeitszeitgesetz wenn du hinterm Steuer sitzt. Trotzdem wird man normalerweise für die ganze Zeit bezahlt.
Deutsch
0
0
0
11
Ali Konkret
Ali Konkret@alikonkret·
@2000WU6 @sparbuchfeinde "Ausbeutung" ist mir etwas zu hoch aufgehangen. Es ist halt so in der Leistungsgesellschaft. Erst recht, wenn man ergebnis- und nicht nur verrichtungsverantwortlich ist.
Deutsch
2
0
0
160
sparbuchfeinde
sparbuchfeinde@sparbuchfeinde·
4h zum Kunden fahren, 4h Meeting und Business-Lunch, 4h wieder nach Hause fahren. In jedem Land der Welt ein anstrengender, aber normaler Arbeitstag. In Deutschland ein Verstoß gegen das Arbeitszeitgesetz. Und du bist als Vorgesetzter oder Mitarbeiter direkt in der Mithaftung, wenn etwas passiert. Es nervt einfach nur. Und muss dringend reformiert werden. Wenn Gewerkschaften wirklich etwas an ihren Mitgliedern gelegen wäre, würden sie wegen der absurd hohen Steuerbelastung oder der Rente auf die Straße gehen.
sparbuchfeinde tweet media
Deutsch
170
100
1.6K
93.5K
sven2401
sven2401@sven2401·
@championswimmer The thing is at what cost. Kimi k2.6 throws tokens out of the window like it would be nothing. Compared to K2.5 don’t like that. Cost and time to finish manyfolded.
English
0
0
0
25
Arnav Gupta
Arnav Gupta@championswimmer·
Cursor Composer 2 is fine-tuned Kimi K2.5 Kimi K2.6 can also be basically defined as the same thing? (Essentially sharing the same pre-trained base model lineage + new post training) Cursor hosts the model on Fireworks. So if you use Kimi K2.6 directly on Fireworks yourself, then you are sorted. What is the Cursor moat? Just that it has sold subscriptions and has some captive audience? And benchmarks below is a lesson that it is hard/impossible to beat the model maker themselves at fine tuning their own model (unsurprising).
Arnav Gupta tweet media
English
35
10
285
28.9K
sven2401
sven2401@sven2401·
@dixiidev @r_marked Mainly that comes from physical products. You buy something on Amazon you get it. It’s not what you though you would get you return. Bc you didn’t had the option to really examine the physical product. If you buy in a store that’s not mandatory but many still do it.
English
0
0
0
33
cornball.dev 🐳
cornball.dev 🐳@dixiidev·
@r_marked Why does the EU allow refunds under a 14 day period lmao what's the reasoning behind it
English
8
0
2
7.1K
mark
mark@r_marked·
That's funny, because I can see that you have been creating and cancelling accounts for a while now...
mark tweet media
English
54
4
738
145.5K
sven2401
sven2401@sven2401·
@arrzzt @theo Yeah they stopped it due to „having it stable for the current users“ mid of last month. Now it’s clear what the plan was all along with that. 😅
English
0
0
1
48
Arijit
Arijit@arrzzt·
@theo That’s why they stopped the current subscriptions, I couldn’t purchase their $10 plan today
English
1
0
4
918
sven2401
sven2401@sven2401·
@mkurman88 @burkeholland That is so true. I can basically pay per token on openrouter at OpenAI them selfs or anywhere. The only benefit is to not have to have all the api keys but that openrouter already solves. And yeah you get free autocomplete but that’s not somethings someone is staying for
English
0
0
2
119
Mariusz Kurman
Mariusz Kurman@mkurman88·
@burkeholland If you somehow make it more affordable, I will stay. For now, the planned prices are as high as OpenAI's API. Why should we stick to your subscription if it doesn't give us any profits yet restricts us with an AI token limit that doesn't even accumulate?
Mariusz Kurman tweet mediaMariusz Kurman tweet media
English
1
0
5
533
Mariusz Kurman
Mariusz Kurman@mkurman88·
After the copilot sub becomes useless, I’m fully switched to DeepSeek.
English
6
0
34
5.8K