Daniel Kukieła (『 ᴰʰᵃⁿ🄾🅂 』)

1K posts

Daniel Kukieła (『 ᴰʰᵃⁿ🄾🅂 』) banner
Daniel Kukieła (『 ᴰʰᵃⁿ🄾🅂 』)

Daniel Kukieła (『 ᴰʰᵃⁿ🄾🅂 』)

@daniel_kukiela

ML/coding @ruliad_ai and @sentdex, co-author of the Neural Networks from Scratch book - https://t.co/HgvHw2ObbX

127.0.0.1 Katılım Şubat 2010
227 Takip Edilen925 Takipçiler
Daniel Kukieła (『 ᴰʰᵃⁿ🄾🅂 』)
@tszzl Smoke tests, at least for me, actually increased the success rate of implementing features on the first try - the model can spot that something does not work as intended, and correct mistakes without any interference on my side.
English
0
0
0
113
roon
roon@tszzl·
I don’t think I love anything as much as language models love “smoke tests”
English
263
147
4.5K
232.8K
Daniel Kukieła (『 ᴰʰᵃⁿ🄾🅂 』)
I'm trying to follow open-source models for coding and get into self-hosting them with Llama.cpp. I barely set @MiniMax_AI M2.7 with a reasonable 37 t/s, and M3 has been released - a much bigger model, only 10 t/s, so not very reasonable, but still possible to use. And what I see? @xai's GLM 5.2 weights are about to be released. This model is epic from what I heard, but it's even bigger - beyond what I could self-host. And GPU prices are still going in the wrong direction. We finally have capable models with open weights, but it's also getting harder to self-host them.
English
0
0
0
55
Daniel Kukieła (『 ᴰʰᵃⁿ🄾🅂 』)
Interesting. I'll take a look. Thank you! I even contacted them directly previously to try to find out if we can control the screens directly (other than sending full-screen images, which is slow) and if they have any protocol for partial updates, or just straight framebuffer control, and they were not very helpful. I have things related to G1 pretty high on my todo still, including writing my own firmware (I figured out a few things for this already), but maybe now we'll have an easier way? I wonder if this refactor relates to G1s as well.
English
0
0
0
5
Nima Zeighami
Nima Zeighami@NimaZeighami·
At the Even Realities talk at AWE Attendee: “Hey will there be a way to port apps natively to the glasses without using the phone?” Even CEO: “Looking forward, forget about apps man. We’re in AI world now.”
English
3
1
16
1.9K
Daniel Kukieła (『 ᴰʰᵃⁿ🄾🅂 』)
@Sentdex Another reason is that what you send to the internet stays on the internet. Your chats are being read by humans and used in various ways. You should be aware that, for example, your research is no longer yours once you use any of these models within it.
English
0
0
1
107
Harrison Kinsley
Harrison Kinsley@Sentdex·
Too many people on my timeline are talking about why we need OSS AI because they can just turn off the faucet at any time. This is missing the bigger point. We need OSS AI because this AI company literally turned that AI model against you to psychologically mislead you.
English
15
14
154
4.5K
Harrison Kinsley
Harrison Kinsley@Sentdex·
TIL python 3.14 interactive mode has syntax highlighting
Harrison Kinsley tweet media
English
7
0
105
6.7K
Harrison Kinsley
Harrison Kinsley@Sentdex·
Playing with Nvidia Cosmos3 Super models for image and video generation. Here's obligatory Will Smith eating spaghetti. First few renders were pretty boring, so I went all out on an "energetic stuffing face with spaghetti prompt" here. That's some bottomless spaghetti.
English
19
7
103
27.5K
Daniel Kukieła (『 ᴰʰᵃⁿ🄾🅂 』)
I'm trying @NVIDIAAI's Cosmos3 Nano (can't fit the Super variant). After 7 tries with different prompts, negative prompts, couple of hyperparameters and a little bit ot luck with the seed, I got this - does this pass the "horse walking backwards" test?
English
1
0
1
676
Harrison Kinsley
Harrison Kinsley@Sentdex·
I did not appropriately prepare for this day that is apparently already here.
Harrison Kinsley tweet media
English
8
1
70
3.4K
Harrison Kinsley
Harrison Kinsley@Sentdex·
@daniel_kukiela I would imagine it's not likely. Qwen3.6 max never opened the weights afaik, but the smaller variants did
English
1
0
8
1.3K
Harrison Kinsley
Harrison Kinsley@Sentdex·
thx Minimax and Hermes. Let's see if I can survive here and maybe even downgrade further. lately Claude has been most helpful at reading hermes sessions when things aren't working, then producing guidance where it's going wrong and saving as md references for later
Harrison Kinsley tweet mediaHarrison Kinsley tweet media
English
6
1
18
2.9K
Daniel Kukieła (『 ᴰʰᵃⁿ🄾🅂 』)
@Sentdex Yes. The thing is, once guided, these smaller models can just follow instructions and implement things pretty efficiently and accurately - they're really good at this already.
English
0
0
0
13
Harrison Kinsley
Harrison Kinsley@Sentdex·
@daniel_kukiela Yeah I really think we'll find ourselves way more often running smaller local models and only pulling out these larger ones for assistance rather than always boiling the ocean for simpler stuff..
English
1
0
3
107
Daniel Kukieła (『 ᴰʰᵃⁿ🄾🅂 』)
@bitplane @vicentesurraco @Sentdex Yes, StackOverflow if full of direct answers, but it's question-answer only. It's knowledge, not "intuition" and multi-step conversation. And user-generated messages on your model are invaluable for future model improvements in the areas the model is less capable, makes mistakes.
English
0
0
0
18
davidsong
davidsong@bitplane·
@vicentesurraco @daniel_kukiela @Sentdex Every message you send to Claude Code is a message it's learning to send to itself. They're training autonomous agents with higher level planning and steering. We're its future inner monologue. Stack overflow doesn't have step by step task completion data, Claude Code users do.
English
1
0
1
46
Daniel Kukieła (『 ᴰʰᵃⁿ🄾🅂 』)
You do not have to train on this directly, you can extract the mistakes where the user clearly stated them and what the response should be (useful for RL), you can extract user reactions, expectations, tasks, etc. I don't know which plan you are at, but it's not unusual for me to hit the limits. Also depends on the model, context size and if you're getting back to sessions that already expired in the KV cache (which means counting all the history as input tokens) - and these things are not clear about how they set them.
English
2
0
1
25
Vicente Surraco
Vicente Surraco@vicentesurraco·
@bitplane @Sentdex @daniel_kukiela I am unconvinced. First off, AI training on AI code, even when led by a human, would likely lead to a decline in code quality. Second, I (and many others) rarely hit anywhere near usage limits. Perhaps "maxing out" the usage is subsidized, but by who? To some extent, other users
English
1
0
0
18