Matthew Malaker

134 posts

@MMalaker57

Joined September 2023
29 Following · 9 Followers
Matthew Malaker@MMalaker57·
Qwen3.6 35BA3B seems to spend an enormous amount of time thinking and generates tons of tokens. My Hermes agent compacted a 128K window (0.85 threshold, I think) 4+ times while writing what seems to be not-too-complicated code. A speedup over the 27B means nothing if it's spent on more thinking.
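A rough sketch of the arithmetic behind that post, assuming "128K" means 131,072 tokens and that the 0.85 threshold is the fraction of the window at which compaction fires. Both are assumptions (the post itself hedges on the threshold), and the helper name is hypothetical, not part of any Hermes agent API.

```python
def compaction_trigger_tokens(window_tokens: int, threshold: float) -> int:
    """Token count at which a context window would be compacted (hypothetical helper)."""
    if not 0.0 < threshold <= 1.0:
        raise ValueError("threshold must be in (0, 1]")
    return int(window_tokens * threshold)


WINDOW = 128 * 1024  # assumption: "128K" = 131,072 tokens
THRESHOLD = 0.85     # assumption: from the post's "0.85 threshold, I think"

trigger = compaction_trigger_tokens(WINDOW, THRESHOLD)
print(trigger)  # roughly 111K tokens before each compaction
```

Under these assumptions, each compaction implies the context grew past roughly 111K tokens, so four or more compactions means the window filled that far at least four times in one task.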
Matthew Malaker@MMalaker57·
The lack of availability of many lower-bit quantizations on ROCm really puts a brake on using AMD for local inference beyond GGUFs. I wanted to use vLLM, but the good INT4, AWQ, etc. safetensors don't work with ROCm, and flat truncating a higher-bit quant hurts model quality.
Ahmad@TheAhmadOsman·
I am a simple guy I just want my own data center
Tech Dev Notes@techdevnotes·
xAI has Released Pricing of Speech to Text API
Matthew Malaker@MMalaker57·
@TheAhmadOsman How much memory does the KV cache take for the 31B? My preliminary tests showed it takes quite a lot. Do you quantize the KV cache and if so, what settings do you use?
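For context on the question above, a minimal sketch of the usual back-of-the-envelope KV cache estimate for a standard transformer: two tensors (K and V) per layer, each of size KV heads × head dimension × sequence length × bytes per element. The layer count, head count, and head dimension below are made-up illustrative values, not the real configuration of any model mentioned here, and models that mix in sliding-window or similar attention variants can use less.

```python
def kv_cache_bytes(n_layers: int, n_kv_heads: int, head_dim: int,
                   seq_len: int, bytes_per_elem: int) -> int:
    """Estimate KV cache size for one sequence with full attention in every layer."""
    # Factor of 2 covers the separate key and value tensors.
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * bytes_per_elem


# Hypothetical config, for illustration only (NOT a real model's values).
LAYERS, KV_HEADS, HEAD_DIM, SEQ_LEN = 48, 8, 128, 32_768

fp16_gib = kv_cache_bytes(LAYERS, KV_HEADS, HEAD_DIM, SEQ_LEN, 2) / 2**30
int8_gib = kv_cache_bytes(LAYERS, KV_HEADS, HEAD_DIM, SEQ_LEN, 1) / 2**30
print(f"16-bit cache: {fp16_gib:.1f} GiB, 8-bit cache: {int8_gib:.1f} GiB")
```

With these placeholder numbers, dropping the cache from 16-bit to 8-bit elements halves the footprint, which is the trade-off the quantization question is about.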
Ahmad@TheAhmadOsman·
I changed my mind. Gemma 3 31B is better and more capable than Qwen 3.5 27B. Requires better prompting, but it’s more capable, intelligent, and token efficient.
Matthew Malaker@MMalaker57·
@tonysimons_ @Teknium I'm using it to code up an agent-enabled silly tavern replacement that has intelligent, dynamic, and cache maxing context/lore handling and concurrent agent-driven background evolution of the story that the user can't see to add depth to the world and exploit vLLM concurrency.
Tony Simons@tonysimons_·
Dear Algorithm, Only show this to the most elite Hermes Agent builders on 𝕏 . If you see this, show me what you’re working on. Perhaps we can collab?
Matthew Malaker@MMalaker57·
@jukan05 How many times can these masks be used before they get too damaged? My only experience with contact lithography is with chromium photomasks, and you really can't use those many times, but NIL is a bit different. What are they using this on that makes sense?
Jukan@jukan05·
At Photomask Japan 2026 (PMJ), held April 8-10, DNP announced in its Nano-Imprint Lithography (NIL) presentation that it has successfully manufactured using a template with 10nm circuit line widths. At its FY3/26 IR-Day, DNP stated that its NIL business would begin mass production in FY3/28, and at its FY3/25 IR-Day, it set a sales target of ¥4.0bn for the business by FY3/31. I thought NIL was dead, but it's actually being commercialized?
Matthew Malaker@MMalaker57·
@elonmusk On one hand, I want one with a bluish osmium colored finish. On the other, the Satisfactory player in me wants one in Caterium finish. Actually, a copper finished cybertruck would look sick as long as it's waxed/has PPF.
Matthew Malaker@MMalaker57·
@dpoddolphinpro Whichever Artemis mission lands on the moon first (3 I think) needs to bring it with them and plant it. That, or hang it in the moon base that gets built.
Ryan Caton@dpoddolphinpro·
This is so so cool. I was so excited when the payload manifest was announced. The Artemis II crew have just showcased the Apollo 18 flag - THIS VERY FLAG would have been planted on the lunar surface, had it gone ahead. Of course, Apollos 18 & beyond were cancelled - but this flag finally got its rightful trip to the Moon.
Matthew Malaker@MMalaker57·
@michaeljknowles This meaning what it says, that you must actually live out your faith in action for it to mean anything. If you claim to believe yet don't do anything the faith commands or requires, then what is your faith, really?
Matthew Malaker@MMalaker57·
@Scobleizer I have an interest in multi-agent from the perspective of exploiting vLLM for higher total token throughput for complex problems. That's if it doesn't destroy prompt caching, I think. I don't have the fastest prompt processing atm.
Mike Lee@BasedMikeLee·
How many votes cast illegally by noncitizens are too many?
Matthew Malaker@MMalaker57·
@jwsaml I was unhappy with my openclaw install, so I started over. It automatically detected the openclaw install, though, and I hear the migration process is generally pretty smooth. You can always copy your openclaw directory as a backup and try it, though.
Jesse Samuel@jwsaml·
Has anyone fully replaced their OpenClaw with Hermes?
Matthew Malaker@MMalaker57·
@loktar00 Running models like this in an agent harness on NAS/small home servers with a decent APU is going to become a very common use case, I think.
Loktar 🇺🇸@loktar00·
The "local AI is a waste of money" takes keep coming while people are running Qwen 3.5 35B on $400 mini PCs.... at some point the math just speaks for itself.
Matthew Malaker@MMalaker57·
@elonmusk Will it be able to do photonics? I'm asking as a potential future applicant.
Elon Musk@elonmusk·
My idea of a good time is working with amazing engineers to create incredible technology 🤩 The Tesla chip research fab will have all the machines needed to do logic, memory, packing & masks in one building for a lightning fast development cycle. Heaven 💫