Seriously? No way!

378 posts

Seriously? No way! banner
Seriously? No way!

Seriously? No way!

@imhere_eth

参加日 Mart 2017
448 フォロー中39 フォロワー
Tim McNamara
Tim McNamara@timClicks·
If this is accurate, then NVIDIA's grip on the tech industry has just vanished. Matrix matrix multiplication (MatMul) is notoriously computationally difficult, which is why it's offloaded to GPUs. If MatMul can be avoided, then it's not just leveling the playing field. It's creating new playing fields.
Rohan Paul@rohanpaul_ai

This is really a 'WOW' paper. 🤯 Claims that MatMul operations can be completely eliminated from LLMs while maintaining strong performance at billion-parameter scales and by utilizing an optimized kernel during inference, their model’s memory consumption can be reduced by more than 10× compared to unoptimized models. 🤯 'Scalable MatMul-free Language Modeling' Concludes that it is possible to create the first scalable MatMul-free LLM that achieves performance on par with state-of-the-art Transformers at billion-parameter scales. 📌 The proposed MatMul-free LLM replaces MatMul operations in dense layers with ternary accumulations using weights constrained to {-1, 0, +1}. This reduces computational cost and memory utilization while preserving network expressiveness. 📌 To remove MatMul from self-attention, the Gated Recurrent Unit (GRU) is optimized to rely solely on element-wise products, creating the MatMul-free Linear GRU (MLGRU) token mixer. The MLGRU simplifies the GRU by removing hidden-state related weights, enabling parallel computation, and replacing remaining weights with ternary matrices. 📌 For MatMul-free channel mixing, the Gated Linear Unit (GLU) is adapted to use BitLinear layers with ternary weights, eliminating expensive MatMuls while maintaining effectiveness in mixing information across channels. 📌 The paper introduces a hardware-efficient fused BitLinear layer that optimizes RMSNorm and BitLinear operations. By fusing these operations and utilizing shared memory, training speed improves by 25.6% and memory consumption reduces by 61% over an unoptimized baseline. 📌 Experimental results show that the MatMul-free LLM achieves competitive performance compared to Transformer++ baselines on downstream tasks, with the performance gap narrowing as model size increases. The scaling law projections suggest MatMul-free LLM can outperform Transformer++ in efficiency and potentially in loss when scaled up. 📌 A custom FPGA accelerator is built to exploit the lightweight operations of the MatMul-free LLM. The accelerator processes billion-parameter scale models at 13W beyond human-readable throughput, demonstrating the potential for brain-like efficiency in future lightweight LLMs.

English
109
432
4.5K
1.5M
Seriously? No way!
Seriously? No way!@imhere_eth·
@tldraw Would be great if you allow to parametrize the prompt which framework to use for components generation, not just pure html. Like use React Material UI, etc.
English
0
0
3
399
tldraw
tldraw@tldraw·
GPT-4o plays even better with the fig 1. fig 2. meta
English
11
38
351
53.8K
Seriously? No way!
Seriously? No way!@imhere_eth·
@Noahpinion We may see some improvements between now and Nov. Prob..l back to usual business after that
English
0
0
0
113
Haworth, Inc.
Haworth, Inc.@Haworthinc·
Have you met Fern? You’ll want to. Inspired by nature, it’s a breakthrough in comfort that responds to your every move. But the best part: it's 15% off now.
English
4
2
40
264.4K
Austen Allred
Austen Allred@Austen·
I’m surprised that so much of Silicon Valley has such a negative view of the defense industry. “You’re building weapons!” Yes! What’s the alternative? Do you think the United States refusing to maintain any military presence would usher in more peace in the long run?
English
212
45
1.4K
301.7K
Thomas Godden
Thomas Godden@GoddenThomas·
Three day weekend means I get to let the intrusive thoughts win, just for a bit.
Thomas Godden tweet media
English
2
2
16
4.3K
Joe Lonsdale
Joe Lonsdale@JTLonsdale·
Sad and infuriating. Wacky DA’s - thanks to people like Soros, and other misguided ideologues - have terrified owners from defending themselves or their stores. If thieves saw that clerks could shoot them - without being prosecuted - these crimes would be far more rare!
BAY AREA STATE OF MIND@YayAreaNews

A San Francisco store clerk has died after being attacked by a thief who knocked him down & hit him with a baseball bat while stealing a beer from his store

English
17
64
341
64.7K
Neal Khosla
Neal Khosla@nealkhosla·
I’m as big of an AI bull as there is and have been for about ten years but holy shit most of the biggest grifters I know have become “AI people” and it absolutely terrifies me.
English
43
42
623
133.7K
Seriously? No way!
Seriously? No way!@imhere_eth·
@garrytan Btw, wondering if the Chesa recall made things any better or the new DA is not helping much? It's been a year already.
English
2
0
0
304
Garry Tan
Garry Tan@garrytan·
SF hard leftists (Berniecrats and DSA) are Anti-Asian racists and smearing me with bogus claims of anti-semitism These are the people who created the doom loop: making SF unsafe for my elders. Taking away math from our children. All their electeds must be voted out in 2024. dumpdean.org
Garry Tan tweet media
English
101
98
1.3K
219.5K
Seriously? No way!
Seriously? No way!@imhere_eth·
@NimbyPatrol @Noahpinion Check Google maps, this neighborhood is quite dense already. There are a few 5+ stories buildings around. It doesn't look like something out of place there.
English
0
0
0
50
NIMBY Patrol
NIMBY Patrol@NimbyPatrol·
This building is NIMBY nightmare fuel and I love it. 😍😍 215 Boyleston Ave E, Seattle 98102
NIMBY Patrol tweet media
English
81
107
3.1K
338.3K
Hayden
Hayden@the_transit_guy·
The Las Vegas Council unanimously approved the Boring Company's plan to build 68 miles of Tesla's tunnel (unproven technology) in 15.5 minutes. In the same meeting, they spent 52 minutes discussing whether a homeowner should be allowed to build a vestibule...
Hayden tweet media
English
423
1K
11.1K
1.9M
Seriously? No way!
Seriously? No way!@imhere_eth·
@benedictevans Your prediction would be more sound if we saw other charging networks finally catch up, to make Tesla network non essential.. Imagine the supercharger network has disappeared and we're suddenly many years back with very little progress made.
English
0
0
1
25
Thomas Godden
Thomas Godden@GoddenThomas·
For all the wild hacks people used to do, the best bed adhesion fix I've found is just washing the build plate with soap and water.
English
7
0
11
1.6K
Seriously? No way!
Seriously? No way!@imhere_eth·
@JumpInRE If you double windows surface,it will be ok, but without proper windows almost any structure looks bad.
English
0
0
0
15
Tyler | Kenji Capital
Tyler | Kenji Capital@KenjiCapital·
A new multifamily development just popped up in my neighborhood. Looks like one commercial unit on ground level and a few levels of residential. What do you think of the design?
Tyler | Kenji Capital tweet media
English
212
6
220
227.3K
Maia
Maia@maiamindel·
enough about tipping discourse, now for the REAL issue: why do americans not include taxes in the prices of products
English
80
37
1.1K
197.1K
Seriously? No way!
Seriously? No way!@imhere_eth·
@cpheinrich So how much you expect to be paid by open ai? What about Google, they search info too?
English
0
0
0
55
Chris Heinrich
Chris Heinrich@cpheinrich·
So OpenAI's business model is: 1) Scrape humanity's knowledge for free 2) Sell it back to us at $0.06 per 1k tokens Did I get that right? Such a rip
English
29
6
114
24.3K
Faraz Khan
Faraz Khan@faraz_r_khan·
Request for product: a bulk McMaster-carr hardware buyback program. I literally just needed 2 of these and no they’ll never be useful later.
Faraz Khan tweet media
Oakland, CA 🇺🇸 English
10
0
23
5.3K
Emil
Emil@EmilSkandul·
@kevinamezaga I used the phrase high-speed but yes it is really higher-speed. FL is also the flattest state in the country. Still, taking all of that into consideration...
English
4
1
33
4.5K
Lawrence Faulkner
Lawrence Faulkner@lawfaulkner·
@ZavalaA No matter the cost, the bullet train is a necessity! Many western civilizations have had bullet trains for decades. We are way behind. It will also reduce the carbon footprint and pay itself off over time. We can’t be prisoners of the moment and need to see the bigger picture.
English
17
0
8
2.2K
Ashley Zavala
Ashley Zavala@ZavalaA·
The estimated price tag on California’s bullet train project linking SF to LA is now up to $128 Billion. While there’s no completion date set for that route, the cost for the current segment under construction from Bakersfield - Merced also increased. kcra.com/article/califo…
English
184
105
332
627K