Fleek

3.5K posts

Fleek banner
Fleek

Fleek

@fleek

Something new coming soon

Katılım Ekim 2018
193 Takip Edilen96.1K Takipçiler
Fleek
Fleek@fleek·
@irnlx83 hoping to release the new product this month. but we don't want to misspeak. once we have better certainty on timelines we will start sharing more publicly
English
4
0
3
613
Fleek
Fleek@fleek·
please excuse the silence. we've been cooking up something cool and are excited to share more details soon
English
13
1
23
2.3K
Fleek
Fleek@fleek·
@xcorat not for long! ⚡️
English
0
0
0
99
Fleek
Fleek@fleek·
NVIDIA just dropped benchmarks showing 4-bit inference loses less than 1 point vs BF16 on most tasks. It's not accuracy per request that you should be measuring. It's tasks completed per dollar. And at that metric, 4-bit wins by a landslide. Read the full blog 👇
Fleek@fleek

x.com/i/article/2016…

English
13
6
27
5.9K
Fleek
Fleek@fleek·
@OpheliaMystic There are several other components to the stack that will be open-sourced in the coming weeks / months. Stay tuned, and keep an eye out on the repos 👀
English
0
0
0
50
Ophelia
Ophelia@OpheliaMystic·
@fleek This looks intriguing! The integration of mdspan with CUTLASS layouts could streamline our CUDA workflow significantly. I appreciate the focus on minimizing overhead while enhancing functionality. Excited to check out the complete example! 🚀
English
1
0
1
55
Fleek
Fleek@fleek·
1/ Yesterday we announced mdspan-cute: C++23 std::mdspan syntax with CUTLASS cute layouts. One header. Zero overhead. Here's how it works 🧵
English
3
5
15
2.2K
Fleek
Fleek@fleek·
7/ Layout algebra is formalized in Lean 4. 26 theorems, 0 sorry. Properties extracted to RapidCheck tests. The art/ directory has 23 SVG visualizations - we drew pictures until we understood.
English
1
0
5
1.4K
Fleek
Fleek@fleek·
💿 Open Source Release 💿 mdspan-cute: a zero-overhead bridge between C++23 std::mdspan and CUTLASS cute layouts. One header. Swizzled memory. No bank conflicts. Read the blog and check out the repo (links in reply)
English
1
1
8
1.6K
Fleek
Fleek@fleek·
@cv_alphas @grok If their broad definition stands, it has implications for other industries including drones, robotics, and more - not just the EVs they claim later in the patent it applies to
English
0
0
1
38
Fleek
Fleek@fleek·
@cv_alphas @grok "Derivative" would mean it's based on prior art. rfl means it IS prior art. Not derived. Identical.
English
1
0
1
38
Fleek
Fleek@fleek·
6/ On "bit augmentation": Log/exp is a bijection. Information in = information out. You can't create precision from a reversible transformation. Thermodynamics doesn't allow it.
English
0
0
2
344
Fleek
Fleek@fleek·
5/ Quantized RoPE already runs in: → LLaMA → Mistral → Most open source inference stacks This isn't obscure. It's foundational.
English
1
0
2
422