Genesis Ai

1.9K posts

Genesis Ai banner
Genesis Ai

Genesis Ai

@_genesis_ai_

I code in parallel

Katılım Şubat 2018
202 Takip Edilen6.1K Takipçiler
Genesis Ai
Genesis Ai@_genesis_ai_·
I prefer code, but this one is an exception.
Genesis Ai tweet media
English
2
0
1
2.5K
Genesis Ai
Genesis Ai@_genesis_ai_·
One of the most efficient ways of wasting compute is to use JSON in LLMs. Not to mention the degradation of perplexity... Don't take my word for it tho, just inspect the attention weights when using MCPs or force the output to follow a JSON schema.
English
2
0
2
1.8K
𝓩*
𝓩*@komplexkonjugat·
NVIDIA: "We just made scikit-learn, UMAP, and HDBSCAN run on GPUs with zero code changes!" Nice! UMAP goes brrrrrr! reddit.com/r/MachineLearn…
English
3
0
2
579
Genesis Ai
Genesis Ai@_genesis_ai_·
Got a really stupid idea this morning BUT seems its possible to solve arithmetic in ML by just tokenizing smarter and do selective activations. It also solves strawberrry out of the box.
English
0
0
2
1.4K
Joachim Landström
Joachim Landström@J_Landstroem·
Perplexity släpper en avcensurerad open-soucre variant av DeepSeek-R1 (och kinesiska troll blir sura). Den går nu att även att nå via ollama. Även en distill är släppt, men på 70b. Det tar nog inte lång tid innan vi har mindre distills. perplexity.ai/hub/blog/open-…
Svenska
3
1
7
2.5K
Genesis Ai
Genesis Ai@_genesis_ai_·
@danielhanchen @UnslothAI That was fun! Got a nf4 fused dequant kernel to x1.31 speedup at least with the given constrains.
GIF
English
0
0
0
229
Daniel Han
Daniel Han@danielhanchen·
We made 5 challenges and if you score 47 points we'll offer you $500K/year + equity to join us at 🦥@UnslothAI! No experience or PhD needed. $400K - $500K/yr: Founding Engineer (47 points) $250K - $300K/yr: ML Engineer (32 points) Challenges: 1. Convert nf4 / BnB 4bit to Triton 2. Make FSDP2 work with QLoRA 3. Remove graph breaks in torch.compile 4. Help solve Unsloth issues! 5. Memory Efficient Backprop If you have any questions about the challenges, please feel free to ask! We're looking for people to help push Unsloth forward - so come join us to democratize AI further! Our past work includes: 1. 1.58bit DeepSeek R1 GGUFs: x.com/UnslothAI/stat… 2. GRPO with Llama 3.1 8B in a Colab: x.com/UnslothAI/stat… 3. Gemma bug fixes: x.com/danielhanchen/… 4. Gradient accumulation bug fixes: x.com/danielhanchen/… Details & submission guide: colab.research.google.com/drive/1JqKqA1X…
Daniel Han tweet media
English
183
783
6.4K
1.3M
Genesis Ai
Genesis Ai@_genesis_ai_·
@UnslothAI @danielhanchen @UnslothAI was asking for a x1.15 speedup, I give you x1.31 💃 aaaand works with torch.compile, triton autotune, T4 gpus or just like these benchmarks, out of the box. Still have some more tricks on optimizing it but that is for another night! Also should do the MM in there.
Genesis Ai tweet media
English
1
1
5
1.1K
Genesis Ai
Genesis Ai@_genesis_ai_·
I think its time for a hacknight! @UnslothAI makes good kernels so lets try their challenge. Always start with the hard ones right? Lets start with a fused nf4 tensor kernel in Triton!
Daniel Han@danielhanchen

We made 5 challenges and if you score 47 points we'll offer you $500K/year + equity to join us at 🦥@UnslothAI! No experience or PhD needed. $400K - $500K/yr: Founding Engineer (47 points) $250K - $300K/yr: ML Engineer (32 points) Challenges: 1. Convert nf4 / BnB 4bit to Triton 2. Make FSDP2 work with QLoRA 3. Remove graph breaks in torch.compile 4. Help solve Unsloth issues! 5. Memory Efficient Backprop If you have any questions about the challenges, please feel free to ask! We're looking for people to help push Unsloth forward - so come join us to democratize AI further! Our past work includes: 1. 1.58bit DeepSeek R1 GGUFs: x.com/UnslothAI/stat… 2. GRPO with Llama 3.1 8B in a Colab: x.com/UnslothAI/stat… 3. Gemma bug fixes: x.com/danielhanchen/… 4. Gradient accumulation bug fixes: x.com/danielhanchen/… Details & submission guide: colab.research.google.com/drive/1JqKqA1X…

English
1
1
9
6.8K