paneerchilli65

3.8K posts


@paneerchilli65

grind feels like melamine | 2x |

bbsr · Joined June 2024
1.5K Following · 1.9K Followers
shydev
shydev@shydev69·
people who follow shydev in india get no izzat (respect) no matter how good they look. solving dsa questions is an achievement here.
English
6
1
61
1.6K
neural nets.
neural nets.@cneuralnetwork·
I came across this blog from Chip Huyen in 2022 where she discussed the ai courses to do. i feel they are relevant to this day! Here they are -
STATS 202 :- data mining basics
CS109 :- probability foundations
CS231N :- computer vision DL
CS224N :- NLP deep learning
CS229 :- theoretical machine learning
CS221 :- intro artificial intelligence
CS228 :- probabilistic graphical models
CS234 :- reinforcement learning
CS238 :- decision theory RL
CS224W :- graph machine learning
CS246 :- large-scale data mining
CS230 :- applied deep learning
CS236 :- generative models
EE263 :- linear dynamical systems
CS336 :- robot perception
CS103 :- discrete math foundations
CS124 :- NLP basics
CS223A :- robotics fundamentals
i have done a few from here, would recommend
English
10
69
845
24.2K
TickenChikka
TickenChikka@TickenChikkka·
So this happened :/
TickenChikka tweet media
English
1
0
19
383
khushal
khushal@khushaltwt·
I don't even have a GPU to do CUDA on, damn. All I have is a U-series Intel Lenovo laptop 🤡
Indonesia
5
0
23
768
Nabhag - oss/acc
Nabhag - oss/acc@NabhagMotivaras·
@Hi_Mrinal cold emailing doesn't work in 2026. tbh the best alternative to cold emailing these days is to just join their discord server and talk about how to improve the product. it's an easier way to talk directly to the org's entire team and catch their attention.
English
1
0
7
462
himanshu
himanshu@himanshustwts·
RL Env companies running ads to hire talent is timeline to me and i think we gonna see more of this coming
himanshu tweet media
English
7
1
115
5.9K
chuyi shang
chuyi shang@chuyishang·
Wrote a deep dive on implementing a language model from scratch in JAX and scaling it with distributed training! If you’re coming from PyTorch and want to see how the same ideas look in JAX, or just want a hands-on intro to distributed training, check out this blog post: chuyishang.com/blog/2026/jax-… Comes with code + an assignment and test cases so you can follow along!
chuyi shang tweet mediachuyi shang tweet media
English
9
66
602
32.1K
paneerchilli65
paneerchilli65@paneerchilli65·
@kmeanskaran demand forecasting for rl, slms for iot devices and intraday would become very good projects/products 😁
English
0
0
0
538
Karan🧋
Karan🧋@kmeanskaran·
Before it's too late, learn any of these skills to become an elite AI/ML engineer:
- Maths focused AI research
- Data Engineering with distribution
- AI Agents orchestration
- Multi-GPU training
- Inference optimization
- Demand Forecasting with RL
- MLOps and AgentOps
- SLMs for IoT devices
- Deployment on cloud
- System Design for ML
- Data drift detection
- Rollback and task queues for ML
- Backend for AI
- Vision Transformers in IoT
- Quant and Intraday using ML
The most important skill is using a minimal setup with high business impact, more than just ML metrics. Also important is delivering projects in a short time span through iterative releases.
English
7
54
558
19K
silicognition (blue tick here)
silicognition (blue tick here)@silicognition·
what rigs do y'all work on for ai research/dev? i'll start: 3090 Ti 24GiB VRAM on lab machine, 4090 8GiB VRAM on my laptop.
silicognition (blue tick here) tweet media
English
5
0
8
592
paneerchilli65
paneerchilli65@paneerchilli65·
next on my list is training with muon from actual MuonWithAuxAdam and then with Muon with AdamW
English
0
0
2
78
freshlimesofa
freshlimesofa@freshlimesofa·
I did something interesting today. A recent project that I worked on had a Text-to-SQL agent. While testing locally using Ollama, we used a 3B version of Llama that failed pretty badly at SQL gen despite a pretty good prompt (an 8B version did work tho). I took the 3B model, prepared a dataset of natural language vs SQL with respect to our database schema, and decided to perform SFT with LoRA. The result was pretty good: it generated executable SQL queries apt to the schema. Will download the GGUF and hook it up to the application tomorrow, should work !!
freshlimesofa tweet media
English
8
1
39
777
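A rough sketch of the dataset step described in the post above: pairing natural-language questions with SQL for a fixed schema, and keeping only pairs whose SQL actually compiles against that schema. Everything here is an assumption for illustration. The schema, the prompt layout, and the helper names `is_executable`, `to_sft_record` and `build_dataset` are hypothetical, and SQLite stands in for whatever database the original project used. It is not the author's pipeline and it does not cover the LoRA training itself.

```python
import json
import sqlite3

# Hypothetical schema, used only for illustration.
SCHEMA = """
CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, city TEXT);
CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL);
"""


def is_executable(sql: str, schema: str = SCHEMA) -> bool:
    """Return True if `sql` compiles against an empty in-memory copy of the schema."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.executescript(schema)
        # EXPLAIN prepares the statement, so unknown tables or columns
        # raise an error without needing any rows in the database.
        conn.execute("EXPLAIN " + sql)
        return True
    except (sqlite3.Error, sqlite3.Warning):
        return False
    finally:
        conn.close()


def to_sft_record(question: str, sql: str, schema: str = SCHEMA) -> dict:
    """Format one question/SQL pair as a prompt/completion record (assumed layout)."""
    prompt = (
        f"### Schema\n{schema.strip()}\n\n"
        f"### Question\n{question}\n\n"
        "### SQL\n"
    )
    return {"prompt": prompt, "completion": sql}


def build_dataset(pairs):
    """Keep only pairs whose SQL compiles, formatted as SFT records."""
    return [to_sft_record(q, s) for q, s in pairs if is_executable(s)]


if __name__ == "__main__":
    pairs = [
        ("How many orders are there?", "SELECT COUNT(*) FROM orders"),
        ("Broken example", "SELECT nope FROM orders"),  # unknown column, dropped
    ]
    for record in build_dataset(pairs):
        print(json.dumps(record))  # one JSONL line per kept pair
```

The same `is_executable` check could also be run on the fine-tuned model's outputs as a cheap first evaluation, though a query that compiles is not necessarily the right answer to the question.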
freshlimesofa
freshlimesofa@freshlimesofa·
How do I create good synthetic datasets?
English
4
0
11
474
Shreyas Vaidya
Shreyas Vaidya@shreyasvaidya23·
At least somewhere I got a top-100 AIR (All India Rank) 🫡
Shreyas Vaidya tweet media
Indonesia
78
4
1K
88.7K