vinit Gurjar

1K posts

vinit Gurjar

@bhaktkage

AI @tcs • REPOSTING X

Indore Katılım Ekim 2021

1.7K Takip Edilen233 Takipçiler

vinit Gurjar retweetledi

Ahmad@TheAhmadOsman·1d

INCREDIBLE The MOST COMPLETE GUIDE for understanding LLMs from first principles is now available online to read for free Covers the model mechanics - Tokens / tokenizers - Transformers - Attention - KV cache - Prefill vs decode - Decoding controls - Model packages - Chat templates - Long context - RAG - Agents / tools - Fine-tuning - Multimodal models Then connects that to running models locally - What "local" really means - Open-weight vs opensource - Quantization - VRAM math - Hardware tiers - File formats / load safety - Runtimes / serving modes - Model selection - Privacy - Failure modes - Benchmarks - Practical setup paths You should read this, and if you cannot now then you most definitely wanna bookmark it for later Opensource AI FTW

Ahmad@TheAhmadOsman

x.com/i/article/2057…

English

164

1.2K

172.7K

vinit Gurjar retweetledi

Ahmad@TheAhmadOsman·2d

DROP EVERYTHING The bible for running LLMs locally is now available online to read for free Covers what to use on - Laptop / edge / odd hardware - Mac-first workflows - Single RTX GPUs - 2-4+ NVIDIA / CUDA GPUs - General production serving - Long-context / MoE / routing - NVIDIA max performance - Cluster orchestration Software - llama.cpp - MLX / MLX-LM - ExLlamaV2 - ExLlamaV3 - vLLM - SGLang - TensorRT-LLM - NVIDIA Dynamo You should read this, and if you cannot now then you most definitely wanna bookmark it for later Local AI FTW

Ahmad@TheAhmadOsman

x.com/i/article/2057…

English

234

1.8K

233.6K

vinit Gurjar retweetledi

Turing Post@TheTuringPost·4d

Attention in AI is a dynamic relevance computation in vector space. Here’s what Transformers do step-by-step: 1. Tokens become embeddings that represent meaning. The Transformer starts with embeddings + positional encodings injecting word order information. 2. These vectors are projected into 3 representations: Query (Q): what the token is looking for Key (K): what the token exposes about itself Value (V): the information the token can contribute 3. The model compares queries against all keys, searching for relevant tokens. Higher similarity = higher attention 4. Then the model computes a weighted sum of the value vectors, producing a richer contextual representation for each token. This process repeats across all layers and all tokens, gradually refining contextual representations. But there are some nuances you should know about the attention concept and its computation. Here is your full guide to attention: turingpost.com/p/your-ultimat…

English

117

5.9K

vinit Gurjar retweetledi

Addy Osmani@addyosmani·5d

x.com/i/article/2055…

ZXX

108

597

3.5K

597.4K

vinit Gurjar retweetledi

David Ondrej@DavidOndrej1·6d

Hermes Agent just shipped /goal And this might be the most important feature of 2026 In this 26 min video, i'll show you everything about /goal in Hermes Agent

English

424

19.1K

vinit Gurjar retweetledi

Mr. Buzzoni@polydao·15 May

Atlassian's revenue: $1.79 billion last quarter Atlassian's move: fire the engineer who built their infrastructure his move: post a 38-minute breakdown of every system he built, free for anyone to copy what he revealed: > Envoy proxy instead of enterprise load balancers > sidecar architecture for auth, logging, rate limits > DynamoDB + SQS for async provisioning > Packer + SaltStack for automated VM deployments at scale Atlassian charges per employee across 350,000 customers this guy just handed you the enterprise playbook for free save this

English

392

5.3K

53.8K

20.7M

vinit Gurjar retweetledi

Anatoli Kopadze@AnatoliKopadze·13 May

Godfather of AI: "If you sleep well tonight, you may not have understood this lecture." This 47-minute lecture is the best thing I saw about AI in the last few months. It will definitely help you understand how it actually works and where it's going. Geoffrey Hinton built the neural networks behind every AI alive, then quit Google to warn the world about it. The part nobody wanted to hear: > AI is already developing abilities its creators didn't intend > in most cognitive tasks it's already ahead of us > the question is no longer if it surpasses us but when > the only decision left is which side of that line you're on Right now the average person opens Claude, types something, gets an answer, closes the tab. They think they're using AI. they're using maybe 10% of it. I went through his entire lecture, built a practical system from what he was describing. 18 steps to actually use Claude the right way, with copy-paste prompts that work today. Full guide in the post below.

Anatoli Kopadze@AnatoliKopadze

x.com/i/article/2053…

English

167

9.2K

1.8M

vinit Gurjar retweetledi

AI Engineer@aiDotEngineer·12 May

Hierarchical Memory: Context Management in Agents youtube.com/watch?v=esY99n… Sally-Ann Delucia from @arizeai spent a year building an AI agent that had to analyze the very trace data it was generating. The naive solution was truncation. The obvious solution was summarization. Neither worked. The talk covers the vicious loop, what actually held (head/tail preservation with a retrievable memory store), long session evals, and sub-agents for context that gets too heavy for one conversation. Plus what they found when they went looking for secrets in the Claude Code source release.

YouTube

English

122

6.3K

vinit Gurjar retweetledi

Ahmad@TheAhmadOsman·11 May

If you’re interested in Local AI, I highly recommend reading those 2 articles BEFORE making any hardware purchases Find them under the articles tab on my profile

English

298

13.9K

vinit Gurjar@bhaktkage·6 May

Boycott democracy,🤬

Anuradha Tiwari@talk2anuradha

मैं वर्षों से SC/ST Act के दुरुपयोग के खिलाफ बोल रही हूँ, पर सोचा नहीं था कि एक दिन खुद ही इसका निशाना बनूँगी। मेरे 3 साल पुराने tweets खंगालकर @NCSC_GoI & Delhi Police द्वारा मेरे खिलाफ कार्रवाई की जा रही है। यह एक सोची-समझी साज़िश है- हमारी आवाज़ दबाने की । #SCSTActMisuse

English

vinit Gurjar@bhaktkage·6 May

@talk2anuradha @neha_laldas कानून व्यवस्था सनातन सिद्धांतों के विपरीत है । Boycott the system, Jai shankar jai manuwad 🌞

Anuradha Tiwari@talk2anuradha·6 May

मेरा नाम अनुराधा तिवारी है । मैं उत्तर प्रदेश के एक छोटे से गाँव के ब्राह्मण परिवार से आती हूँ। बचपन से हमेशा यही सुनने को मिला - “तुम ब्राह्मण हो, तुम्हारे पास तो सब कुछ होगा”, जबकि हमारी वास्तविकता संघर्ष और सीमित संसाधनों से भरी थी। बिना किसी Reservation या govt scholarship के मैंने पढ़ाई की, सरकारी कॉलेज से इंजीनियरिंग की, खुद का startup खड़ा किया और विदेश में काम किया। TEDx speaker रही और कई सम्मान प्राप्त किए। एक जिम्मेदार नागरिक के रूप में मैंने हमेशा उन मुद्दों पर आवाज़ उठाई जो आम लोगों को प्रभावित करते हैं - poor infrastructure, food adulteration, freebies, caste-based politics ब्राह्मणों व सवर्णों के खिलाफ बढ़ती नफरत। मैंने कभी किसी समुदाय के खिलाफ नहीं बोला। लेकिन समस्याओं का समाधान निकालने के बजाय अब मेरे 3-4 साल पुराने tweets खंगाले जा रहे हैं। @NCSC_GoI और Delhi Police द्वारा SC/ST Act के तहत कार्रवाई की कोशिश यह दिखाती है कि हालात कितने गंभीर हो चुके हैं। क्या आज अपने विचार रखना भी अपराध है? क्या आज के भारत में सवर्ण होना ही अपराध बन चुका है?

हिन्दी

900

6.1K

15.2K

175.9K

vinit Gurjar retweetledi

Shikhar@shekhu04·4 May

Meet Supriyo Datta (Every chip that uses electron spin instead of electricity, he invented that) > Born in Dibrugarh, Assam, 1954 > B.Tech from IIT Kharagpur, 1975 > PhD from University of Illinois, 1979 > Joined Purdue University in 1981. Never left. > For over 40 years, one man, one university, one obsession > In 1990 proposed the Datta-Das Spin Transistor > The first ever concept for a spintronic switch Instead of using electric current to power devices, use the spin of electrons > A completely new way to think about computing It focused the entire world's attention on a field called spintronics > Which is now at the heart of quantum computing research worldwide > Called "one of the most original thinkers in nanoscale electronics" > Fellow of both IEEE and the American Physical Society > In 2012 elected to the National Academy of Engineering > In 2024 elected to the National Academy of Sciences > One of the very few people ever elected to both A boy from Assam spent 40 quiet years at one university and rewrote how the world thinks about electrons No Silicon Valley. No startup. No noise. Just a desk, a question and 40 years of obsession. "Science emphasizes conceptual advances. Engineering emphasizes practical impact. Nanoelectronics involves both." He lived both. Simultaneously. For four decades

English

476

34.7K

vinit Gurjar retweetledi

Kyros@IamKyros69·4 May

🚨 An Anthropic engineer who wrote “Building Effective Agents” just explained how to actually build agents the right way. 14 minutes. free. no fluff this is how agents are actually supposed to be built. watch it. bookmark it. most developers spend months figuring out what this covers in minutes. then go through the guide below.

Avid@Av1dlive

x.com/i/article/2044…

English

2.3K

vinit Gurjar retweetledi

Aaryan Kakad@aaryan_kakad·2 May

x.com/i/article/2050…

ZXX

10.4K

vinit Gurjar retweetledi

CG@cgtwts·2 May

Mark Cuban just dropped a message every young person should pay attention to AI agents are about to sweep through small and mid-size businesses. But most owners won’t know how to build or use them. His advice: - learn Claude. - learn agentic workflows. - understand how AI actually works. Then go help these businesses, because they’re a lot of opportunities in the real-world.

Rohit@rohit4verse

x.com/i/article/2044…

English

4.9K

vinit Gurjar retweetledi

Karan🧋@kmeanskaran·30 Nis

My LLM Engineering stack: 1. PySpark for Data Engineering 2. HuggingFace datasets for data 3. Unsloth for SFT/RLHF/QLoRA fine-tuning 4. vLLM for Inference optimization 5. FastAPI for backend 6. Redis for cache 7. Qdrant for vector database 8. Celery for task queue 9. Kafka for event-driven scenario 10. Perfect for orchestration 11. LangGraph for agents 12. DeepEval for LLM evaluation 11. Langsmith for LLM observability 12. Ollama for naive LLM calls 13. Postgres for storing data and conversations 14. Docker for containerization 15. AWS for deploying compute-heavy system (Kubernetes cluster) 16. Railway for agentic projects 17. Next JS for UI 18. Prometheus/Grafana for system observability 19. Ray for distributed systems 20. Slack for system alerts What else am I missing?

English

109

860

30.3K

vinit Gurjar retweetledi

Karan🧋@kmeanskaran·28 Nis

"AI will take your job." But These opportunities will rise: - Inference Engineering for LLMs/VLMs - Demand Forecasting with RL Agents - Data Engineering at scale (evergreen) - GTM Engineering & Strategist - AI Product Managers - DevRel for B2B AI - Mathematician for AI Research - Federated/Distributed Engineering - Networking and Security for AI - AGI R&D Engineering - CUDA GPU kernels - Technical Content Creation with human taste - Sales & Marketing (evergreen) - Teaching AI by Humans - Observability in AI - Solo founder with AI team - Quant Finance with ML - Multi-agent system engineering - Cloud Deployment for AI services - LLMs/SLMs in IoT All technical skills require architecture design, business alignment, and distribution. Coding will never be that tough, but understanding the basics is important. Why? So things won’t be a black box for you. Basics mean concepts, not syntax. Learning these is mandatory to stay relevant: 1. Neural Networks 2. Transformers & LLMs 3. RL 4. Inference for LLMs 5. Linear Algebra 6. Distributed Systems 7. Ops 8. Content Creation (text or video) 9. Marketing and Sales 10. Public Speaking Keep basics clean to stay relevant in next tech wave.

English

127

942

34.9K

vinit Gurjar retweetledi

Akhilesh Mishra@livingdevops·30 Nis

> Server + virtualization = VMs > VMs + containerization = Docker > Docker + orchestration = Kubernetes > Kubernetes + GitOps = ArgoCD > ArgoCD + infrastructure as code = a system that deploys itself This is how modern software get shipped

English

206

5.4K

vinit Gurjar retweetledi

Herrington Darkholme@hd_nvim·24 Nis

ZXX

397

6.2K

95.2K

vinit Gurjar retweetledi

Manthan Gupta@manthanguptaa·24 Nis

This resonates a lot with what I have been seeing. I have been moving away from giving agents 40-50 abstracted tools and just letting them write the code they need. What this post shows very clearly is that even "helpers" are abstractions, and the model ends up fighting them more than using them. The raw CDP point is especially interesting. Instead of wrapping everything (click, type, scroll), just expose the lowest level interface the model already understands and let it figure things out. Turns out the "complexity" we try to hide is actually what the model is best at handling. The self-healing loop they describe is an interesting addition. There is a missing helper? The agent just edits the code file and adds it. Something breaks in the code? It reads the error and retries. That's difficult to get when everything is predefined. There is a change coming in the agentic pattern from: define tools -> constrain agent to: Give a minimal environment -> let the agent extend itself And yeah, seeing this work so well for browser use just reinforces what I was saying earlier. LLMs are much better at writing focused code than navigating layers of abstractions.

Gregor Zunic@gregpr07

x.com/i/article/2047…

English

10.3K

Keşfet

@arizeai @talk2anuradha @neha_laldas @NCSC_GoI @elonmusk @BarackObama @taylorswift13 @cristiano