mickel

649 posts

mickel

@mickel_25

Heart's Hill No. 352, Saskatch Katılım Aralık 2016

923 Takip Edilen53 Takipçiler

mickel retweetledi

Roland Memisevic@RolandMemisevic·4d

It's hard to believe, but there was a time at which the kernel trick was considered the absolute holy grail of AI, as it made it possible to use convex optimization instead of back-prop and SGD. Incidentally, self-attention and linear RNNs feel weirdly reminiscent of those vibes.

Avi Chawla@_avichawla

Why is the Kernel Trick called a "trick"? (it's a popular ML interview question) Many ML algorithms use kernels for robust modeling, like SVM, KernelPCA, etc. The core objective of a kernel function is to compute dot products in some other feature space (mostly high-dimensional) without projecting the vectors to that space. But how does that even happen? Consider the image below. Let’s assume the following polynomial kernel function: - k(X, Y) = (1+XᵀY)². Also, for simplicity, let’s say both X and Y are two-dimensional vectors: - X = (x1, x2) - Y = (y1, y2) As shown in the image below, simplifying the kernel expression produces a dot product between the two 6-dimensional vectors. This shows that the kernel function we chose earlier computes the dot product in a 6-dimensional space without explicitly visiting that space. And that is the primary reason why we also call it the “kernel trick.” More specifically, it’s framed as a “trick” since it allows us to operate in high-dimensional spaces without explicitly computing the coordinates of the data in that space. RBF kernel is even better in this respect. It lets you compute the dot product in an infinite-dimensional space without explicitly visiting that space. I have shared an article in the comments with a mathematical explanation of the RBF Kernel. ____ Find me → @_avichawla Every day, I share tutorials and insights on DS, ML, LLMs, and RAGs.

English

638

66.3K

mickel retweetledi

Mathematica@mathemetica·3d

5 Mathematically Efficient Fine-Tuning Techniques for LLMs This diagram compares the core math behind: • LoRA – Low-rank decomposition (A ∈ R^{d×r}, B ∈ R^{r×d}) with frozen W • LoRA-FA – Freezes one low-rank matrix during updates • VeRA – Vector-based scaling with fixed d=1 and b=0 • Delta-LoRA – Updates pretrained weights using difference of low-rank products • LoRA+ – Applies asymmetric learning rates to matrices A and B Clear visual breakdown of weight matrices, dimensions, and update rules for parameter-efficient adaptation.

English

186

893

29.3K

mickel retweetledi

Priyanka Vergadia@pvergadia·4d

Research papers you must read for AI Engineer interviews: 1. Attention is all you need (Transformers) 2. LoRA (Low rank adaption) 3. PEFT ( Parameter Efficient Fine Tuning) 4. VIT (Vision Transformers) 5. VAE (Variational Auto Encoder) 6. GANs ( Generative Adversarial Networks) 7. BERT ( Bidirectional Encoder Representation from Transformers) 8. Diffusion Models (Stable Diffusion) 9. RAG (Retrieval Augment Generation) 10. GPT (Generative Pre-trained Transformers)

English

199

1.3K

49.7K

mickel retweetledi

Nikku.Dev ⨀ I’m Nad 🦹‍♂️@itsNikku876·4d

Applications are now open for the Google Data Center Community AI Fellowship 2026 a fully funded global fellowship powered by Google & Watson Institute! This fellowship is designed for students, developers, founders, researchers, and working professionals who want to use AI & technology to solve real-world community challenges. 🌍🤖 Selected fellows will receive: ✨ Fully funded fellowship experience ✨ Leadership & entrepreneurship training ✨ Global networking opportunities ✨ Mentorship from industry experts ✨ Support to scale impactful AI-driven solutions If you’re passionate about AI, startups, innovation, or community impact, this is a massive opportunity to grow globally while building meaningful solutions. 📅 Priority Deadline: July 12, 2026 #AI #Google #Fellowship #Opportunities

English

530

43.4K

mickel retweetledi

Mathilde Papillon🦋 mathildepapillon .bsky .social@mathildepapillo·5d

🏆 The 2026 Topological Deep Learning Challenge is officially live, now in its 4th edition! 🏆 This year’s theme is “Bridging the Gap” between the GNN and TDL worlds. Win incredible prizes including up to $1000 in cash 💸 and AI research internships! Submission deadline: Aug. 1

Mathilde Papillon🦋 mathildepapillon .bsky .social tweet media

English

267

18.8K

mickel retweetledi

Xander@xanderwasserman·5d

ESP32 OTA shouldn’t need an infrastructure team. Upload firmware, create deployments, roll out gradually, track device update status, and update ESP32 devices with SimpleOTA’s lightweight Arduino client library. Built for makers, startups, and small hardware teams.

English

387

mickel retweetledi

Bodhisattwa Majumder@mbodhisattwa·4d

✨ I'm hiring for 2 roles in Asta/AI for Science @allen_ai Research Engineer: RL/post-training for hypothesis generation, long-horizon agents, continual learning PhD Research Intern, Fall 26: Designing new rewards beyond surprise & novelty 🔗 in 🧵. Email me if interested. A brief pitch when contacting would be wonderful. I may not be able to reply to all, but if it aligns, I'll be sure to contact you. Prior relevant experience would be preferred, but is not a necessary condition.

English

179

16.4K

mickel retweetledi

Jahir Sheikh@jahirsheikh8·6d

If I had 6 months to become an AI Infrastructure Engineer. I’d do this. Stage 1 — Linux + Networking Processes, memory, GPUs, sockets, HTTP, TCP/IP basics. Stage 2 — Python + Backend Async Python, FastAPI, queues, concurrency fundamentals. Stage 3 — GPU Fundamentals CUDA basics, VRAM, batching, quantization, throughput. Stage 4 — LLM Inference vLLM, TensorRT-LLM, speculative decoding, KV caching. Stage 5 — Distributed Systems Load balancing, queues, retries, autoscaling, distributed workers. Stage 6 — AI Serving Model APIs, streaming responses, rate limiting, observability. Stage 7 — Data Pipelines Kafka, Airflow, ETL pipelines, vector indexing. Stage 8 — Kubernetes + Cloud Docker, Kubernetes, AWS/GCP basics, infra automation. Stage 9 — Monitoring + Reliability Prometheus, Grafana, tracing, AI cost monitoring. Stage 10 — Real AI Systems Deploy scalable chat apps, RAG pipelines, inference clusters. Stage 11 — Open Source Contribute to inference tooling or AI infra projects. Stage 12 — Apply AI Infra Engineer, Platform Engineer, ML Systems Engineer. AI apps go viral. AI infrastructure prints money.

English

105

843

238.8K

mickel retweetledi

DailyPapers@HuggingPapers·6d

NVIDIA just released AnyFlow on Hugging Face The first any-step video diffusion model that generates high-quality text-to-video with any inference budget - 4 steps or 50, quality scales smoothly without degradation.

English

419

41K

mickel retweetledi

MATS Research@MATSprogram·6d

1/ 🚨 MATS Autumn 2026 applications are now open. 10-week fully-funded fellowship for aspiring AI alignment, security & governance researchers and field-builders. 📍 Berkeley + London 📅 Sep 28 – Dec 4, 2026 💰 $5000/month stipend + $8,000/month compute Apply by June 7 AoE ↓

English

699

117.3K

mickel retweetledi

tut_ml@tut_ml·13 May

Learn LLMs- mltut.com/best-large-lan…

English

241

5.8K

mickel retweetledi

tut_ml@tut_ml·12 May

Best CNN Courses- mltut.com/best-convoluti…

English

122

683

16K

mickel retweetledi

Niyi Abiri, Ph.D 💻🥃@xniyi·12 May

For every research paper you write apply for available travel and research grants on Publeesh.org.

English

277

34.1K

mickel retweetledi

Zhiwen(Aaron) Fan@zhiwen_fan_·13 May

We are working together to launch the Rising Star Award for Spatial Intelligence. Applications are due in 10 days. Thanks to 2077AI for sponsoring a $30K research gift fund to support a PhD student or postdoc advancing spatial intelligence. Apply by May 22: e2e3d.github.io/rising_star.ht… Thanks to ChatGPT for helping turn a simple prompt into this poster :)

English

11.9K

mickel retweetledi

Suraj Sharma@suraj_sharma14·12 May

If you want to master RAG in 2026. Build these projects. Project 1: PDF Chat Application Upload PDFs, ask questions, retrieve context-aware answers, citations, semantic search. Project 2: AI Research Assistant Search across multiple documents, summarize findings, generate references, compare sources. Project 3: YouTube Q&A Engine Convert videos into searchable knowledge bases with transcript retrieval and timestamp citations. Project 4: AI Customer Support Bot Connect company docs, FAQs and support history with contextual responses and memory. Project 5: GitHub Codebase Assistant Ask questions across repositories, retrieve relevant code snippets, explain architecture. Project 6: Multi-Modal RAG System Combine PDFs, images, audio and databases into one unified retrieval pipeline. Project 7: Personal Knowledge OS Build your own second brain with notes, bookmarks, search and AI memory retrieval. Project 8: Agentic RAG Workflow Create autonomous agents that search, retrieve, reason and execute multi-step tasks. Project 9: Enterprise Search Engine Slack, Notion, Google Drive, Confluence retrieval with permission-aware search systems. Project 10: Medical/Legal RAG Assistant Domain-specific retrieval with hallucination reduction, citations and verification pipelines. Most people consume AI content. Builders ship systems.

English

514

20.8K

mickel retweetledi

Mayukh@mayukh_panja·13 May

I don’t agree. A PhD student should not prioritize work-life balance. Getting to do a PhD is a privilege. You are paid to think. There is no pressure for you to be economically useful. It is a unique opportunity to push the boundaries of human knowledge and produce something ground breaking. And nothing great ever happens without complete devotion. Look at everything that moved and shaped the world. Every single person who created anything meaningful, in science, in arts, in music, in movies, devoted their lives to their craft. Extraordinary outcomes require extraordinary inputs and some degree of sacrifice. Sure, have work-life balance during your PhD. But be content a mediocre outcome.

Dr. Manabendra Saharia@m_saharia

Yesterday, I was giving an intro talk to our dept's new PhD students. Technical things aside, my number 1 suggestion has remained the same over the years: Treat your PhD like a job. - Avoid 1.5h lunch and three tea breaks. - Avoid gossiping and loitering at work. - Lab at 9 am and leave at 6 pm. Being productive till 11 pm in the lab is a lie people till themselves when their day starts at 1 PM. Everything worth doing can be done with high intensity focus during work hours. And having fun in life is the secret to being productive in a marathon.

English

359

277

1.1M

mickel retweetledi

Xiuyu Li@sheriyuo·10 May

AI research is already falling into a death cycle. If you do not get an internship at a top lab/company, you cannot access the core techniques or gain real frontier engineering experience. But without those experiences, it becomes almost impossible to pass the resume screening and multiple interview rounds for those same internships. People joke about using Macs for AI, but in reality they are often just better SSH terminals into remote GPU clusters. In frontier labs, the most important thing about an internship is not the payout. What really matters is which team (foundations/data/infra/ToC/...) you are on and how much GPU cluster (have you tried training on 64 GPUs?) access you get. That determines the actual value of the internship for your future research and career. The most advanced models, datasets, and compute resources are increasingly concentrated inside a handful of companies. That concentration is quietly reshaping the entire field.

紫云@dviolettchan

CS used to be a relatively less toxic field because the tools were open and cheap. You could do meaningful research with a laptop, or maybe a single GPU. Those good old days are probably never coming back. (1/3)

English

112

1.5K

123.9K

mickel retweetledi

Jiafei Duan@DJiafei·8 May

Launching my research group, MAGIC (Manipulation and General Intelligence Control) Lab @NUSComputing, Singapore! We focus on building the next generation of human-centric models for robotic manipulation — deployable safely, reliably, and easily in the real world. Our research spans MLLM reasoning, 3D vision, robot learning, simulation, dexterous manipulation, and cross-embodiment learning. Interested in joining? Sign up here and I'll send a reminder email: forms.gle/oJPLR2pLTt8kLC…