Ritish

85 posts

Ritish

@Ritish_1618

Machine Learning Practitioner | figuring out.

Katılım Temmuz 2021

570 Takip Edilen9 Takipçiler

Ritish@Ritish_1618·1h

AI productivity can be a con if not managed properly.

English

Ritish@Ritish_1618·22h

@civilianBTC Interested | Been working on a game recommendation system. Would be enjoyable to work on something else simultaneously.

English

Civilian@civilianBTC·22h

I am hiring! Looking for reply guys to work 4+ hours a day. Would talk privately about pay. Comment below if you’re interested.

English

369

182

18.7K

Ritish@Ritish_1618·1d

We all knw abt overfitting, right? Do LLMs overfit? When a user & model get stuck in a loop during a chat, is that a form of overfitn? If not, what'd happen in the same situation w/o the loop? And r hallucinations related to overfitting in any way? PS: Still new to LLMs/GenAI.

English

Ritish@Ritish_1618·3d

Thanks to @jahooma for freebuff!! It's been really useful in my workflow, productivity, and consistent discussions around questions along with curiosity driven thoughts. Absolutely recommended for anyone out there!

English

Ritish retweetledi

Jae Kyoung Kim@umichkim·5d

Even when the brain is “at rest,” it is not silent. A new Nature paper suggests that spontaneous brain activity may reflect a critically initialized neural network: stable, but close to the edge of instability.

English

232

19.8K

Ritish@Ritish_1618·6d

@A_K_Nain Curious to know some examples of OSS in ML. Can you tell some?

English

Aakash Kumar Nain@A_K_Nain·8 Eyl

If you do not believe in OSS, especially in ML, you are on the wrong side my friend. The reason we made progress so far is only because of OSS in ML

English

5.7K

Ritish@Ritish_1618·6d

Vector databases are really the de facto cornerstone of modern AI applications.

English

Ritish@Ritish_1618·12 Haz

@YichuanM Interesting.

English

Yichuan Wang@YichuanM·10 Haz

The web was never meant to be flattened into text. Yet most web RAG systems start by parsing HTML --- a complex and lossy process. 🔥 Introducing PixelRAG: the first RAG system that retrieves and reads 30M+ web pages as pixels. Instead of extracting text, PixelRAG retrieves screenshots and lets a VLM read them directly. PixelRAG not only preserves visual information, but also outperforms text-based RAG on text-only QA benchmarks by +18.1%. Why? (1) HTML-to-text conversion often discards layout, structure, tables, and other useful signals. (2) We continued pretraining a VLM on web page screenshots and turned it into a surprisingly strong visual retriever. (3) Recent VLMs are remarkably good at understanding web pages, often with better accuracy and token efficiency than text-only pipelines. Takeaway: HTML parsing may be one of the biggest self-inflicted bottlenecks in web RAG. Demo below 👇 Code: github.com/StarTrail-org/… Paper: github.com/StarTrail-org/… Playground: pixelrag.ai

English

117

697

73.8K

Ritish@Ritish_1618·7 Haz

@Old_But_Gold50s These are music sheets, right?? If so, how can one start to learn them? Even more broadly how to start with music theory? Just all things music for me to be able to do cool stuff at the intersection of AI and music.

English

1.9K

🎶𝗖𝗹𝗮𝘀𝘀𝗶𝗰𝗮𝗹 𝗠𝗲𝗹𝗼𝗱𝗶𝗲𝘀 ✨@Old_But_Gold50s·6 Haz

What a beautiful visualization

English

1.7K

8.2K

427.4K

Ritish@Ritish_1618·6 Haz

@TheMingjie Waterloo guys are really cracked af

English

842

Jerry Jiang@TheMingjie·6 Haz

Waterloo guy btw

Arjun Virk@virkvarjun

I just spent months handwriting a 200 page guide on the entirety of ML foundations and math from scratch. The guide features: - Neural Nets (Backprop, Adam, SGD, Batch Norm) - ML Algorithms (SVM, Grad Boosting, K-means, PCA) - Hardware (Tensor Cores, Systolic Arrays, CUDA) - Transformers (Multi-Head Attn, KV Cache, LoRA) - Vision (ViT, Convolutions, MAE, IoU, NMS, VLM) - Agents (OpenClaw, ReAct, Memory, Orchestration) Everything I wish I had years ago, for free.

English

974

103K

Ritish@Ritish_1618·6 Haz

@freshlimesofa Thoughts, agreed. Checkout Expert Systems. Maybe rings a bell!

English

Harsh.@freshlimesofa·6 Haz

Just thinking out loud but, I think the world is bored with LLMs and we're soon going to hit saturation. Look although I haven't gone really deep into newer architectures that are coming up and have been a little out of touch with deep learning, but I do feel proposals like JEPA and world modelling are more impactful and provide a greater meaning than doubling down on LLMs and post training them to ace a specific benchmark just so it can vomit out probabilistic representations. How far are we going to get with this ? The implementation of this into different industries and solutions at an enterprise is already almost done. In fact we took LLMs and made agents out of them, that's the intersection of the architecture with core engineering principles, Isn't this saturation already. However, It isn't the saturation of machine intelligence, it's just that the low hanging fruit of next token prediction is harvested. We've plateaued. Tldr : Random thoughts.

English

322

Ritish@Ritish_1618·6 Haz

@dejavucoder Ask questions. Experiment. That's all!! Follow your curiosity is the take away (driven by the understanding of right and wrong). Consequently, definitely try computational thinking.

English

sankalp@dejavucoder·6 Haz

what are strategies to become better at problem solving, idea generation and thinking a bit out of the box as someone who has a small working memory

English

1.5K

Ritish retweetledi

Deep-ML@real_deep_ml·4 Haz

"Download a paper. Implement it. Keep doing this until you have skills." - George Hotz one of the main core ideas behind Deep-ML

English

987

26.1K

Ritish@Ritish_1618·3 Haz

@waghweb Then, never deviate from what you are building. These are the rat race people.

English

Mandar Wagh@waghweb·2 Haz

@Ritish_1618 People are telling us to focus on slop products instead of the foundational work we're trying to build.

English

123

Mandar Wagh@waghweb·2 Haz

We will prove our doubters wrong by building a company that's generational

English

4.3K

Ritish retweetledi

Paul Graham@paulg·2 Haz

The most important component of writing clearly is simply to have high standards for clarity. Then if you write something unclear, you notice, and ask: what did I mean to say? You can just keep doing this over and over. And if you have high standards for clarity, you will.

English

203

390

4.6K

220.9K

Ritish@Ritish_1618·2 Haz

Using Harness and agents is really fun. Outsourcing the work a bit; not thinking and understanding. Ultimately, it's all collaboration, but with scrutiny.

Born to gamble@borntogambles

He defended a PhD in mechanical engineering and walked away with an $800,000 grant. The grunt work was done by Claude Code on Opus 4.8. Not the dissertation. The work underneath it. The literature that takes six months, he cleared in a weekend. Dropped hundreds of PDFs into Claude Code and asked it to surface the gaps, the contradictions, the open questions in his field. It handed back a map of what nobody had proven yet. The simulations that eat a semester, he built in one night. "Write a solver for this stress model, sweep the parameters, give me the plots." The code ran clean on the first pass. Then the real move. He didn't write chapters one by one. He fed it his data, his results, his structure, and asked for a draft he could tear apart and rewrite. Three hundred pages of skeleton by morning. He sat down and made it his own. The university runs hundreds of researchers. Their labs spin the same loop for years: read, compute, write, repeat. He ran that loop alone, between classes and dinner. The unlock isn't that AI wrote the text. The unlock is that AI stripped out everything that wasn't thinking. The reading, the sorting, the code plumbing, that's what eats a doctorate. He handed it to the machine and kept the science for himself. Open Claude Code today. Drop your PDFs and ask it to find the hole in your field. Your model, and ask for the solver. Your data, and ask for the draft. The grant is his. The time was handed back by the agent. He closed the loop in one night. You're still opening your twentieth tab.

English

259

Ritish retweetledi

Scholarship for PhD@ScholarshipfPhd·1 Haz

Believing you slept well can boost your cognitive performance even after a bad night. Mind over matter.

Nicholas Fabiano, MD@NTFabiano

Believing you slept well can boost your cognitive performance even after a bad night. Mind over matter.

English

4.5K

Ritish@Ritish_1618·1 Haz

@Harshit77406528 Suggest some which are specifically good at complex tasks. TIA!

English

186

Harshit Pandit@Harshitp147·1 Haz

@Ritish_1618 Try using chinese models. U get more for the same price 🫡

English

192

Ritish@Ritish_1618·1 Haz

Wtf? Codex with Go subscription resets monthly now?? Instead of weekly.

English

1.7K

Ritish@Ritish_1618·1 Haz

@zuzanna_pathway @adrian_pathway @lukaszkaiser @YesThisIsLion @mlech26l Curios to know if these two fields are comparable? Since Information Retrieval is relatively smaller to Intelligence? However, will listen to the talk.

English

Zuzanna Stamirowska@zuzanna_pathway·28 May

“We have not yet had a PageRank moment for intelligence.” We’ve got so many comments and questions about this statement delivered by @adrian_pathway during our recent Transformer vs Post-Transformer debate with @lukaszkaiser @YesThisIsLion @mlech26l - thanks! Let’s dig into it. In the 1990s, web search already existed. We could index information. AltaVista existed. The web was growing fast. Then PageRank happened. That moment combined three things: 1. A simple but deep mathematical idea: treat the web as a giant graph and compute a stationary distribution of a *random walk* on that *graph* 2. A scalable implementation: large-scale graph computation on huge clusters 3. A company that integrated and scaled the idea end-to-end: Google That combination gave search a much clearer center. It stopped being just a pile of heuristics and started to look more like: here is the mathematical object we need to compute, now let’s build the systems needed to compute it well. Adrian asked Lukasz Kaiser directly whether he sees a PageRank-like idea inside the Transformer. Lukasz said no. For intelligence, we still do not have that kind of unifying operator or process. We do not yet have an agreed mathematical object that says: this is the core computation behind it. That missing unifier is what Adrian meant by the absent “PageRank moment for intelligence.” That is also the main idea behind our work on BDH, our Post-Transformer architecture. We are after that fundamental “platform discovery” for intelligence. The full Transformer vs Post-Transformer debate is a good place to go deeper on these topics. Link below.

English

382

Keşfet

@civilianBTC @jahooma @A_K_Nain @YichuanM @Old_But_Gold50s @TheMingjie @freshlimesofa @dejavucoder