Ritish

85 posts

Ritish banner
Ritish

Ritish

@Ritish_1618

Machine Learning Practitioner | figuring out.

Katılım Temmuz 2021
570 Takip Edilen9 Takipçiler
Ritish
Ritish@Ritish_1618·
AI productivity can be a con if not managed properly.
English
0
0
0
0
Ritish
Ritish@Ritish_1618·
@civilianBTC Interested | Been working on a game recommendation system. Would be enjoyable to work on something else simultaneously.
English
1
0
0
87
Civilian
Civilian@civilianBTC·
I am hiring! Looking for reply guys to work 4+ hours a day. Would talk privately about pay. Comment below if you’re interested.
English
369
14
182
18.7K
Ritish
Ritish@Ritish_1618·
We all knw abt overfitting, right? Do LLMs overfit? When a user & model get stuck in a loop during a chat, is that a form of overfitn? If not, what'd happen in the same situation w/o the loop? And r hallucinations related to overfitting in any way? PS: Still new to LLMs/GenAI.
English
0
0
0
17
Ritish
Ritish@Ritish_1618·
Thanks to @jahooma for freebuff!! It's been really useful in my workflow, productivity, and consistent discussions around questions along with curiosity driven thoughts. Absolutely recommended for anyone out there!
English
1
0
2
53
Ritish retweetledi
Jae Kyoung Kim
Jae Kyoung Kim@umichkim·
Even when the brain is “at rest,” it is not silent. A new Nature paper suggests that spontaneous brain activity may reflect a critically initialized neural network: stable, but close to the edge of instability.
Jae Kyoung Kim tweet mediaJae Kyoung Kim tweet media
English
7
46
232
19.8K
Ritish
Ritish@Ritish_1618·
@A_K_Nain Curious to know some examples of OSS in ML. Can you tell some?
English
0
0
0
17
Aakash Kumar Nain
Aakash Kumar Nain@A_K_Nain·
If you do not believe in OSS, especially in ML, you are on the wrong side my friend. The reason we made progress so far is only because of OSS in ML
English
4
1
35
5.7K
Ritish
Ritish@Ritish_1618·
Vector databases are really the de facto cornerstone of modern AI applications.
English
0
0
0
85
Yichuan Wang
Yichuan Wang@YichuanM·
The web was never meant to be flattened into text. Yet most web RAG systems start by parsing HTML --- a complex and lossy process. 🔥 Introducing PixelRAG: the first RAG system that retrieves and reads 30M+ web pages as pixels. Instead of extracting text, PixelRAG retrieves screenshots and lets a VLM read them directly. PixelRAG not only preserves visual information, but also outperforms text-based RAG on text-only QA benchmarks by +18.1%. Why? (1) HTML-to-text conversion often discards layout, structure, tables, and other useful signals. (2) We continued pretraining a VLM on web page screenshots and turned it into a surprisingly strong visual retriever. (3) Recent VLMs are remarkably good at understanding web pages, often with better accuracy and token efficiency than text-only pipelines. Takeaway: HTML parsing may be one of the biggest self-inflicted bottlenecks in web RAG. Demo below 👇 Code: github.com/StarTrail-org/… Paper: github.com/StarTrail-org/… Playground: pixelrag.ai
English
25
117
697
73.8K
Ritish
Ritish@Ritish_1618·
@Old_But_Gold50s These are music sheets, right?? If so, how can one start to learn them? Even more broadly how to start with music theory? Just all things music for me to be able to do cool stuff at the intersection of AI and music.
English
2
0
2
1.9K
Ritish
Ritish@Ritish_1618·
@TheMingjie Waterloo guys are really cracked af
English
1
0
6
842
Ritish
Ritish@Ritish_1618·
@freshlimesofa Thoughts, agreed. Checkout Expert Systems. Maybe rings a bell!
English
0
0
1
15
Harsh.
Harsh.@freshlimesofa·
Just thinking out loud but, I think the world is bored with LLMs and we're soon going to hit saturation. Look although I haven't gone really deep into newer architectures that are coming up and have been a little out of touch with deep learning, but I do feel proposals like JEPA and world modelling are more impactful and provide a greater meaning than doubling down on LLMs and post training them to ace a specific benchmark just so it can vomit out probabilistic representations. How far are we going to get with this ? The implementation of this into different industries and solutions at an enterprise is already almost done. In fact we took LLMs and made agents out of them, that's the intersection of the architecture with core engineering principles, Isn't this saturation already. However, It isn't the saturation of machine intelligence, it's just that the low hanging fruit of next token prediction is harvested. We've plateaued. Tldr : Random thoughts.
English
4
0
9
322
Ritish
Ritish@Ritish_1618·
@dejavucoder Ask questions. Experiment. That's all!! Follow your curiosity is the take away (driven by the understanding of right and wrong). Consequently, definitely try computational thinking.
English
0
0
0
17
sankalp
sankalp@dejavucoder·
what are strategies to become better at problem solving, idea generation and thinking a bit out of the box as someone who has a small working memory
English
3
0
8
1.5K
Ritish retweetledi
Deep-ML
Deep-ML@real_deep_ml·
"Download a paper. Implement it. Keep doing this until you have skills." - George Hotz one of the main core ideas behind Deep-ML
English
6
70
987
26.1K
Ritish
Ritish@Ritish_1618·
@waghweb Then, never deviate from what you are building. These are the rat race people.
English
0
1
1
40
Mandar Wagh
Mandar Wagh@waghweb·
@Ritish_1618 People are telling us to focus on slop products instead of the foundational work we're trying to build.
English
1
1
1
123
Mandar Wagh
Mandar Wagh@waghweb·
We will prove our doubters wrong by building a company that's generational
English
5
3
18
4.3K
Ritish retweetledi
Paul Graham
Paul Graham@paulg·
The most important component of writing clearly is simply to have high standards for clarity. Then if you write something unclear, you notice, and ask: what did I mean to say? You can just keep doing this over and over. And if you have high standards for clarity, you will.
English
203
390
4.6K
220.9K
Ritish
Ritish@Ritish_1618·
Using Harness and agents is really fun. Outsourcing the work a bit; not thinking and understanding. Ultimately, it's all collaboration, but with scrutiny.
Ritish tweet media
Born to gamble@borntogambles

He defended a PhD in mechanical engineering and walked away with an $800,000 grant. The grunt work was done by Claude Code on Opus 4.8. Not the dissertation. The work underneath it. The literature that takes six months, he cleared in a weekend. Dropped hundreds of PDFs into Claude Code and asked it to surface the gaps, the contradictions, the open questions in his field. It handed back a map of what nobody had proven yet. The simulations that eat a semester, he built in one night. "Write a solver for this stress model, sweep the parameters, give me the plots." The code ran clean on the first pass. Then the real move. He didn't write chapters one by one. He fed it his data, his results, his structure, and asked for a draft he could tear apart and rewrite. Three hundred pages of skeleton by morning. He sat down and made it his own. The university runs hundreds of researchers. Their labs spin the same loop for years: read, compute, write, repeat. He ran that loop alone, between classes and dinner. The unlock isn't that AI wrote the text. The unlock is that AI stripped out everything that wasn't thinking. The reading, the sorting, the code plumbing, that's what eats a doctorate. He handed it to the machine and kept the science for himself. Open Claude Code today. Drop your PDFs and ask it to find the hole in your field. Your model, and ask for the solver. Your data, and ask for the draft. The grant is his. The time was handed back by the agent. He closed the loop in one night. You're still opening your twentieth tab.

English
0
0
1
259
Ritish
Ritish@Ritish_1618·
@Harshit77406528 Suggest some which are specifically good at complex tasks. TIA!
English
1
0
0
186
Ritish
Ritish@Ritish_1618·
Wtf? Codex with Go subscription resets monthly now?? Instead of weekly.
Ritish tweet media
English
1
0
6
1.7K
Zuzanna Stamirowska
Zuzanna Stamirowska@zuzanna_pathway·
“We have not yet had a PageRank moment for intelligence.” We’ve got so many comments and questions about this statement delivered by @adrian_pathway during our recent Transformer vs Post-Transformer debate with @lukaszkaiser @YesThisIsLion @mlech26l - thanks! Let’s dig into it. In the 1990s, web search already existed. We could index information. AltaVista existed. The web was growing fast. Then PageRank happened. That moment combined three things: 1. A simple but deep mathematical idea: treat the web as a giant graph and compute a stationary distribution of a *random walk* on that *graph* 2. A scalable implementation: large-scale graph computation on huge clusters 3. A company that integrated and scaled the idea end-to-end: Google That combination gave search a much clearer center. It stopped being just a pile of heuristics and started to look more like: here is the mathematical object we need to compute, now let’s build the systems needed to compute it well. Adrian asked Lukasz Kaiser directly whether he sees a PageRank-like idea inside the Transformer. Lukasz said no. For intelligence, we still do not have that kind of unifying operator or process. We do not yet have an agreed mathematical object that says: this is the core computation behind it. That missing unifier is what Adrian meant by the absent “PageRank moment for intelligence.” That is also the main idea behind our work on BDH, our Post-Transformer architecture. We are after that fundamental “platform discovery” for intelligence. The full Transformer vs Post-Transformer debate is a good place to go deeper on these topics. Link below.
Zuzanna Stamirowska tweet media
English
2
5
14
382