Abhishek Pandey 💎 🙌

9.8K posts

Abhishek Pandey 💎 🙌 banner
Abhishek Pandey 💎 🙌

Abhishek Pandey 💎 🙌

@vishooj

Falun Dafa is Good! | humanrights | wanna be sherlock holmes | #NLP | chess l developer | RT != endorsement.

Bangalore | Allahabad Katılım Nisan 2012
901 Takip Edilen306 Takipçiler
Abhishek Pandey 💎 🙌
Abhishek Pandey 💎 🙌@vishooj·
@arpit_bhayani This should be implemented by @X as well. A single new things breaks out and I get 10 tweets about it on my timeline, with not much different content.
English
0
0
0
259
Arpit Bhayani
Arpit Bhayani@arpit_bhayani·
Say you are building a news aggregator (like Google News). One of the biggest problems you'll face is de-duplicating articles across millions of documents. Naive O(n^2) comparisons will crush you at scale. MinHash + LSH is how you actually solve it. MinHash converts a large set into a small, fixed-size signature, such that the similarity between two signatures approximates the Jaccard similarity of the original sets. Jaccard similarity is simply set intersection divided by set union; a measure of how much two sets overlap. It is a fast, probabilistic way to estimate "how alike are these two documents?" without comparing them word by word. The first step is shingling, where you break each document into overlapping n-grams (say, 3-word sequences), and then run MinHash on that shingle set. MinHash gives you a compact signature, typically 100-200 hash values. The key property is that the probability that two signatures share the same minimum hash value equals the Jaccard similarity of their original shingle sets. This way, you estimate similarity without ever comparing raw text. But you still have the comparison problem. Even with compact signatures, comparing every pair is expensive. That's where LSH (Locality Sensitive Hashing) comes in. You split each signature into b bands of r rows each, and hash each band into a bucket. Two documents that are similar enough will likely land in the same bucket for at least one band, and only those candidate pairs get compared. This approach collapses billions of comparisons down to millions, and it is what systems like Google News and early web crawlers used to deduplicate content at scale. Several Google papers and engineering blogs from the early 2000s reference this exact approach. Pretty simple and neat. As is almost always true at scale, you do not need a perfect similarity detection system. A fast, good-enough one is preferred, given that the cost is ultimately the forcing function.
English
23
38
899
59.2K
mitsuri
mitsuri@0xmitsurii·
Distractions are killing your life.
English
15
330
1.8K
36.4K
Abhishek Pandey 💎 🙌 retweetledi
Steve Magness
Steve Magness@stevemagness·
Our brains are fried. You try to read a book...can’t focus. Sit with loved ones...your mind drifts to work or phone. Feel a buzz in your pocket....but there’s no notification. We’re not just distracted. We’re digitally disoriented. Here’s what’s going on and how to push back:
English
34
286
2.4K
280.9K
Abhishek Pandey 💎 🙌
Hi @nikitabier, could we tailor the feed such a way that if one has seen an info from a tweet, similar tweets from another account doesn't show up? There is lot of ctrl+c and ctrl+v across tweets. @elonmusk @x
English
1
0
0
25
DealzTrendz
DealzTrendz@dealztrendz·
Hey, @grok, who was the most famous person to visit my profile? It doesn't need to be a mutual, don't tag them, just say who it was.
English
93
21
286
100.8K
DealzTrendz
DealzTrendz@dealztrendz·
Get Perplexity Pro FREE for 12 months if you’re an Airtel user! Check the Airtel Thanks app under Rewards or OTT. Available for prepaid, postpaid, and Wi-Fi users.
DealzTrendz tweet media
English
332
175
2.2K
1.5M
Abhishek Pandey 💎 🙌 retweetledi
Falun Dafa Information Center
Falun Dafa Information Center@FalunInfoCtr·
Tomorrow is World Falun Dafa Day! A few days ago, Dafa practitioners and supporters gathered in central Manhattan to celebrate World #FalunDafaDay—marking 33 years since the practice was first introduced to the public in China. Banners proclaiming the values of truth, compassion, and forbearance were accompanied by traditional performances, colorful floats, and a powerful marching band. #FalunDafaDay #May13
English
2
52
165
3.7K
Abhishek Pandey 💎 🙌 retweetledi
Jan Jekielek
Jan Jekielek@JanJekielek·
A Falun Gong practitioner fled his hospital bed in the middle of the night ESCAPING forced organ harvesting. He had a gash on his side. Scans later on proved parts of his liver and lung were missing. 🚨 He’s living proof. The smoking gun. Systematic organ harvesting in China is happening.
English
35
243
509
36K
Abhishek Pandey 💎 🙌 retweetledi
StripMallGuy
StripMallGuy@realEstateTrent·
Opened ChatGPT. Prompt: “Now that you can remember everything I’ve ever typed here, point out my top five blind spots.” Mind. Blown.
English
850
1.8K
33K
5.5M
Firebase
Firebase@Firebase·
Meet Firebase Studio: A cloud-based, agentic dev environment powered by Gemini ✨💻✨ Find everything you need to prototype, build, and run production-quality full-stack AI apps quickly and safely. Learn more about building AI apps with Firebase → goo.gle/4j3MS9v #GoogleCloudNext
English
154
775
4.9K
800.8K
Libs of TikTok
Libs of TikTok@libsoftiktok·
MUST WATCH: Stephen Miller just TORCHED the fake news reporters to their faces
English
2.9K
31.9K
136.3K
3.1M