Jaskaran Singh

2.6K posts


@jasksing

Humanity is Awesome! | Engineer

Joined June 2021
1.2K Following · 178 Followers
Jaskaran Singh reposted
Jaynit
Jaynit@jaynitx·
Terence Tao: "Previously, you needed a PhD to contribute to math research. Now a high school student can."

Dwarkesh asks the world's most famous mathematician: what's your advice for someone considering a career in math, especially in light of AI progress?

Tao is honest about uncertainty: "We live in a time of change. A particularly unpredictable era. Things that we've taken for granted for centuries may not hold anymore. The way we do everything... not just mathematics... will change."

He admits his preference: "In many ways, I would prefer a much more boring, quiet era where things are much the same as they were 10 or 20 years ago. But one just has to embrace this. There's going to be a lot of change. The things you study... some of them may become obsolete or revolutionized. But some things will be retained."

On new opportunities: "Previously, you had to go through years and years of education and get a math PhD before you could contribute to the frontier of math research. But now it's quite possible at the high school level that you could get involved in a math project and actually make a real contribution... because of all these AI tools and Lean and everything else."

His advice: "There will be a lot of non-traditional opportunities to learn. You need a very adaptable mindset. It'll be worth pursuing things just for curiosity and for playing around. Still go through traditional education and learn math and science the old-fashioned way for a while... credentials will still be important. But you should also be open to very, very different ways of doing science. Some of which don't exist yet."

He concludes: "It's a scary time. But also very exciting."
19 replies · 130 reposts · 677 likes · 76.6K views
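For a concrete taste of the "Lean and everything else" Tao mentions: a newcomer's contribution to a formalization project is often just a short machine-checked proof. A minimal, illustrative Lean 4 example (the theorem name is ours, not from the interview):

```lean
-- A tiny machine-checked proof in Lean 4: commutativity of addition
-- on natural numbers, discharged by a lemma from the core library.
theorem add_comm_example (a b : Nat) : a + b = b + a :=
  Nat.add_comm a b
```

Projects like mathlib accept small, self-contained lemmas at roughly this granularity, which is part of why the entry barrier Tao describes has dropped.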
Jaskaran Singh reposted
Reyaa
Reyaa@snr_boost·
@kingofknowwhere Dumb person's idea of a smart person. That doesn't mean he's not smart; he probably is, given his credentials. But it's not his job to know the nitty-gritty of GANs or PCPs
1 reply · 1 repost · 16 likes · 1.5K views
Jaskaran Singh reposted
Ravid Shwartz Ziv
Ravid Shwartz Ziv@ziv_ravid·
New episode of The Information Bottleneck is out, this time with @liuzhuang1234 (Princeton). We talked about ConvNeXt and whether architecture still matters; dataset bias and what "good data" actually looks like; ImageBind and why vision is the natural bridge across modalities; CLIP's blind spots; memory as the real bottleneck behind the agent hype; whether LLMs have world models; and Transformers Without Normalization.

For years, the vision community debated what actually matters: architecture, inductive bias, self-attention vs convolution. After a lot of back-and-forth, we ended up in a funny place: ViT and ConvNet give roughly the same performance once you tune the details. What I find interesting is that once you reach a certain performance level, it becomes much easier to swap and tweak components without really changing the outcome.

Talking to Zhuang on this episode, I kept wondering whether the same is now true for LLMs. If you spent serious time on an alternative architecture today, would you actually get a meaningfully different model, or just land on the same Pareto curve with extra steps? I'm starting to suspect it's the latter. Architecture matters less than we think. Data, compute, and a handful of pillars do most of the work.
5 replies · 14 reposts · 58 likes · 25.5K views
wesley hsieh
wesley hsieh@chengyenhsieh·
This developer has reproduced many classic works, including ViT, AlphaFold3, DDPM, Imagen, and DALL·E. Whenever I want to cross-check the details of a paper with code, I often end up looking at his implementation. On one hand, his work is incredibly impressive from an educational perspective. On the other hand, I rarely see someone who has done so much work yet remains so silent on social media. Lucidrains: github.com/lucidrains
19 replies · 59 reposts · 990 likes · 44.4K views
Paras Chopra
Paras Chopra@paraschopra·
@justalexoki because nothing is impossible to exist (but the real question is why something so specific rather than something else so specific)
16 replies · 0 reposts · 41 likes · 4.9K views
taoki
taoki@justalexoki·
actually why is there something rather than nothing
425 replies · 31 reposts · 778 likes · 80.2K views
Paras Chopra
Paras Chopra@paraschopra·
i'm actually surprised by the replies from people who believe math/physics/cs will never be automated. even current systems are at the level of a grad student, but the anons commenting on human supremacy seem to be living in a cave.
Paras Chopra@paraschopra

What advice should one give to kids to prepare for the future? I used to think mastering basics of physics, math, cs is the way to go but now I’ve updated my belief as these fields will get automated soon. What we need kids to learn is personality traits like grit, resourcefulness, optimism, resilience, etc.

41 replies · 7 reposts · 279 likes · 22.3K views
Paras Chopra
Paras Chopra@paraschopra·
Steal this idea. Here's a semi-research project that's been on my mind for a while. Take the frontier sovereign models of each country and subject them to multiplayer war games, to come up with an Elo-based leaderboard. This should be a wake-up call for nations.
18 replies · 4 reposts · 238 likes · 14.6K views
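For the leaderboard mechanics, the standard Elo update would suffice. A minimal sketch in Python (the K-factor and function names are illustrative assumptions, not part of the original idea):

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that player A beats player B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))

def elo_update(rating_a: float, rating_b: float, score_a: float, k: float = 32.0):
    """Return updated (rating_a, rating_b) after one game.

    score_a is 1.0 for an A win, 0.5 for a draw, 0.0 for a loss.
    """
    e_a = expected_score(rating_a, rating_b)
    new_a = rating_a + k * (score_a - e_a)
    new_b = rating_b + k * ((1.0 - score_a) - (1.0 - e_a))
    return new_a, new_b

# e.g. two sovereign models start at 1500; model A wins one war game
print(elo_update(1500.0, 1500.0, score_a=1.0))  # -> (1516.0, 1484.0)
```

Repeated pairwise games fed through updates like this yield the ranking; true multiplayer war games would need an extension, such as scoring each game as a set of pairwise results.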
Paras Chopra
Paras Chopra@paraschopra·
What advice should one give to kids to prepare for the future? I used to think mastering basics of physics, math, cs is the way to go but now I’ve updated my belief as these fields will get automated soon. What we need kids to learn is personality traits like grit, resourcefulness, optimism, resilience, etc.
158 replies · 58 reposts · 896 likes · 89.8K views
Jaskaran Singh
Jaskaran Singh@jasksing·
Open data, open weights, with a detailed technical report... in this economy??
Kangwook Lee@Kangwook_Lee

My team has been cooking nonstop for a while... and I'm so excited to finally share what we've been building!!! Today, we're releasing four open models, many of which are the best models of their size 🥳!!!

tldr;
1) Raon-Speech: 9B SOTA speech LLM
2) Raon-SpeechChat: 9B full-duplex model
3) Raon-OpenTTS: 0.3B/1B open-data-open-weight SOTA TTS
4) Raon-VisionEncoder: 0.4B vision encoder trained only with public data
huggingface.co/collections/KR…

===

1) Raon-Speech (9B)
Raon-Speech is a speech LLM (LLM + speech understanding + speech generation). It's a bilingual model (English/Korean), and it's ranked #1 on both leaderboards 😎 tldr; it's the best open-model alternative to ChatGPT voice mode.
Model: huggingface.co/KRAFTON/Raon-S…
Tech report: huggingface.co/KRAFTON/Raon-S…
Web demo: raon.krafton.ai ("Speech Chat" menu here. "auto" is a bit unstable, so use "manual" and choose the language!)

2) Raon-SpeechChat (9B)
While a speech LLM is useful, it's kind of like a walkie-talkie. A full-duplex model is more like a phone, so it is even more useful in many applications. That's why we also built and are releasing Raon-SpeechChat. Again, on several quantitative evaluation metrics, Raon-SpeechChat scored the best on average.
Model: huggingface.co/KRAFTON/Raon-S…
Tech report: huggingface.co/KRAFTON/Raon-S…
Web demo: raon.krafton.ai ("Full Duplex" menu here.)

3) Raon-OpenTTS (0.3B, 1B)
We're also releasing Raon-OpenTTS, a state-of-the-art open-data, open-weight TTS model.
Model + data: huggingface.co/KRAFTON/Raon-O…
The 1B model and a detailed tech report are coming soon!

4) Raon-VisionEncoder (0.4B)
Last but not least, we're releasing Raon-VisionEncoder, a vision encoder trained from scratch using only public data. It closely matches SOTA vision encoder quality too!
Model: huggingface.co/KRAFTON/Raon-V…
Tech blog: krafton.ai/blog/posts/202…

===

That's it! I'm incredibly proud of what my team has built! My AI research team at KRAFTON (@Krafton_AI), which is undoubtedly the most cracked team in Korea, has been cooking nonstop for a while for this 😅... This is just the beginning of our planned model releases, so stay tuned!

ps1/ Ah, by the way, you may ask why "Raon"? "Raon" is an old Korean word meaning happy. And, well, we're kRAftON :-)

ps2/ KRAFTON is one of the four teams participating in Korea's national frontier-model project, together with SK Telecom. We're training something very exciting together... and more to come soon!

0 replies · 0 reposts · 2 likes · 69 views
Peter Pike
Peter Pike@AuthorPeterPike·
@DrChrisCombs The line is above the dog's head, and we don't have anything saying the pole or the dog is the same size on both sides, so strictly speaking the answer is unknowable. Assuming the gaps don't matter and the objects are the same size, the dog is 50 cm.
4 replies · 4 reposts · 102 likes · 21.6K views
Jaskaran Singh reposted
Tanmay
Tanmay@imnottanmay·
@HarveenChadha Pradhan Mantri har ghar backprop yojna
3 replies · 3 reposts · 40 likes · 1.1K views
Jaskaran Singh
Jaskaran Singh@jasksing·
was waiting for JEPA to come to audio. Clearly, working in latent space proves to be effective!
0 replies · 0 reposts · 1 like · 74 views
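The core of the latent-space idea, as a rough sketch in PyTorch (architecture, names, and shapes are illustrative assumptions; this is the generic JEPA recipe, not the specific audio paper): encode a masked context, predict the latents of the full input, and take the loss in latent space instead of reconstructing the waveform.

```python
import torch
import torch.nn as nn

class TinyAudioJEPA(nn.Module):
    """Toy JEPA-style model: regress target latents, never raw audio."""

    def __init__(self, dim: int = 256, frame: int = 400):
        super().__init__()
        # context and target encoders map raw audio frames to latents
        self.context_encoder = nn.Sequential(
            nn.Linear(frame, dim), nn.GELU(), nn.Linear(dim, dim))
        self.target_encoder = nn.Sequential(
            nn.Linear(frame, dim), nn.GELU(), nn.Linear(dim, dim))
        self.predictor = nn.Linear(dim, dim)

    def loss(self, frames: torch.Tensor, mask: torch.Tensor) -> torch.Tensor:
        # frames: (batch, n_frames, frame_len); mask: (batch, n_frames) bool
        with torch.no_grad():  # target encoder provides fixed regression targets
            targets = self.target_encoder(frames)
        # zero out masked frames, encode the visible context, predict latents
        context = self.context_encoder(frames * (~mask).unsqueeze(-1))
        pred = self.predictor(context)
        # regression in latent space, only on masked positions
        return ((pred - targets) ** 2)[mask].mean()
```

In full I-JEPA/V-JEPA training the target encoder is an EMA copy of the context encoder; the point here is just that no decoder back to audio samples is ever needed.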
Ishaan
Ishaan@auto_grad_·
@KathuriaAyoosh everything in DL is theoretical, there's no point in picking up DL if you're afraid to put a bit of pressure on your brain to understand stuff
2 replies · 0 reposts · 5 likes · 861 views
Ishaan
Ishaan@auto_grad_·
if you want to learn the beautiful domain of LLM-RL, this is the basic path i would suggest:
> go through david silver's playlist (with sutton's book alongside you; it has stuff david didn't cover in classes)
> go through the policy gradient blog by karpathy
> try to formulate the MDP for applying RL on LLMs without any external help (think hard on this)
> once you get it, implement ppo via pytorch and play with hyperparams (see the sketch below)
9 replies · 35 reposts · 478 likes · 19.2K views
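A minimal sketch of that last step, the PPO clipped surrogate loss in PyTorch (names and the 0.2 clip value are the usual conventions, assumed rather than taken from the thread):

```python
import torch

def ppo_clip_loss(logp_new: torch.Tensor,
                  logp_old: torch.Tensor,
                  advantages: torch.Tensor,
                  clip_eps: float = 0.2) -> torch.Tensor:
    """Clipped surrogate objective from the PPO paper, as a loss to minimize.

    All tensors are per-token (or per-action) and the same shape:
    log-probs under the current and the rollout policy, plus advantages.
    """
    ratio = torch.exp(logp_new - logp_old)          # pi_new / pi_old
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1.0 - clip_eps, 1.0 + clip_eps) * advantages
    # PPO maximizes the pessimistic (min) objective; negate for SGD
    return -torch.min(unclipped, clipped).mean()
```

In the LLM setting the "actions" are sampled tokens, so logp_new/logp_old come from gathering per-token log-probs out of the model's logits, and advantages come from a value model or a group baseline (as in GRPO).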
W
W@voughtboy·
somehow cleared GATE. might do masters from IIT.
68 replies · 8 reposts · 1.1K likes · 46.1K views
Jaskaran Singh reposted
Pedro Domingos
Pedro Domingos@pmddomingos·
Geoff Hinton set out to figure out how the brain works and failed. Andrew Ng set out to build a complete robot and failed. Demis Hassabis set out to achieve AGI using deep RL and failed. Yet they all succeeded.
32 replies · 32 reposts · 566 likes · 39.4K views