Tarun Khajuria

399 posts

@tarunkhajuria

PhD Student @UniTartuCS. Computational Neuroscience, Vision-Language.

Tartu, Eesti · Joined August 2009
1.3K Following · 197 Followers
Tarun Khajuria retweeted
François Chollet @fchollet
We underestimate how much "abstract" thought is just repurposed sensorimotor control circuitry. A lot of reasoning is essentially about moving through idea-space the way we move through physical space.
177
244
2.4K
137.6K
Tarun Khajuria retweeted
Ziming Liu @ZimingLiu11
🚨Transformers don't learn Newton's laws? They learn Kepler's laws! Like us, transformers don't predict a flying ball via a differential equation, but by fitting a curve. Moreover, reducing context length steers a transformer from Keplerian to Newtonian. Compression in play.
25
206
1.2K
117.1K
Tarun Khajuria retweeted
Liv @livgorton
1/ I've been thinking about the "non-linear features" objection to interpretability work. But it can be hard to reason about, since the space of possible non-linear features is so large. Here's an attempt to untangle it: livgorton.com/non-linear-fea…
25
56
679
84.6K
Tarun Khajuria retweeted
Jaan Aru @jaaanaru
Under challenging conditions, human vision turns into iterative problem solving. Typical deep learning algorithms don't capture this behavior. Our paper is out now in PLOS Computational Biology, led by @tarunkhajuria: journals.plos.org/ploscompbiol/a…
2
13
42
3.3K
The Jaipur Dialogues @JaipurDialogues
Justice Gavai mocked Bhagwan Vishnu: No Action
Ajeet Bharti mocked Justice Gavai: Picked up for Questioning by Noida Police
Is Contempt of Judges Higher Than Contempt of God?
953
10K
30.9K
297.7K
Tarun Khajuria retweeted
Jim Fan @DrJimFan
I just pulled the numbers on vision-language benchmarks for Llama-3.2-11B (vision). Surprisingly, the open-source community at large isn't behind in the lightweight model class! Pixtral, Qwen2-VL, Molmo, and InternVL2 all stand strong. OSS AI models have never been stronger. The last 3 lines are API-only frontier models. Gemini-flash and GPT-4o (likely in heavier-weight class) are still the reigning champions. But never bet against OSS. Never underestimate the combined firepower of so many talents distributed all over the world.
18
64
502
61.1K
Tarun Khajuria retweeted
Andrew Lampinen @AndrewLampinen
What aspects of human knowledge are vision models missing, and can we align them with human knowledge to improve their performance and robustness on cognitive and ML tasks? Excited to share this new work led by @lukas_mut! 1/10
5
56
401
61.6K
Tarun Khajuria retweeted
Jason Wei @_jasonwei
Super excited to finally share what I have been working on at OpenAI! o1 is a model that thinks before giving the final answer. In my own words, here are the biggest updates to the field of AI (see the blog post for more details):
1. Don’t do chain of thought purely via prompting, train models to do better chain of thought using RL.
2. In the history of deep learning we have always tried to scale training compute, but chain of thought is a form of adaptive compute that can also be scaled at inference time.
3. Results on AIME and GPQA are really strong, but that doesn’t necessarily translate to something that a user can feel. Even as someone working in science, it’s not easy to find the slice of prompts where GPT-4o fails, o1 does well, and I can grade the answer. But when you do find such prompts, o1 feels totally magical. We all need to find harder prompts.
4. AI models chain of thought using human language is great in so many ways. The model does a lot of human-like things, like breaking down tricky steps into simpler ones, recognizing and correcting mistakes, and trying different approaches. Would highly encourage everyone to look at the chain of thought examples in the blog post.
The game has been totally redefined.
OpenAI @OpenAI

We're releasing a preview of OpenAI o1—a new series of AI models designed to spend more time thinking before they respond. These models can reason through complex tasks and solve harder problems than previous models in science, coding, and math. openai.com/index/introduc…

87
338
3.3K
526.9K
Tarun Khajuria retweeted
Jaan Aru @jaaanaru
Could mimicking psychedelic effects in virtual reality bring along therapeutic effects associated with psychedelics?🤔 In our new work, we developed a psychedelic VR experience - Psyrreal - and showed its potential for reducing depressive symptoms. 1/n frontiersin.org/articles/10.33…
6
17
114
16.6K
Tarun Khajuria retweeted
Jaan Aru @jaaanaru
In science we value citations but it is quite refreshing when someone does a video about your work instead. 🙏@tipado VR experience is not the same as psychedelics but it's worth studying the potential of virtual psychedelics. psyarxiv.com/uh9kf youtu.be/Aqxk9SQ41Js?t=…
0
4
19
0
Tarun Khajuria retweeted
PHD Comics @PHDcomics
Academic Hell
30
1.5K
7.1K
0
Tarun Khajuria retweeted
Chong Guo @ChongGuo6
Is it possible that adversarially-trained DNNs are already more robust than the biological neural networks of primate visual cortex? Here is a short thread for our #ICML2022 paper arxiv.org/pdf/2206.11228…. 1/8
17
116
646
0
Tarun Khajuria retweeted
Jaan Aru @jaaanaru
Can you guess what's hidden in these images? How did you arrive at the solution? Can we build AI algorithms that solve it in a similar way? We developed a new dataset to investigate the iterative generation and refinement of perceptual hypotheses.🧵 openaccess.thecvf.com/content/CVPR20…
3
19
85
0
Tarun Khajuria retweeted
Jaan Aru @jaaanaru
Please share 🙏 A fully funded Ph.D. position on "Injecting background knowledge for Explainable AI" is available at the University of Tartu. The project revolves around extracting knowledge from text to help interpret AI models for biomedicine. reaalteadused.ut.ee/en/content/doc…
0
9
13
0
ankit @pxthxk
🥂
5
1
16
0