Clement Neo
@_clementneo
407 posts
making AI safer
Singapore · Joined March 2021
318 Following · 469 Followers

Pinned Tweet
Clement Neo@_clementneo·
🧠🖼️ New paper on interpreting VLMs! We study Vision-Language Models (VLMs) like LLaVA to understand how they process objects in images. We find surprising insights about how these models identify objects in images and how their inner representations develop through the layers.
Clement Neo tweet media
2 replies · 15 reposts · 65 likes · 14.1K views
Clement Neo@_clementneo·
@xuanalogue @StephenLCasper Yeah this is likely overcounting - they’re counting the full funding amount for when DTC was established in 2022, which is before the DTC was designated the AISI two years later in 2024. The DTC also still works on some other non-AIS work afaik
0 replies · 0 reposts · 1 like · 20 views
Cas (Stephen Casper)@StephenLCasper·
CAISI is very small and bureaucratically suppressed. This is part of why I, as an American, pursued a residency at UK AISI instead of US CAISI. I'm not sure of the exact counts. But I wouldn't be surprised if UK AISI has more Americans working for it than CAISI.
Cole Salvador@ColeSalvador31

In 2022 and 2023, tiny teams of researchers drew straight lines on graphs that predicted the US was headed for an energy bottleneck in AI. But the government had no idea. The future of AI is too important to make the same mistake again. We need talent-dense, AI-focused offices that can skate to where the puck is going and implement President Trump's AI agenda.

In a new piece for AFPI (@A1Policy), we discuss 2 promising offices that could act as hubs of government AI foresight: the Center for AI Standards and Innovation (CAISI) in the Department of Commerce and the Bureau of Emerging Threats (ET) in the Department of State. We found that they have the density of talent to succeed but still lack resources: funding, headcount, and authorization. Here's a summary:

1) The Center for AI Standards and Innovation (CAISI) lacks resources
> It has talented technical staff and a strong track record in evaluations, industry relationships, and insight into China
> But it's chronically underfunded. It's been around for 3 years but only received $30M in total, not annual, funds. That's 11 times less than the UK's equivalent. (It's even short of Canada and Singapore)
> It only has 20-30 employees, who are swamped with workstreams and external requests from agencies like the IC
To solve this, Congress should fund CAISI with an annual budget of $50-100 million.

2) CAISI lacks authorization or a focused mission
> Between Department asks, inbound from other offices, and the AI Action Plan, it has more missions than staff
> Its critical mission could be threatened by future administrations, which could externally pressure it to pursue DEI initiatives
Congress needs to enshrine the office and give it a clear mission. We present an America First vision for CAISI, in which it acts as a technical strike team, bridge between industry and government, frontier analysis unit, and technical standards organization.
3) The Bureau of Emerging Threats (ET) lacks authorization
> ET is similarly talent-dense, with experts in cyber, AI, and international relations
> But it lacks congressional authorization and could be destroyed or co-opted by future administrations
The Bureau needs concrete support from Congress and levers of interagency influence, like regular reports to national security leaders.

With appropriate action, Congress can help ensure the President has the resources he needs to help America win the AI race and usher in a new golden age of human flourishing. Always fun to collaborate with @CrovitzJack and @YusufSMahmood, who have posted about other sections of our piece.

1 reply · 1 repost · 57 likes · 9.2K views
Boyang "Albert" Li@AlbertBoyangLi·
These days I feel LLMs are still worse than humans but I can't say exactly where or how. Do you share the same feeling? Well, maybe we found where and how.

We created 500 stories with fewer than 100 words on average and only 2 choices. On this simple test, GPT-4o falls to the level of random guesses. Without any training, humans are at 92%.

This test requires LLMs to think of story characters as independent actors with their own minds, who choose actions based on their own knowledge. And, if you believe the cognitive scientists, it is a prerequisite skill to the understanding of human intentions.

LLMs, we would argue, do not understand human intentions. At least not yet. And that's why they don't follow your instructions very well.
Boyang "Albert" Li tweet media
1 reply · 0 reposts · 5 likes · 368 views
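The benchmark described above can be pictured with a toy harness. Everything here is a hypothetical stand-in: the story, the choices, and both "models" are illustrative, not the actual dataset or GPT-4o. The key point it demonstrates is the tweet's framing that a two-choice format pins random guessing at 50%, while tracking a character's own (possibly outdated) knowledge is what separates a belief-tracking solver from a guesser.

```python
import random

# Hypothetical example item in the style the tweet describes: a short story,
# two answer choices, and a gold label. (Illustrative, not from the dataset.)
ITEMS = [
    {
        "story": ("Mia puts her keys in the drawer and leaves. While she is "
                  "out, Ben moves the keys to the shelf. Mia returns for them."),
        "question": "Where does Mia look first?",
        "choices": ["the drawer", "the shelf"],
        "answer": 0,  # Mia acts on her own outdated knowledge, not the true state
    },
]

def accuracy(predict, items):
    """Fraction of items where the model picks the gold choice."""
    correct = sum(predict(item) == item["answer"] for item in items)
    return correct / len(items)

random.seed(0)
# A model that ignores belief-tracking and guesses between the two choices;
# this is the ~50% failure mode the tweet attributes to GPT-4o.
guesser = lambda item: random.randrange(len(item["choices"]))

# A belief-tracking "model" that answers from the character's knowledge state
# (a stand-in for the human solvers at 92%).
belief_tracker = lambda item: item["answer"]

print("guess accuracy:", accuracy(guesser, ITEMS * 500))          # near 0.5
print("belief accuracy:", accuracy(belief_tracker, ITEMS * 500))  # 1.0 by construction
```

The two-choice design matters: with a 50% floor, "falls to the level of random guesses" is a precise, falsifiable claim rather than a vague one.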
Clement Neo retweetledi
Greg Burnham@GregHBurnham·
I looked into this and the answer is so funny. In the No Thinking setting, Opus 4.5 repurposes the Python tool to have an extended chain of thought. It just writes long comments, prints something simple, and loops! Here's how it starts one problem:
Greg Burnham tweet media
Epoch AI@EpochAIResearch

Opus 4.5 scores the same on FrontierMath regardless of thinking budget, in contrast to GPT-5.1 where higher reasoning settings correspond to higher scores. However, on OTIS Mock AIME, another math benchmark, we see the thinking budget make a difference for Opus 4.5 as well.

22 replies · 52 reposts · 772 likes · 110.2K views
Clement Neo@_clementneo·
@jetnew_sg What would a systematic/breadth first approach look like? And would you say this is a job better done by the platform (ie a doc/notes type app with AI functionalities) or vice versa (an AI chatbot type app with note functionalities)?
1 reply · 0 reposts · 3 likes · 62 views
Jet New@jetnew_sg·
We need new interfaces for thinking. Notion, Obsidian, Google Docs, Apple Notes - none of them are designed around thinking with AI. ChatGPT, Claude, and a whole range of AI assistant apps are great for the “rabbit hole” type of inquiry. If you need a systematic, breadth-first approach, they unfortunately fall short.
2 replies · 0 reposts · 4 likes · 965 views
Clement Neo@_clementneo·
@jameschua_sg @PradyuPrasad I think the question here is whether the correlation means that it is a necessary evil, or we have just taken it to be so as an excuse to not try to go against the prevailing culture
1 reply · 0 reposts · 2 likes · 31 views
James Chua@jameschua_sg·
@PradyuPrasad (obviously you can be innovative without too many crazies on the street, but you get my point about tradeoffs)
1 reply · 0 reposts · 2 likes · 54 views
Pradyumna (in Bay Area)@PradyuPrasad·
I love the guy who posted the essay because he gets stuff done. But among all the people who complain, very few do. And this has been the case since the 70s! LKY in a speech then said that the Singaporean is a champion grumbler.
Pradyumna (in Bay Area)@PradyuPrasad

What I admire the most in Singaporeans who have made a change (AWARE, Razer and Lee Kuan Yew come to mind), is not only their ability to see the problem but to get stuff done. I think the onus to do that is on us.

2 replies · 1 repost · 27 likes · 1.7K views
polymath daddy@sujantkumarkv·
@kalomaze okay now I'm sold BUT please elaborate more on why it's such a difference.
1 reply · 0 reposts · 0 likes · 227 views
kalomaze@kalomaze·
uv is unfathomably good software
38 replies · 25 reposts · 1K likes · 57.7K views
Alper Canberk@alpercanbe·
how much does the last layer of a VLM retain the original image? i trained a linear probe on the output features of several CLIP/SigLIP models on image reconstruction, and found that *only* with SigLIP, if you multiply the input by 10-100, pixels get reconstructed perfectly??
Alper Canberk tweet media
14 replies · 19 reposts · 202 likes · 51.4K views
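The probe setup Alper describes reduces to a single linear regression. A minimal sketch, with assumptions labeled: the features and pixel targets below are random stand-ins, whereas in the real experiment the inputs would be CLIP/SigLIP output features and the targets flattened image pixels; the reported 10-100x input scaling is part of his finding and not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d_feat, d_pix = 256, 64, 3 * 16 * 16   # images flattened to pixel vectors

feats = rng.normal(size=(n, d_feat))      # model output features (stand-in)
pixels = rng.normal(size=(n, d_pix))      # flattened target images (stand-in)

# A "linear probe" for reconstruction is just least squares from features
# to pixels: find W minimizing ||feats @ W - pixels||^2.
W, *_ = np.linalg.lstsq(feats, pixels, rcond=None)
recon = feats @ W

mse = float(np.mean((recon - pixels) ** 2))
print(f"train reconstruction MSE: {mse:.4f}")
```

With random stand-in data the probe cannot reconstruct the targets, which is the expected null result; the surprise in the tweet is that real SigLIP features, scaled up, reportedly do allow near-perfect pixel reconstruction.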
Clement Neo@_clementneo·
This is going to be an interesting social, and the team organizing this are super cool! The public sector is often a good indicator of conservative LLM adopters worried about safety guarantees, and I’ve learned a ton from these folks. Definitely do attend if you’re free.
Gabriel Chua@gabrielchua

interested in LLMs for the public sector? join us at our @iclr_conf social on day 1! we'll share insights on our latest initiatives and discuss collaboration, research, and career opportunities in public sector AI

0 replies · 0 reposts · 3 likes · 496 views
Clement Neo@_clementneo·
@gzcl3000 My take is that the author is a historian, so Sapiens made sense and was good because it's a history book while the other books weren't compelling
0 replies · 0 reposts · 1 like · 41 views
Clement Neo@_clementneo·
@ivanleomk What do you find good? My brain has been so hardwired on the types of tasks I perceive Claude vs 4o to be good at that I find it hard to properly explore how 4o has improved
1 reply · 0 reposts · 1 like · 96 views
Ivan Leo@ivanleomk·
Might switch from Claude to 4o wow
2 replies · 0 reposts · 9 likes · 1.1K views
Clement Neo@_clementneo·
This reminds me of a phenomenon I think I saw (but can't find/verify) where Claude was somewhat aware that its response had been pre-filled and expressed a similar disbelief at the earlier part of its response. Does anyone know what I'm referring to?
Garrison Lovely@GarrisonLovely

My roommates kept asking me if the AIs can count the Rs in "Strawberry" yet. The answer is mostly yes (see below), but holy shit, DeepSeek R1's reasoning legitimately stressed me out. It reads like the inner monologue of the world's most neurotic & least self-confident person🧵

0 replies · 0 reposts · 0 likes · 559 views
Clement Neo@_clementneo·
@jetnew_sg Do you find that memory is useful? A lot of the time with ChatGPT I find that the most random things end up in there and I'm not sure how it improves things, as compared to Claude's project memory where you can stash key files
1 reply · 0 reposts · 1 like · 81 views
Jet New@jetnew_sg·
I really want a way to integrate personal memory or knowledge base with all model providers and applications. I use ChatGPT's memory extensively by giving life and business updates. But this personal memory isn't shared with o1, Claude, DeepSeek, Qwen. Is anyone building this?
2 replies · 0 reposts · 2 likes · 281 views
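The architecture Jet is asking for, a personal memory shared across providers, can be sketched in a few lines. This is a hypothetical design, not an existing product: `MemoryStore`, `memory.json`, and `with_memory` are all names invented for illustration. The idea is simply one local store of facts, injected as a system preamble into any provider's message list, since ChatGPT, Claude, DeepSeek, and Qwen APIs all accept some form of role-tagged messages.

```python
import json
import pathlib

class MemoryStore:
    """Local, provider-agnostic store of facts about the user (hypothetical)."""

    def __init__(self, path="memory.json"):
        self.path = pathlib.Path(path)
        self.facts = json.loads(self.path.read_text()) if self.path.exists() else []

    def remember(self, fact: str):
        # Append a fact and persist, so every app/model sees the same memory.
        self.facts.append(fact)
        self.path.write_text(json.dumps(self.facts))

    def as_preamble(self) -> str:
        return "Known about the user:\n" + "\n".join(f"- {f}" for f in self.facts)

def with_memory(messages, store):
    """Prepend the shared memory to any provider's chat message list."""
    return [{"role": "system", "content": store.as_preamble()}] + messages

store = MemoryStore()
store.remember("Runs a small business; prefers concise answers")
msgs = with_memory([{"role": "user", "content": "Draft a launch email"}], store)
```

The hard parts this sketch skips are exactly why the question is interesting: deciding *what* to remember, and syncing the store into apps that don't expose a system prompt.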
Clement Neo@_clementneo·
I actually worked on this paper for a really long time, my first draft was actually in June last year! I think I’ve grown quite a lot as a researcher since then. Thanks to @FazlBarez for advising me throughout this process and @apartresearch for getting me started in research!
0 replies · 0 reposts · 1 like · 339 views
Clement Neo@_clementneo·
I will be presenting my first ever poster at EMNLP 2024 from 10:30am-12pm today in the Jasmine room! I think I have a really nice poster so come check it out if you’re around :)
Fazl Barez@FazlBarez

📢 🎉 New paper with @_clementneo & Shay Cohen! We study how attention heads work with MLP neurons to predict the next token. We find a set of interpretable activity patterns. More in the thread!

1 reply · 3 reposts · 19 likes · 3.2K views
Clement Neo@_clementneo·
@akbirthko Do you think that internet-scale video pretraining would be an equally good prior? (recently I’ve been thinking about whether the most intelligent models for the next decade will continue to be those with majority text pretraining or not)
1 reply · 0 reposts · 1 like · 78 views
karthik@akbirthko·
"text is the universal interface" is empirically not true. the purpose of most wrappers is handling perception bottlenecks. "internet-scale text pretraining is a very good prior" is a more accurate statement
2 replies · 0 reposts · 19 likes · 503 views
Clement Neo@_clementneo·
If you have any thoughts about the paper, or have any cool ideas you think are worth exploring in VLMs, do reach out to me in the DMs or replies! I’m currently thinking of pursuing a PhD, and I’ll be exploring further research on multimodality over the coming months.
0 replies · 1 repost · 6 likes · 452 views
Clement Neo@_clementneo·
Also shoutout to this parallel work, which seems to have similar results for the logit lens! I think VLM interpretability is just starting to take off, and understanding multimodality is going to be important for the field. x.com/nickhjiang/sta…
Nick Jiang@nickhjiang

🔥 Paper Drop 🔥 What can we understand by peering inside vision-language models (VLMs) like LLaVA? We show that image representations inside VLMs can be directly interpreted and edited in the language space, and we apply our findings to mitigate hallucinations!

1 reply · 0 reposts · 7 likes · 814 views
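The logit lens mentioned above is a standard interpretability technique: project each layer's hidden state through the model's unembedding matrix to see which token the model would predict if decoding stopped at that layer. A minimal sketch with random stand-in weights (not a real VLM; shapes and names are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, vocab = 32, 100
unembed = rng.normal(size=(d_model, vocab))            # output head W_U (stand-in)
layers = [rng.normal(size=d_model) for _ in range(8)]  # one hidden state per layer

def logit_lens(hidden, W_U):
    """Token id the model would predict if decoding stopped at this layer."""
    return int(np.argmax(hidden @ W_U))

# Trajectory of the representation's nearest token across layers: for a VLM,
# applying this to image-patch positions shows how object representations
# develop through the layers, the kind of analysis both papers perform.
trajectory = [logit_lens(h, unembed) for h in layers]
print(trajectory)
```

With real weights, watching where in this trajectory an image token starts decoding to the object's name is what makes the technique useful for VLM interpretability.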