thom lake

40 posts

thom lake

@thomlake

AI @indeed | PhD Student @UTCompSci

Austin, TX, USA Katılım Eylül 2010

161 Takip Edilen89 Takipçiler

Sabitlenmiş Tweet

thom lake@thomlake·26 Haz

Does aligning LLMs make responses less diverse? It’s complicated: 1. Aligned LLMs produce less diverse outputs 2. BUT those outputs are comprehensive, aggregating the useful info from base models 3. ICL can “mimic” fine-tuned models with high fidelity w/ @eunsolc & @gregd_nlp

English

16.1K

thom lake@thomlake·5d

@MilesDigitek @jiaxinwen22 Cross-entropy loss does not require entropy as the conceptual starting point. Categorical distribution → maximum likelihood estimation → negative log-likelihood QED

English

Miles AI Wizard@MilesDigitek·5d

@jiaxinwen22 Try explaining loss functions without entropy, or representation learning without mutual information. That's all information-theoretic.

English

1.9K

Jiaxin Wen@jiaxinwen22·5d

It's very disappointing that information theory cannot explain AI at all.

English

365

103.5K

thom lake@thomlake·5 Şub

I am shocked that language models pre-trained on trillions of tokens of internet forum posts and then post-trained to talk like AI assistants are able to engage with each other on an internet forum where they talk like AI assistants. Wild times.

English

thom lake@thomlake·29 Ara

This work is getting a lot of attention, but the key assumption is shaky: "a small SAE intervention prevents lying, so the model’s self-report is now the truth." If that inference held in general, alignment would be solved. It isn't.

Judd Rosenblatt@juddrosenblatt

Our new research: LLM consciousness claims are systematic, mechanistically gated, and convergent They're triggered by self-referential processing and gated by deception circuits (suppressing them significantly *increases* claims) This challenges simple role-play explanations 🧵

English

thom lake@thomlake·19 Eyl

Our work was accepted to #NeurIPS2025! This was a super fun project ot work on, and it is exciting to see that models released after we created the benchmark (like GPT-5) have made very little progress. Lots of work still to do.

Liyan Tang@LiyanTang4

Our paper "ChartMuseum 🖼️" is now accepted to #NeurIPS2025 Datasets and Benchmarks Track! Even the latest models, such as GPT-5 and Gemini-2.5-Pro, still cannot do well on challenging 📉chart understanding questions , especially on those that involve visual reasoning 👀!

English

156

thom lake@thomlake·30 Nis

Interested in how alignment changes the response distribution defined by LLMs? Come check out my poster at 2 PM at #NAACL2025 x.com/thomlake/statu…

thom lake@thomlake

English

2.5K

thom lake retweetledi

Anirudh Khatry@AnirudhKhatry·23 Nis

🚀Introducing CRUST-Bench, a dataset for C-to-Rust transpilation for full codebases 🛠️ A dataset of 100 real-world C repositories across various domains, each paired with: 🦀 Handwritten safe Rust interfaces. 🧪 Rust test cases to validate correctness. 🧵[1/6]

English

15.3K

thom lake retweetledi

Bespoke Labs@bespokelabsai·23 Nis

Announcing Bespoke-MiniChart-7B, a new SOTA in chart understanding for models of comparable size on seven benchmarks, on par with Gemini-1.5-Pro and Claude-3.5! 🚀 Beyond its real-world applications, chart understanding is a good challenging problem for VLMs, since it requires both mathematical as well as visual reasoning. 1/n🧵

English

19.9K

thom lake retweetledi

Manya Wadhwa@ManyaWadhwa1·22 Nis

Evaluating language model responses on open-ended tasks is hard! 🤔 We introduce EvalAgent, a framework that identifies nuanced and diverse criteria 📋✍️. EvalAgent identifies 👩‍🏫🎓 expert advice on the web that implicitly address the user’s prompt 🧵👇

English

127

20.3K

thom lake@thomlake·24 Oca

@jxmnop Some recent references: arxiv.org/abs/2412.06769 and arxiv.org/abs/2412.17747?

English

136

Jack Morris@jxmnop·24 Oca

with o1 and now R1, models are now generating tens of thousands of tokens to solve hard problems. o3 is likely generating hundreds of thousands or millions of tokens apparently the tokens for solving one task in ARC-AGI with the slowest o3 model cost over $3000 this is why i think chain-of-thought distillation or "horizontal distillation" is maybe the most lucrative problem in AI right now we know how to do "vertical distillation", otherwise known as big-model-to-small-model, very well. and it works but it's known that we can distill horizontally, too: we can train models that think in continuous instead of discrete space, and perform equivalently well with many fewer tokens of output the only technique i know of for doing this right now is from the paper "Implicit Chain of Thought Reasoning via Knowledge Distillation" (Deng et al., 2024) and it's a bit of black magic basically: you progressively remove tokens from the chain-of-thought and train the model without CoT to mimic its own output *with* CoT. and with a special schedule, this works someone needs to build Implicit CoT for o1-type models save everyone thousands of dollars. and then i'm sure deepseek will copy it, and make it even better

English

104

1.3K

106.6K

thom lake retweetledi

Zayne Sprague ✈️ ICLR Rio@ZayneSprague·19 Eyl

To CoT or not to CoT?🤔 300+ experiments with 14 LLMs & systematic meta-analysis of 100+ recent papers 🤯Direct answering is as good as CoT except for math and symbolic reasoning 🤯You don’t need CoT for 95% of MMLU! CoT mainly helps LLMs track and execute symbolic computation

English

300

70.5K

thom lake retweetledi

Zayne Sprague ✈️ ICLR Rio@ZayneSprague·13 Eyl

🍓 still has a way to go for solving murder mysteries. We ran o1 on our dataset MuSR (ICLR ’24). It doesn’t beat Claude-3.5 Sonnet with CoT. MuSR requires a lot of commonsense reasoning and less math/logic (where 🍓 shines) MuSR is still a challenge! More to come soon 😎

English

175

22K

thom lake retweetledi

Greg Durrett@gregd_nlp·9 Ağu

🤔 Want to know if your LLMs are factual? You need LLM fact-checkers. 📣 Announcing the LLM-AggreFact leaderboard to rank LLM fact-checkers. 📣 Want the best model? Check out @bespokelabsai’s’ Bespoke-Minicheck-7B model, which is the current SOTA fact-checker and is cheap and fast to run. LLM-AggreFact collects 11 datasets across NLP tasks covering grounded factuality. These datasets consist of 🤖 LLM responses ✏️ annotated with their hallucinations with respect to grounding documents. This includes question answering and summarization, including RAGTruth, TofuEval, ExpertQA, and more. We benchmark 27 models on the task of detecting hallucinations. Frontier LLMs are good at this task, but very expensive to use in real-world RAG pipelines! Bespoke's model is a step towards We invite progress on this benchmark to figure out what’s the smallest and fastest model we can get to achieve top scores!

English

165

71.8K

thom lake@thomlake·26 Haz

Ultimately, we conclude that current alignment techniques capture but do not extend the useful subset of assistant-like base LLM behavior in the settings we study. Check out the paper for more details: arxiv.org/abs/2406.17692

English

301

thom lake@thomlake·26 Haz

Our work should not be interpreted as a statement about whether existing LLMs are sufficiently diverse. Our analysis ignores information missing from base models themselves, which is a crucial source of underrepresentation.

English

306

thom lake@thomlake·26 Haz

English

16.1K

thom lake@thomlake·10 Tem

@AISafetyMemes @ArthurB LOL, people getting FOMO about not posting misleading content fast enough is the real x-risk.

English

AI Notkilleveryoneism Memes ⏸️@AISafetyMemes·9 Tem

@ArthurB true! i considered a longer version of the tweet adding nuance but then i was like man if you don't post this now this is gonna end up languishing in your drafts folder like every lesswrong post you've ever started

English

3.2K

AI Notkilleveryoneism Memes ⏸️@AISafetyMemes·9 Tem

More totally-not-evidence that AGI might be soon: "LongNet is a Transformer variant that can scale sequence length to more than 1 billion tokens" 1 billion tokens is a lifetime of reading for some people Intuition pump: You can hold a few numbers in your working memory, but imagine if you could fit everything you've ever read "Our work opens up new possibilities for modeling very long sequences, e.g. even the entire Internet as a sequence." Imagine if you could hold the entire internet in your working memory

AI Notkilleveryoneism Memes ⏸️ tweet media

Aran Komatsuzaki@arankomatsuzaki

LongNet: Scaling Transformers to 1,000,000,000 Tokens Presents LONGNET, a Transformer variant that can scale sequence length to more than 1 billion tokens, without sacrificing the performance on shorter sequences abs: arxiv.org/abs/2307.02486 repo: github.com/microsoft/torc…

English

493

403.6K

Keşfet

@MilesDigitek @jiaxinwen22 @jxmnop @bespokelabsai @eunsolc @gregd_nlp @AISafetyMemes @ArthurB