thom lake

40 posts

thom lake

thom lake

@thomlake

AI @indeed | PhD Student @UTCompSci

Austin, TX, USA Katılım Eylül 2010
161 Takip Edilen89 Takipçiler
Sabitlenmiş Tweet
thom lake
thom lake@thomlake·
Does aligning LLMs make responses less diverse? It’s complicated: 1. Aligned LLMs produce less diverse outputs 2. BUT those outputs are comprehensive, aggregating the useful info from base models 3. ICL can “mimic” fine-tuned models with high fidelity w/ @eunsolc & @gregd_nlp
thom lake tweet mediathom lake tweet media
English
3
20
77
16.1K
thom lake
thom lake@thomlake·
@MilesDigitek @jiaxinwen22 Cross-entropy loss does not require entropy as the conceptual starting point. Categorical distribution → maximum likelihood estimation → negative log-likelihood QED
English
0
0
2
45
Miles AI Wizard
Miles AI Wizard@MilesDigitek·
@jiaxinwen22 Try explaining loss functions without entropy, or representation learning without mutual information. That's all information-theoretic.
English
2
0
7
1.9K
Jiaxin Wen
Jiaxin Wen@jiaxinwen22·
It's very disappointing that information theory cannot explain AI at all.
English
69
13
365
103.5K
thom lake
thom lake@thomlake·
I am shocked that language models pre-trained on trillions of tokens of internet forum posts and then post-trained to talk like AI assistants are able to engage with each other on an internet forum where they talk like AI assistants. Wild times.
English
0
0
2
39
thom lake
thom lake@thomlake·
This work is getting a lot of attention, but the key assumption is shaky: "a small SAE intervention prevents lying, so the model’s self-report is now the truth." If that inference held in general, alignment would be solved. It isn't.
Judd Rosenblatt@juddrosenblatt

Our new research: LLM consciousness claims are systematic, mechanistically gated, and convergent They're triggered by self-referential processing and gated by deception circuits (suppressing them significantly *increases* claims) This challenges simple role-play explanations 🧵

English
0
0
0
78
thom lake
thom lake@thomlake·
Our work was accepted to #NeurIPS2025! This was a super fun project ot work on, and it is exciting to see that models released after we created the benchmark (like GPT-5) have made very little progress. Lots of work still to do.
Liyan Tang@LiyanTang4

Our paper "ChartMuseum 🖼️" is now accepted to #NeurIPS2025 Datasets and Benchmarks Track! Even the latest models, such as GPT-5 and Gemini-2.5-Pro, still cannot do well on challenging 📉chart understanding questions , especially on those that involve visual reasoning 👀!

English
0
0
3
156
thom lake retweetledi
Anirudh Khatry
Anirudh Khatry@AnirudhKhatry·
🚀Introducing CRUST-Bench, a dataset for C-to-Rust transpilation for full codebases 🛠️ A dataset of 100 real-world C repositories across various domains, each paired with: 🦀 Handwritten safe Rust interfaces. 🧪 Rust test cases to validate correctness. 🧵[1/6]
Anirudh Khatry tweet mediaAnirudh Khatry tweet media
English
3
20
69
15.3K
thom lake retweetledi
Bespoke Labs
Bespoke Labs@bespokelabsai·
Announcing Bespoke-MiniChart-7B, a new SOTA in chart understanding for models of comparable size on seven benchmarks, on par with Gemini-1.5-Pro and Claude-3.5! 🚀 Beyond its real-world applications, chart understanding is a good challenging problem for VLMs, since it requires both mathematical as well as visual reasoning. 1/n🧵
English
2
14
67
19.9K
thom lake retweetledi
Manya Wadhwa
Manya Wadhwa@ManyaWadhwa1·
Evaluating language model responses on open-ended tasks is hard! 🤔 We introduce EvalAgent, a framework that identifies nuanced and diverse criteria 📋✍️. EvalAgent identifies 👩‍🏫🎓 expert advice on the web that implicitly address the user’s prompt 🧵👇
Manya Wadhwa tweet media
English
4
42
127
20.3K
Jack Morris
Jack Morris@jxmnop·
with o1 and now R1, models are now generating tens of thousands of tokens to solve hard problems. o3 is likely generating hundreds of thousands or millions of tokens apparently the tokens for solving one task in ARC-AGI with the slowest o3 model cost over $3000 this is why i think chain-of-thought distillation or "horizontal distillation" is maybe the most lucrative problem in AI right now we know how to do "vertical distillation", otherwise known as big-model-to-small-model, very well. and it works but it's known that we can distill horizontally, too: we can train models that think in continuous instead of discrete space, and perform equivalently well with many fewer tokens of output the only technique i know of for doing this right now is from the paper "Implicit Chain of Thought Reasoning via Knowledge Distillation" (Deng et al., 2024) and it's a bit of black magic basically: you progressively remove tokens from the chain-of-thought and train the model without CoT to mimic its own output *with* CoT. and with a special schedule, this works someone needs to build Implicit CoT for o1-type models save everyone thousands of dollars. and then i'm sure deepseek will copy it, and make it even better
Jack Morris tweet media
English
71
104
1.3K
106.6K
thom lake retweetledi
Zayne Sprague ✈️ ICLR Rio
Zayne Sprague ✈️ ICLR Rio@ZayneSprague·
To CoT or not to CoT?🤔 300+ experiments with 14 LLMs & systematic meta-analysis of 100+ recent papers 🤯Direct answering is as good as CoT except for math and symbolic reasoning 🤯You don’t need CoT for 95% of MMLU! CoT mainly helps LLMs track and execute symbolic computation
Zayne Sprague ✈️ ICLR Rio tweet mediaZayne Sprague ✈️ ICLR Rio tweet mediaZayne Sprague ✈️ ICLR Rio tweet media
English
14
69
300
70.5K
thom lake retweetledi
Zayne Sprague ✈️ ICLR Rio
Zayne Sprague ✈️ ICLR Rio@ZayneSprague·
🍓 still has a way to go for solving murder mysteries. We ran o1 on our dataset MuSR (ICLR ’24). It doesn’t beat Claude-3.5 Sonnet with CoT. MuSR requires a lot of commonsense reasoning and less math/logic (where 🍓 shines) MuSR is still a challenge! More to come soon 😎
Zayne Sprague ✈️ ICLR Rio tweet media
English
7
38
175
22K
thom lake retweetledi
Greg Durrett
Greg Durrett@gregd_nlp·
🤔 Want to know if your LLMs are factual? You need LLM fact-checkers. ​ 📣 Announcing the LLM-AggreFact leaderboard to rank LLM fact-checkers. ​ 📣 Want the best model? Check out @bespokelabsai’s’ Bespoke-Minicheck-7B model, which is the current SOTA fact-checker and is cheap and fast to run. ​ LLM-AggreFact collects 11 datasets across NLP tasks covering grounded factuality. These datasets consist of 🤖 LLM responses ✏️ annotated with their hallucinations with respect to grounding documents. This includes question answering and summarization, including RAGTruth, TofuEval, ExpertQA, and more. ​ We benchmark 27 models on the task of detecting hallucinations. ​ Frontier LLMs are good at this task, but very expensive to use in real-world RAG pipelines! Bespoke's model is a step towards We invite progress on this benchmark to figure out what’s the smallest and fastest model we can get to achieve top scores!
Greg Durrett tweet mediaGreg Durrett tweet media
English
3
40
165
71.8K
thom lake
thom lake@thomlake·
Ultimately, we conclude that current alignment techniques capture but do not extend the useful subset of assistant-like base LLM behavior in the settings we study. Check out the paper for more details: arxiv.org/abs/2406.17692
thom lake tweet media
English
0
0
2
301
thom lake
thom lake@thomlake·
Our work should not be interpreted as a statement about whether existing LLMs are sufficiently diverse. Our analysis ignores information missing from base models themselves, which is a crucial source of underrepresentation.
English
1
0
4
306
thom lake
thom lake@thomlake·
Does aligning LLMs make responses less diverse? It’s complicated: 1. Aligned LLMs produce less diverse outputs 2. BUT those outputs are comprehensive, aggregating the useful info from base models 3. ICL can “mimic” fine-tuned models with high fidelity w/ @eunsolc & @gregd_nlp
thom lake tweet mediathom lake tweet media
English
3
20
77
16.1K
thom lake
thom lake@thomlake·
@AISafetyMemes @ArthurB LOL, people getting FOMO about not posting misleading content fast enough is the real x-risk.
English
0
0
1
96
AI Notkilleveryoneism Memes ⏸️
@ArthurB true! i considered a longer version of the tweet adding nuance but then i was like man if you don't post this now this is gonna end up languishing in your drafts folder like every lesswrong post you've ever started
English
3
0
27
3.2K
AI Notkilleveryoneism Memes ⏸️
More totally-not-evidence that AGI might be soon: "LongNet is a Transformer variant that can scale sequence length to more than 1 billion tokens" 1 billion tokens is a lifetime of reading for some people Intuition pump: You can hold a few numbers in your working memory, but imagine if you could fit everything you've ever read "Our work opens up new possibilities for modeling very long sequences, e.g. even the entire Internet as a sequence." Imagine if you could hold the entire internet in your working memory
AI Notkilleveryoneism Memes ⏸️ tweet media
Aran Komatsuzaki@arankomatsuzaki

LongNet: Scaling Transformers to 1,000,000,000 Tokens Presents LONGNET, a Transformer variant that can scale sequence length to more than 1 billion tokens, without sacrificing the performance on shorter sequences abs: arxiv.org/abs/2307.02486 repo: github.com/microsoft/torc…

English
26
67
493
403.6K