Seth Aycock
@sethjsa
25 posts

NLP PhD student in Low-resource Translation at @AmsterdamNLP @ltl_uva / Prev @InriaParisNLP / @InfAtEd / @Cambridge_Uni

Amsterdam, The Netherlands · Joined September 2015
596 Following · 140 Followers
Seth Aycock @sethjsa ·
MT Marathon this year organised by @HelsinkiNLP was a great week - I presented my research on chain-of-thought for machine translation, worked on a mini-research project, and explored the wonderful city of Helsinki including a few trips to the sauna 🫠
0 replies · 0 reposts · 2 likes · 72 views
Seth Aycock @sethjsa ·
@SimonHiaubeng @ElliotMurphy91 The principle Maximise Minimal Means is part of one version of minimalist theory. But it's not UG - it's a third factor, domain-general constraint
0 replies · 0 reposts · 1 like · 79 views
Hiáubêng @SimonHiaubeng ·
@ElliotMurphy91 UG sounds like a minimax principle, similar to those in mathematical real analysis: the minimal maximal domain is the place where good work can begin.
1 reply · 0 reposts · 2 likes · 366 views
Seth Aycock @sethjsa ·
@Linguist_UR @ElliotMurphy91 Merge, maybe Agree, maybe Labeling. Though I believe there's work ongoing to attribute Merge itself to third factor, domain-general constraints
0 replies · 0 reposts · 1 like · 192 views
The artist formerly known as TH
@ElliotMurphy91 Genuine question: isn't the SMT that it's Merge (language-specific set formation as the basis for hierarchical constituency) and nothing else?
1 reply · 0 reposts · 0 likes · 428 views
Seth Aycock @sethjsa ·
@RaphaelMerx I'm a fan of this paper! We'd expect exactly the same for Kalamang (if we could collect an OOD test set). In the appendix we also show that the 100-example test set consists of short, easy sentences, so a ChrF++ of ~30 is really not that proficient
0 replies · 0 reposts · 1 like · 57 views
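(Editor's note: as a rough sketch of the metric mentioned above, this is how a corpus-level ChrF++ score is typically computed with sacrebleu; the hypothesis and reference strings are placeholders, not the Kalamang test set.)

```python
# Minimal sketch: corpus-level ChrF++ with sacrebleu.
# The sentences below are placeholders, not the 100-example Kalamang test set.
from sacrebleu.metrics import CHRF

chrfpp = CHRF(word_order=2)  # word_order=2 gives chrF++ (adds word n-grams to character n-grams)
hypotheses = ["the dog runs to the house"]            # system translations
references = [["the dog is running to the house"]]    # one reference stream, aligned with hypotheses
print(chrfpp.corpus_score(hypotheses, references).score)  # chrF++ on a 0-100 scale; ~30 is a weak score
```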
Seth Aycock retweeted
Di Wu @diwuNLP ·
We show that a grammar book provides little or even no help for translation in LLMs, questioning the recent "truly zero-shot translation" --- no data no gain, still 🧐
Seth Aycock @sethjsa

Our work “Can LLMs Really Learn to Translate a Low-Resource Language from One Grammar Book?” is now on arXiv! arxiv.org/abs/2409.19151 - in collaboration with @davidstap, @diwuNLP, @c_monz , and Khalil Sima'an from @illc_amsterdam and @ltl_uva 🧵

0 replies · 1 repost · 8 likes · 673 views
Seth Aycock @sethjsa ·
@JeffDean (Plus, Kalamang parallel data has been online since November 2020!)
0 replies · 0 reposts · 1 like · 23 views
Seth Aycock @sethjsa ·
@JeffDean x.com/sethjsa/status… Actually we find LLMs learn most or all of their translation ability from parallel sentences in the book, not the grammar. And we can predict translation performance just from the prompt's coverage of the test-set vocabulary! But we do find that grammar can help *linguistic* tasks
Seth Aycock @sethjsa

Our work “Can LLMs Really Learn to Translate a Low-Resource Language from One Grammar Book?” is now on arXiv! arxiv.org/abs/2409.19151 - in collaboration with @davidstap, @diwuNLP, @c_monz , and Khalil Sima'an from @illc_amsterdam and @ltl_uva 🧵

1 reply · 1 repost · 1 like · 218 views
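(Editor's note: a rough sketch of the vocabulary-coverage idea referenced above; the tokenisation, function name, and data are illustrative assumptions, not the paper's actual code.)

```python
# Illustrative sketch (not the paper's code): estimate how much of a test
# sentence's source-side vocabulary is covered by the parallel examples
# included in the prompt. Higher coverage is expected to go with better
# translation quality.
def prompt_vocab_coverage(prompt_source_sentences, test_source_sentence):
    prompt_vocab = {w for s in prompt_source_sentences for w in s.lower().split()}
    test_tokens = test_source_sentence.lower().split()
    if not test_tokens:
        return 0.0
    return sum(w in prompt_vocab for w in test_tokens) / len(test_tokens)

# Placeholder strings standing in for Kalamang source sentences:
examples_in_prompt = ["src sentence one", "src sentence two"]
print(prompt_vocab_coverage(examples_in_prompt, "src sentence three"))  # 2 of 3 tokens covered -> ~0.67
```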
Jeff Dean @JeffDean ·
Gemini 1.5 Pro - A highly capable multimodal model with a 10M token context length

Today we are releasing the first demonstrations of the capabilities of the Gemini 1.5 series, with the Gemini 1.5 Pro model. One of the key differentiators of this model is its incredibly long context capabilities, supporting millions of tokens of multimodal input. The multimodal capabilities of the model mean you can interact in sophisticated ways with entire books, very long document collections, codebases of hundreds of thousands of lines across hundreds of files, full movies, entire podcast series, and more.

Gemini 1.5 was built by an amazing team of people from @GoogleDeepMind, @GoogleResearch, and elsewhere at @Google. @OriolVinyals (my co-technical lead for the project) and I are incredibly proud of the whole team, and we're so excited to be sharing this work and what long context and in-context learning can mean for you today!

There's lots of material about this, some of which is linked below.
Main blog post: blog.google/technology/ai/…
Technical report: "Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context" goo.gle/GeminiV1-5
Videos of interactions with the model that highlight its long context abilities:
Understanding the three.js codebase: youtube.com/watch?v=SSnsmq…
Analyzing a 45 minute Buster Keaton movie: youtube.com/watch?v=wa0MT8…
Apollo 11 transcript interaction: youtube.com/watch?v=LHKL_2…

Starting today, we're offering a limited preview of 1.5 Pro to developers and enterprise customers via AI Studio and Vertex AI. Read more about this on these blogs:
Google for Developers blog: developers.googleblog.com/2024/02/gemini…
Google Cloud blog: cloud.google.com/blog/products/…

We'll also introduce 1.5 Pro with a standard 128,000 token context window when the model is ready for a wider release. Coming soon, we plan to introduce pricing tiers that start at the standard 128,000 context window and scale up to 1 million tokens, as we improve the model. Early testers can try the 1 million token context window at no cost during the testing period.

We're excited to see what developers' creativity unlocks with a very long context window. Let me walk you through the capabilities of the model and what I'm excited about!
184 replies · 1.1K reposts · 6K likes · 1.7M views
Seth Aycock @sethjsa ·
@jxmnop x.com/sethjsa/status… It turns out LLMs learn most or all translation ability from parallel sentences in the book, not the grammar. And fine-tuning a small translation model matches or beats long-context LLM results! (plus Kalamang parallel data has been online since Nov 2020)
Seth Aycock @sethjsa

Our work “Can LLMs Really Learn to Translate a Low-Resource Language from One Grammar Book?” is now on arXiv! arxiv.org/abs/2409.19151 - in collaboration with @davidstap, @diwuNLP, @c_monz , and Khalil Sima'an from @illc_amsterdam and @ltl_uva 🧵

0 replies · 0 reposts · 3 likes · 71 views
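(Editor's note: below is a minimal sketch of what "fine-tuning a small translation model" on the book's parallel sentences could look like with Hugging Face transformers; the checkpoint, data, and hyperparameters are illustrative assumptions, not the paper's recipe.)

```python
# Illustrative sketch only: fine-tune a small pretrained seq2seq MT model on
# parallel sentences extracted from the grammar book. Checkpoint, data and
# hyperparameters are assumptions, not the authors' actual setup.
from datasets import Dataset
from transformers import (AutoModelForSeq2SeqLM, AutoTokenizer,
                          DataCollatorForSeq2Seq, Seq2SeqTrainer,
                          Seq2SeqTrainingArguments)

checkpoint = "facebook/nllb-200-distilled-600M"  # small multilingual MT model (Kalamang has no NLLB code)
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint)

# Placeholder parallel data; in practice, Kalamang-English pairs from the book.
pairs = [{"src": "kalamang sentence ...", "tgt": "English sentence ..."}]
dataset = Dataset.from_list(pairs)

def preprocess(example):
    # Tokenize source and target sides for seq2seq training.
    return tokenizer(example["src"], text_target=example["tgt"],
                     truncation=True, max_length=128)

tokenized = dataset.map(preprocess, remove_columns=["src", "tgt"])

trainer = Seq2SeqTrainer(
    model=model,
    args=Seq2SeqTrainingArguments(output_dir="kalamang-mt",
                                  num_train_epochs=10,
                                  per_device_train_batch_size=8,
                                  learning_rate=3e-5),
    train_dataset=tokenized,
    data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
)
trainer.train()
```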
dr. jack morris @jxmnop ·
recently read one of the most interesting LLM papers i've ever read, the story goes something like this
> dutch PhD student/researcher Eline Visser lives on remote island in Indonesia for several years
> learns the Kalamang language, an oral language with only 100 native speakers
> she writes "The Grammar of Kalamang", a textbook on how to write in Kalamang
> since Kalamang is a spoken language only, TGOK is the only text on earth written in it
> so, there is no internet data in written Kalamang
> so, language models haven't read any Kalamang during training
> in the paper, researchers explore how to teach a language model a new language from a single book
> they evaluate various types of fine-tuning and prompting
> much to my chagrin, prompting wins (and it's not close)
> larger models & longer context windows help a lot
by the way, seems like humans still win at this task (for now)
33 replies · 201 reposts · 1.8K likes · 290.7K views
Seth Aycock @sethjsa ·
More generally, we suggest that data collection efforts for multilingual XLR (extremely low-resource) tasks like translation are best focused on parallel data over linguistic description, given its advantages in computational cost, token efficiency, and availability!
0 replies · 0 reposts · 6 likes · 332 views
Seth Aycock @sethjsa ·
Our results emphasise the importance of task-appropriate data for XLR languages: parallel data for translation, and grammatical data for linguistic tasks.
1 reply · 0 reposts · 4 likes · 345 views