Alon Talmor

233 posts

Alon Talmor banner
Alon Talmor

Alon Talmor

@AlonTalmor

Founder & CEO of Ask-AI, NLP PhD, Traveler.

Tel Aviv 가입일 Ağustos 2009
121 팔로잉663 팔로워
고정된 트윗
Alon Talmor
Alon Talmor@AlonTalmor·
Can we instantly correct pre-trained models by showing them natural language rules and facts? Can they systematically reason over implicit knowledge while doing so? arxiv.org/abs/2006.06609 New joint work with Oyvind Tafjord, Peter Clark, @yoavgo and @JonathanBerant suggests yes!
Alon Talmor tweet media
English
4
38
214
0
Alon Talmor 리트윗함
AK
AK@_akhaliq·
Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking paper page: huggingface.co/papers/2312.09… Reward models play a key role in aligning language model applications towards human preferences. However, this setup creates an incentive for the language model to exploit errors in the reward model to achieve high estimated reward, a phenomenon often termed reward hacking. A natural mitigation is to train an ensemble of reward models, aggregating over model outputs to obtain a more robust reward estimate. We explore the application of reward ensembles to alignment at both training time (through reinforcement learning) and inference time (through reranking). First, we show that reward models are underspecified: reward models that perform similarly in-distribution can yield very different rewards when used in alignment, due to distribution shift. Second, underspecification results in overoptimization, where alignment to one reward model does not improve reward as measured by another reward model trained on the same data. Third, overoptimization is mitigated by the use of reward ensembles, and ensembles that vary by their pretraining seeds lead to better generalization than ensembles that differ only by their fine-tuning seeds, with both outperforming individual reward models. However, even pretrain reward ensembles do not eliminate reward hacking: we show several qualitative reward hacking phenomena that are not mitigated by ensembling because all reward models in the ensemble exhibit similar error patterns.
AK tweet media
English
1
28
67
20.8K
Alon Talmor 리트윗함
Mor Geva
Mor Geva@megamor2·
LMs capture many factual associations, but how do they recall them internally during inference? In a new preprint, we find that LMs build attribute-rich subject representations, from which attention heads extract the predicted attribute. @jasmijnbastings @fajtak @amirgloberson 🧵
Mor Geva tweet media
English
3
89
437
60.8K
Alon Talmor 리트윗함
Ori Yoran
Ori Yoran@OriYoran·
A single chain-of-thought (CoT) may not be enough to answer complex questions, can an LLM meta-reason over multiple CoTs? In our new preprint, we show that meta-reasoning boosts performance on multi-hop QA datasets! Paper: arxiv.org/abs/2304.13007 🧵1/4
GIF
English
2
54
191
26.7K
Alon Talmor 리트윗함
Or Azulay
Or Azulay@orslimy·
שרשוראור מוצרי - איך כורים זהב מתמיכה? אחת מהטעויות ששמתי לב אליהן שמנהלי מוצר בתחילת דרכם עושים - הם לא יודעים איך לייצר ערוץ פידבק איכותי מפניות של לקוחות לתמיכה. חשוב לזכור שלקוחות יפנו אליכם (ישאירו ביקורת, או ישלחו הודעה) רק במקרי קיצון, כי זה דורש מהם זמן ואנרגיה, לדוג׳:
Or Azulay tweet media
עברית
1
1
4
2.3K
Alon Talmor 리트윗함
Mark Peter Davis
Mark Peter Davis@mpd·
On this week’s pod ep we’re combining a guest interview w/ the new partner meeting format. We start with the partner meeting segments then I chat w/ a guest. Hopefully a fun twist for everyone. This week's guest is @AlonTalmor, Founder & CEO of Ask-AI. ipl.vc/pmg1tm
English
2
8
13
0
Alon Talmor 리트윗함
Jonathan Berant
Jonathan Berant@JonathanBerant·
Full house at the Yandex distinguished lecture series as @YejinChoinka is starting her talk!
Jonathan Berant tweet mediaJonathan Berant tweet mediaJonathan Berant tweet media
English
1
8
87
0
Alon Talmor
Alon Talmor@AlonTalmor·
Excited to be speaking at #websummit2022 #breakout sessions 1st of Nov. 17:30PM and, 3rd of Nov. 9:45AM, as well as the panel "The socio-economic impacts of AI" at DeepTech stage, at 14:55 Nov 3rd. Come say hello!
Alon Talmor tweet media
English
0
3
6
0
Eynat Guez
Eynat Guez@EynatGuez·
מחשבות של לילה - הדבר הכי קשה בסטארטאפ בצמיחה זה להבין מה חווית הלקוח ברגע נתון, איפה בתהליכים הוא חווה תסכול או שירות לא טוב. תמיד צריך להשאר מחוברים למקום הזה, זה קשה כי בסוף יש עוד הרבה אנשים בדרך.הטריק שלי זה לקרוא מיילים של לקוחות שמגיעים לתמיכה ולנתח מהם את החוויה לאחור.
עברית
29
3
186
0
Alon Talmor
Alon Talmor@AlonTalmor·
@EynatGuez כחברה שמתמחה במתן תשובות וניתוח תקשורות לקוח ממקורות מרובים, הבעיה שתארת היא במרכז הפתרון שבנינו. בעזרת NLP אנחנו מנתחים טקסט ממקורות כמו טיקטים, CSATים, שיחות עם לקוחות כדי להציף בעיות שחוזרות עצמן לא כאוסף טגיות אלא כמשפט שלם בשפה טבעית. Ask-AI.com (נדב נאמן מכיר)
עברית
0
1
7
0
Alon Talmor 리트윗함