Francisco Girbal Eiras

112 posts

Francisco Girbal Eiras

Francisco Girbal Eiras

@fgirbal

PhD student in Trustworthy and Safe ML at @AIMS_Oxford @OxfordTVG (@UniofOxford), ex-@_FiveAI

Oxford, UK Katılım Eylül 2012
350 Takip Edilen282 Takipçiler
Francisco Girbal Eiras retweetledi
Dan Hendrycks
Dan Hendrycks@hendrycks·
Improving AI's academic abilities may not markedly improve user experience as in the past. In LMSYS rankings, GPT-4o and GPT-4o mini rank 6th and 7th, despite a large academic gap (MMLU: 88.7% v. 82%). o1 may have underwhelmed because most people can't appreciate Olympiad skills.
English
17
4
137
16.5K
Francisco Girbal Eiras
Francisco Girbal Eiras@fgirbal·
#ICML2024 was great! 🇦🇹 I had the opportunity to present our oral position paper on open-source Gen AI! And it was fantastic to connect with so many talented people working on important problems related to trustworthy and safe ML 🥳
Francisco Girbal Eiras tweet mediaFrancisco Girbal Eiras tweet media
English
1
0
10
541
Francisco Girbal Eiras retweetledi
Adel Bibi
Adel Bibi@Adel_Bibi·
@fgirbal and @AleksPPetrov presenting our new Open Source position paper in #ICML24. The paper is accepted as Oral and the talk is today at 4.30PM in Hall A1.
Adel Bibi tweet media
English
1
3
10
589
Francisco Girbal Eiras
Francisco Girbal Eiras@fgirbal·
I'm excited to be in beautiful Vienna 🇦🇹 for #icml2024! 👋 Let's grab a coffee and chat about trustworthy and safe AI 👇 See below the details of the 3 papers I'm presenting
English
1
0
6
670
Francisco Girbal Eiras
Francisco Girbal Eiras@fgirbal·
The impact GenAI will have is highly dependent on the ability to open-source these models. Do the benefits provided outweigh the marginal risks incurred? 👉 Our #icml2024 Oral position paper argues they overwhelmingly do! 🗓️ Drop by Oral 2B (Hall A1) on 23rd of July @ 5pm.
Francisco Girbal Eiras tweet media
English
1
8
19
4.6K
Francisco Girbal Eiras
Francisco Girbal Eiras@fgirbal·
After winning an Outstanding Paper Award at the 2nd Workshop of Formal Verification for ML at #icml2023 (📷 🥂), I am excited to present our work "Efficient Error Certification for Physics-Informed Neural Networks" as a poster at #icml2024! 🔗 Paper: arxiv.org/abs/2305.10157
Francisco Girbal Eiras tweet media
English
7
0
16
540
Francisco Girbal Eiras
Francisco Girbal Eiras@fgirbal·
To mitigate it, we propose during fine-tuning to mix-in safety data that mimics the format/style of the user data by paraphrasing it using an LLM. We show this simple strategy reduces harmfulness while maintaining task performance more consistently than existing baselines!
Francisco Girbal Eiras tweet media
English
1
0
1
141
Francisco Girbal Eiras
Francisco Girbal Eiras@fgirbal·
📈 Task-specific fine-tuning allows LLMs to solve tasks more efficiently. ❌ Recent work shows fine-tuning on benign/adversarial benign-looking instruction-following data increases harmfulness. 🤔 Does this happen in the task-specific setting? If so, how can we mitigate it? 🧵
GIF
English
1
4
21
2.9K
Francisco Girbal Eiras retweetledi
Jishnu Mukhoti
Jishnu Mukhoti@JishnuMukhoti·
Wanna know how finetuning makes a foundation model forget and how to avoid it? Have a look at our paper accepted @TmlrOrg! 📜 Paper: arxiv.org/abs/2308.13320 Work done with amazing co-authors: @yaringal @philiptorr @puneetdokania @OATML_Oxford @OxfordTVG!
Puneet Dokania@puneetdokania

Finally, after ~3.5 months, 6 reviewers -our paper is accepted 2 @TmlrOrg ⭐️Fine-tuning can cripple your foundation model; preserving features may be the solution: arxiv.org/abs/2308.13320 Last paper of our ex-student @JishnuMukhoti as part f his PhD 👏 @yaringal @OxfordTVG

English
0
11
27
6.5K