Ajay Patel

15 posts

Ajay Patel

Ajay Patel

@ajayp95

Current: ML PhD @ University of Pennsylvania Prev: Founder at Plasticity (YCS17, acq. 2020)

Santa Clara, California Katılım Aralık 2011
46 Takip Edilen15 Takipçiler
Ajay Patel retweetledi
AK
AK@_akhaliq·
Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation
AK tweet media
English
3
14
97
14.9K
Ajay Patel retweetledi
Andrea Soria Jimenez
Andrea Soria Jimenez@andrejanysa·
🚀 Synthetic data is revolutionizing AI & ML! DataDreamer, an open-source Python library, makes generating synthetic data seamless & integrates effortlessly with @huggingface . Easily push datasets to the Hub and share them with the community 🔍 Learn how: #6790671e20a7d3ca6f72b6cb" target="_blank" rel="nofollow noopener">huggingface.co/blog/asoria/da…
Andrea Soria Jimenez tweet media
English
1
9
28
1.5K
Ajay Patel retweetledi
Luca Soldaini 🎀
Luca Soldaini 🎀@soldni·
Olmo goes multimodal! We are launching Molmo, a open family of multimodal models that rival the best closed VLMs out there 🤯 We spent the last 9 months meticulously curating PixMo, a dataset of (a) high-quality image-caption pairs and (b) multimodal instruction data.
Luca Soldaini 🎀 tweet media
English
21
165
991
89.7K
Ajay Patel retweetledi
Sepp Hochreiter
Sepp Hochreiter@HochreiterSepp·
New exciting research by @DinuMariusC with @ajayp95 (U of Pennsylvania) and @ExtensityAI. We show LLM self-improvement with synthetic data for web agent tasks on WebArena, and introduce an extended VERTEX score for measuring the trajectory quality of agent workflows.
Marius-Constantin Dinu@DinuMariusC

Excited to present our work “Large Language Models Can Self-Improve At Web Agent Tasks”. We show that synthetic data self-improvement boosts task completion by 31% on WebArena and introduce quality metrics for measuring autonomous agent workflows. #AI #MachineLearning #LLMs [1/n]

English
1
11
47
6.9K
Ajay Patel retweetledi
Marius-Constantin Dinu
Marius-Constantin Dinu@DinuMariusC·
Excited to present our work “Large Language Models Can Self-Improve At Web Agent Tasks”. We show that synthetic data self-improvement boosts task completion by 31% on WebArena and introduce quality metrics for measuring autonomous agent workflows. #AI #MachineLearning #LLMs [1/n]
Marius-Constantin Dinu tweet media
English
5
19
69
13.6K
Ajay Patel retweetledi
Tanishq Mathew Abraham, Ph.D.
Tanishq Mathew Abraham, Ph.D.@iScienceLuvr·
Large Language Models Can Self-Improve At Web Agent Tasks abs: arxiv.org/abs/2405.20309 "We explore fine-tuning on three distinct synthetic training data mixtures and achieve a 31% improvement in task completion rate over the base model on the WebArena benchmark through a self-improvement procedure."
Tanishq Mathew Abraham, Ph.D. tweet media
English
6
54
235
21.2K
Ajay Patel retweetledi
AK
AK@_akhaliq·
DataDreamer A Tool for Synthetic Data Generation and Reproducible LLM Workflows Large language models (LLMs) have become a dominant and important tool for NLP researchers in a wide range of tasks. Today, many researchers use LLMs in synthetic data generation, task evaluation, fine-tuning, distillation, and other model-in-the-loop research workflows. However, challenges arise when using these models that stem from their scale, their closed source nature, and the lack of standardized tooling for these new and emerging workflows. The rapid rise to prominence of these models and these unique challenges has had immediate adverse impacts on open science and on the reproducibility of work that uses them. In this paper, we introduce DataDreamer, an open source Python library that allows researchers to write simple code to implement powerful LLM workflows. DataDreamer also helps researchers adhere to best practices that we propose to encourage open science and reproducibility.
AK tweet media
English
4
33
150
23.3K
Ajay Patel retweetledi
Bryan Li
Bryan Li@bryanlics·
Are GPT-style LMs best for prompting🤔? Our work shows maybe not! Catch us at the poster for "Bidirectional Language Models are Also Few-Shot Learners" (joint w/ @ajayp95, @colinraffel ) in person at #ICLR2023 in Kigali May 3, 11:30-1:30 PM #162 or arxiv.org/abs/2209.14500
English
0
3
13
649
Ajay Patel retweetledi
UPenn NLP
UPenn NLP@upennnlp·
Bidirectional LMs like T5 learn superior representations, but the field mostly trains unidirectional LMs like GPT-3 since the "emergent" property of prompting was never seen in T5. We show that T5 can be prompted, outperforming GPT-3 with 50% fewer params. arxiv.org/abs/2209.14500
English
2
36
279
36.2K