Ajay Patel

15 posts

Ajay Patel

@ajayp95

Current: ML PhD @ University of Pennsylvania Prev: Founder at Plasticity (YCS17, acq. 2020)

Santa Clara, California Katılım Aralık 2011

46 Takip Edilen15 Takipçiler

Ajay Patel retweetledi

Yue Yang@YueYangAI·24 Şub

We share Code-Guided Synthetic Data Generation: using LLM-generated code to create multimodal datasets for text-rich images, such as charts📊, documents📄, etc., to enhance Vision-Language Models. Website: yueyang1996.github.io/cosyn/ Dataset: huggingface.co/datasets/allen… Paper: arxiv.org/pdf/2502.14846 Code: github.com/allenai/pixmo-…

English

194

23.1K

Ajay Patel retweetledi

AK@_akhaliq·21 Şub

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

English

14.9K

Ajay Patel retweetledi

Andrea Soria Jimenez@andrejanysa·23 Oca

🚀 Synthetic data is revolutionizing AI & ML! DataDreamer, an open-source Python library, makes generating synthetic data seamless & integrates effortlessly with @huggingface . Easily push datasets to the Hub and share them with the community 🔍 Learn how: #6790671e20a7d3ca6f72b6cb" target="_blank" rel="nofollow noopener">huggingface.co/blog/asoria/da…

English

1.5K

Ajay Patel retweetledi

Zachary Horvitz@zachary_horvitz·14 Kas

I'm at #EMNLP2024 presenting ✨TinyStyler✨, an efficient, effective, and fast method for few-shot text style transfer! Paper: aclanthology.org/2024.findings-… Demo: huggingface.co/spaces/tinysty… Code: github.com/zacharyhorvitz…

English

2.3K

Ajay Patel retweetledi

Luca Soldaini 🎀@soldni·25 Eyl

Olmo goes multimodal! We are launching Molmo, a open family of multimodal models that rival the best closed VLMs out there 🤯 We spent the last 9 months meticulously curating PixMo, a dataset of (a) high-quality image-caption pairs and (b) multimodal instruction data.

English

165

991

89.7K

Ajay Patel retweetledi

Duncan Watts@duncanjwatts·25 Haz

Very nice coverage of @csspenn's just-launched Media Bias Detector. I'm very excited about this project, which has been a Herculean team effort! ai.seas.upenn.edu/news/mapping-m… mediabiasdetector.seas.upenn.edu

English

5.7K

Ajay Patel retweetledi

Sepp Hochreiter@HochreiterSepp·31 May

New exciting research by @DinuMariusC with @ajayp95 (U of Pennsylvania) and @ExtensityAI. We show LLM self-improvement with synthetic data for web agent tasks on WebArena, and introduce an extended VERTEX score for measuring the trajectory quality of agent workflows.

Marius-Constantin Dinu@DinuMariusC

Excited to present our work “Large Language Models Can Self-Improve At Web Agent Tasks”. We show that synthetic data self-improvement boosts task completion by 31% on WebArena and introduce quality metrics for measuring autonomous agent workflows. #AI #MachineLearning #LLMs [1/n]

English

6.9K

Ajay Patel retweetledi

Marius-Constantin Dinu@DinuMariusC·31 May

English

13.6K

Ajay Patel retweetledi

Tanishq Mathew Abraham, Ph.D.@iScienceLuvr·31 May

Large Language Models Can Self-Improve At Web Agent Tasks abs: arxiv.org/abs/2405.20309 "We explore fine-tuning on three distinct synthetic training data mixtures and achieve a 31% improvement in task completion rate over the base model on the WebArena benchmark through a self-improvement procedure."

Tanishq Mathew Abraham, Ph.D. tweet media

English

235

21.2K

Ajay Patel retweetledi

AK@_akhaliq·19 Şub

paper page: huggingface.co/papers/2402.10…

English

7.8K

Ajay Patel retweetledi

AK@_akhaliq·19 Şub

DataDreamer A Tool for Synthetic Data Generation and Reproducible LLM Workflows Large language models (LLMs) have become a dominant and important tool for NLP researchers in a wide range of tasks. Today, many researchers use LLMs in synthetic data generation, task evaluation, fine-tuning, distillation, and other model-in-the-loop research workflows. However, challenges arise when using these models that stem from their scale, their closed source nature, and the lack of standardized tooling for these new and emerging workflows. The rapid rise to prominence of these models and these unique challenges has had immediate adverse impacts on open science and on the reproducibility of work that uses them. In this paper, we introduce DataDreamer, an open source Python library that allows researchers to write simple code to implement powerful LLM workflows. DataDreamer also helps researchers adhere to best practices that we propose to encourage open science and reproducibility.

English

150

23.3K

Ajay Patel retweetledi

Aran Komatsuzaki@arankomatsuzaki·19 Şub

DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows repo: github.com/datadreamer-de… abs: arxiv.org/abs/2402.10379

English

254

20.1K

Ajay Patel retweetledi

Bryan Li@bryanlics·2 May

Are GPT-style LMs best for prompting🤔? Our work shows maybe not! Catch us at the poster for "Bidirectional Language Models are Also Few-Shot Learners" (joint w/ @ajayp95, @colinraffel ) in person at #ICLR2023 in Kigali May 3, 11:30-1:30 PM #162 or arxiv.org/abs/2209.14500

English

649

Ajay Patel retweetledi

UPenn NLP@upennnlp·8 Şub

Work done by Ajay Patel, @bryanlics, and @ccb from @upennnlp with collaborators @colinraffel @noahconst and @rasoolims

English

1.3K

Ajay Patel retweetledi

UPenn NLP@upennnlp·8 Şub

Bidirectional LMs like T5 learn superior representations, but the field mostly trains unidirectional LMs like GPT-3 since the "emergent" property of prompting was never seen in T5. We show that T5 can be prompted, outperforming GPT-3 with 50% fewer params. arxiv.org/abs/2209.14500

English

279

36.2K

Keşfet

@huggingface @csspenn @DinuMariusC @ExtensityAI @bryanlics @upennnlp @noahconst @rasoolims