Carlos Guestrin

194 posts

@guestrin

@Stanford Prof. National Acad of Eng. Chief Sci @ Visual Layer & Virtue AI. Frm Sr Dir AI @Apple. Co-author of XGBoost, LIME, TextGrad, Alpaca, TVM, GraphLab.

Stanford
Joined November 2009
15 Following · 8.9K Followers
Carlos Guestrin @guestrin
With SDPO, you can now do RL with natural language feedback, like error messages from coding environments or LLMs as judges. You can achieve huge gains over GRPO with scalar rewards!
Jonas Hübotter @jonashubotter

Training LLMs with verifiable rewards uses a 1-bit signal per generated response. This hides why the model failed. Today, we introduce a simple algorithm that enables the model to learn from any rich feedback and then turns it into dense supervision. (1/n)

Carlos Guestrin retweeted
Ronak Malde @rronak_
I don't think people have realized how crazy the results are from this new TTT + RL paper from Stanford/Nvidia. Training an open source model, they:
- Beat DeepMind AlphaEvolve, discovered a new upper bound for Erdős's minimum overlap problem
- Developed new A100 GPU kernels 2x faster than the best human kernel
- Outperformed the best AI coding attempt and human attempt on AtCoder

The idea of Test Time Training is to train a model *while* it's iteratively trying to solve a task. Combining this with RL, as they do in this paper, opens up the floodgates of possibilities for continual learning.

Authors: @mertyuksekgonul @LeoXinhaoLee @JedMcCaleb @xiaolonw @jankautz @YejinChoinka @james_y_zou @guestrin @sun_yu_
Carlos Guestrin @guestrin
The SAIL Postdoctoral Fellowship is a fantastic opportunity for recent PhDs to do their most innovative and impactful AI research, while being part of a highly collaborative and welcoming environment at Stanford.
Stanford AI Lab @StanfordAILab

Stanford AI Lab (SAIL) is excited to announce new SAIL Postdoctoral Fellowships! We are looking for outstanding candidates excited to advance the frontiers of AI with our professors and vibrant community. Applications received by the end of April 30 will receive full consideration: ai.stanford.edu/postdoctoralfe…

Carlos Guestrin @guestrin
We are super excited to empower developers to focus on their goal of building innovative AI applications; we’ll take care of safety and security! What an awesome ride with Bo Li @uiuc_aisecure, @sanmikoyejo, @dawnsongtweets and the whole @VirtueAI_co team!
Virtue AI @VirtueAI_co

We’ve raised $30M in Seed + Series A funding led by @lightspeedvp and Walden Catalyst Ventures, with participation from Prosperity7 Ventures, Factory, Osage University Partners (OUP), Lip-Bu Tan, Chris Re, and more. Virtue AI is the first unified platform for securing AI across red teaming, guardrails, and agents. We’re proud to already support forward-thinking companies like @Uber, @glean, and several frontier labs. This investment will help us continue improving our platform, scale agentic workflows, build out integrated enterprise use cases, and grow our team. Grateful to our customers, our team, and our investors for the support.

Carlos Guestrin retweeted
Liana @lianapatel_
Excited to share our release of LOTUS 1.1.0, which now makes it easier than ever to get started with LLM-powered data processing over a variety of custom knowledge sources. Seamlessly connect to data from the web, your SQL databases, vector databases and more. And process it all with the power of semantic operators. github.com/lotus-data/lot…
Carlos Guestrin retweeted
Stanford AI Lab @StanfordAILab
SAIL is delighted to announce Carlos @guestrin, the Fortinet Founders Professor of Computer Science, as the next Director of @StanfordAILab. Carlos is a talented researcher and leader, known for his work on explainability, graphs, compilation, and boosted trees in AI.
Carlos Guestrin retweeted
Liana @lianapatel_
We've been building LOTUS at Stanford and Berkeley to make LLM-powered data processing fast, easy and declarative. LOTUS is an open-source query engine that makes programming as easy as writing Pandas and optimizes your programs for up to 400x speedups. To celebrate the holidays, we're excited to share our release of LOTUS 1.0.0 with a batch of new updates that make reasoning over your data faster, easier and better than ever! Code: github.com/guestrin-lab/l… 🧵👇
Carlos Guestrin retweeted
Luke Bailey @LukeBailey181
Can interpretability help defend LLMs? We find we can reshape activations while preserving a model’s behavior. This lets us attack latent-space defenses, from SAEs and probes to Circuit Breakers. We can attack so precisely that we make a harmfulness probe output this QR code. 🧵
Carlos Guestrin retweeted
Teddi Worledge @TeddiWorledge
🧵 LLMs are great at synthesizing info, but unreliable at citing sources. Search engines are the opposite. What lies between them? Our new paper runs human evals on 7 systems across the ✨extractive-abstractive spectrum✨ for utility, citation quality, time-to-verify, & fluency!
Carlos Guestrin retweeted
Nicole Meister @nicole__meister
Prior work has used LLMs to simulate survey responses, yet their ability to match the distribution of views remains uncertain. Our new paper [arxiv.org/pdf/2411.05403] introduces a benchmark to evaluate how distributionally aligned LLMs are with human opinions. 🧵
Carlos Guestrin retweeted
Irena Gao @irena_gao
Many providers offer inference APIs for the same models: for example, there were over nine Llama-3 8B APIs in Summer 2024. Do all of these APIs serve the same completion distribution as the original model? In our new paper, ✨Model Equality Testing: Which Model is This API Serving?✨, we formalize this question as a two-sample distribution testing problem: the user collects samples for their task from the API and a reference distribution, and conducts a statistical test to see if the two distributions are the same. We design tests which show nontrivial power in detecting when models have been quantized, watermarked, or finetuned! (🧵 1/5)
Carlos Guestrin retweeted
Luis Ceze @luisceze
It’s happening! TVM Conference. Incredibly proud of and grateful to the whole UW SAMPL team and TVM contributors.