Casey A. Fitzpatrick

357 posts

Casey A. Fitzpatrick

@caseyfitz

@ContextualAI MTS | Quantum Information PhD

Bay Area Katılım Haziran 2011

124 Takip Edilen219 Takipçiler

Sabitlenmiş Tweet

Casey A. Fitzpatrick@caseyfitz·1 Ağu

Hard to believe it’s barely been a year since @douwekiela called to order our very first all hands, then @apsdehal and I spent hours in a tiny room with a whiteboard laying out the technical vision for what we were about to do. So proud of the team we’ve built and what’s to come!

Contextual AI@ContextualAI

We’re excited to share today that we’ve raised $80M in Series A funding to accelerate our mission to change the way the world works through AI. Read more at our blogpost: contextual.ai/news/announcin…

English

1.6K

Casey A. Fitzpatrick@caseyfitz·11 Mar

Another great example of how we frame the problems of enterprise ai from a systems perspective... and this component is a key player 😎 (which is also SOTA by itself 🙌

Douwe Kiela@douwekiela

AI struggles with messy, conflicting, ever-changing data. Today's AI ranking methods can't prioritize clearly, because they lack human guidance. Introducing the world's first instruction-following, SOTA reranker! Give our reranker instructions to control exactly how it ranks: • “Prioritize recent documents” • “Prefer PDFs over other sources” • “The boss is always right” Can’t wait to see what people build with it!

English

111

Casey A. Fitzpatrick@caseyfitz·11 Mar

@douwekiela Psyched to show the world a little bit more about how we're tackling real problems that will unblock AI's true potential.

English

238

Douwe Kiela@douwekiela·11 Mar

English

503

210K

Casey A. Fitzpatrick@caseyfitz·4 Eyl

Here, have a truly open MoE 🤲. More amazing work by @Muennighoff! Tons to dive into here.

Niklas Muennighoff@Muennighoff

Releasing OLMoE - the first good Mixture-of-Experts LLM that's 100% open-source - 1B active, 7B total params for 5T tokens - Best small LLM & matches more costly ones like Gemma, Llama - Open Model/Data/Code/Logs + lots of analysis & experiments 📜arxiv.org/abs/2409.02060 🧵1/9

English

550

Casey A. Fitzpatrick retweetledi

Stas Bekman@StasBekman·18 Tem

Here is a new Machine Learning Engineering chapter: Network debug github.com/stas00/ml-engi… The intention is to help non-network engineers to figure out how to resolve common problems around multi-gpu and multi-node collectives networking - it's heavily NCCL-biased at the moment. Will extend with RCCL and others when I get access to those. Your feedback and corrections are always welcome.

English

172

9.4K

Casey A. Fitzpatrick@caseyfitz·13 Haz

ty @charles_irl and gang

Stas Bekman@StasBekman

The kind folks from @modal_labs have just shared with me this 10-100x faster drop in replacement for pip github.com/astral-sh/uv If you want a much faster CI startup switch to uv now! To use you just add `uv` before `pip` and everything else is the same, so: pip install uv uv pip install -e . uv pip install torch uv pip compile ... etc.

English

143

Casey A. Fitzpatrick@caseyfitz·4 Haz

@hugobowne @sh_reya @VanishingData Haha with none other than the legendary @charles_irl in the mix. Small world.

English

Hugo Bowne-Anderson@hugobowne·4 Haz

@caseyfitz @sh_reya @VanishingData thanks, Casey! We also have this one coming up, which may interest you. Hope all is well :) lu.ma/e8huz3s6?utm_s…

English

Hugo Bowne-Anderson@hugobowne·3 Haz

I'll be doing a livestream w/ @sh_reya (UC Berkeley) for @VanishingData about designing human-in-the-loop interfaces for working with GenAI and LLM systems, with a focus on evaluation (but covering lots of other fun stuff!). Sign up here 👇 lu.ma/zz3qic45?utm_s… 1/

English

12.8K

Casey A. Fitzpatrick retweetledi

Douwe Kiela@douwekiela·9 Nis

Maximizing expected human utility (i.e., KTO) is the natural way to do alignment. Cool to see how well this works even in diffusion models.

AK@_akhaliq

Aligning Diffusion Models by Optimizing Human Utility We present Diffusion-KTO, a novel approach for aligning text-to-image diffusion models by formulating the alignment objective as the maximization of expected human utility. Since this objective applies to each

English

5.9K

Casey A. Fitzpatrick retweetledi

sarah guo@saranormous·21 Mar

from @douwekiela (a pioneer of Retrieval-Augmented Generation) on the value of end-to-end co-optimization of systems (language models + retrievers):

Contextual AI@ContextualAI

Today, we’re excited to announce RAG 2.0, our end-to-end system for developing production-grade AI. Using RAG 2.0, we’ve created Contextual Language Models (CLMs), which achieve state-of-the-art performance on a variety of industry benchmarks. CLMs outperform strong RAG baselines built using GPT-4 and top open-source models like Mixtral, according to our research and customers. Read more in our blog post: rag2.ai

English

11.1K

Casey A. Fitzpatrick@caseyfitz·20 Mar

😎

Nathan Lambert@natolambert

contextual . onlybangers . ai

ART

118

Casey A. Fitzpatrick@caseyfitz·19 Mar

🤩

Omar Khattab@lateinteraction

Not everyone gets to call their new system RAG 2.0 — exciting announcement by the @ContextualAI team, who have been thinking about these problems for many years.

ART

120

Casey A. Fitzpatrick@caseyfitz·19 Mar

rip FRAG 🪦 we'll never forget the many meetup demos

Aaref Hilaly@aaref

2023: RAG vs no RAG 2024: RAG 2.0 vs frozen RAG As AI moves into production, simple RAG systems are not enough. @douwekiela & @ContextualAI show that with data.

English

Casey A. Fitzpatrick@caseyfitz·19 Mar

Last year I joined Contextual AI to focus on designing and building production-grade AI systems, from first principles, focused on real world workflows not demos. Today I'm excited to share some of what we've done so far!

Contextual AI@ContextualAI

English

280

Casey A. Fitzpatrick@caseyfitz·7 Mar

👋hiiii we’re here to play!

Winnie Xu@winniethexu

Excited to share a new model with @ContextualAI that tops the AlpacaEval 2.0 leaderboard! How did we manage to rank higher than models like GPT4, Claude 3 and Mistral Medium? Enter iterative alignment… 🧵

English

127

Casey A. Fitzpatrick retweetledi

Kawin Ethayarajh@ethayarajh·2 Mar

The Orca-Math paper does a comparison of DPO and KTO for mathematical reasoning, finding that KTO is slightly better when all data is used and 25+ pts better when you have fewer positive examples than negative examples.

AK@_akhaliq

Microsoft presents Orca-Math Unlocking the potential of SLMs in Grade School Math Mathematical word problem-solving has long been recognized as a complex task for small language models (SLMs). A recent study hypothesized that the smallest model size, needed to achieve over 80% accuracy on the GSM8K benchmark, is 34 billion parameters. To reach this level of performance with smaller models, researcher often train SLMs to generate Python code or use tools to help avoid calculation errors. Additionally, they employ ensembling, where outputs of up to 100 model runs are combined to arrive at a more accurate result. Result selection is done using consensus, majority vote or a separate a verifier model used in conjunction with the SLM. Ensembling provides a substantial boost in accuracy but at a significant cost increase with multiple calls to the model (e.g., Phi-GSM uses top-48 to boost the performance from 68.2 to 81.5). In this work, we present Orca-Math, a 7-billion-parameter SLM based on the Mistral-7B, which achieves 86.81% on GSM8k without the need for multiple model calls or the use of verifiers, code execution or any other external tools. Our approach has the following key elements: (1) A high quality synthetic dataset of 200K math problems created using a multi-agent setup where agents collaborate to create the data, (2) An iterative learning techniques that enables the SLM to practice solving problems, receive feedback on its solutions and learn from preference pairs incorporating the SLM solutions and the feedback. When trained with Supervised Fine-Tuning alone, Orca-Math achieves 81.50% on GSM8k pass @1 metric. With iterative preference learning, Orca-Math achieves 86.81% pass@1. Orca-Math surpasses the performance of significantly larger models such as LLAMA-2-70B, WizardMath-70B, Gemini-Pro, ChatGPT-3.5. It also significantly outperforms other smaller models while using much smaller data (hundreds of thousands vs. millions of problems).

English

120

33.1K

Casey A. Fitzpatrick retweetledi

Omar Khattab@lateinteraction·31 Oca

I'm glad that a lot more people understand the key ideas behind ColBERT and DSPy now. My only remaining goal is to make sure people can also say them correctly; both are quite tricky😆 * Col-BAIR (it's "the late" interaction retriever, get it?) * Dee-Ess-Pie (like num-pie)

English

185

33.8K

Casey A. Fitzpatrick@caseyfitz·9 Ara

😊 🙏

Andrew Carr 🤸@andrew_n_carr

I know everyone is excited about Mixtral and the new Hyena models - but @ContextualAI just dropped a pile of cool new models and a new alignment framework contextual.ai/better-cheaper…

ART

164

Casey A. Fitzpatrick retweetledi

fraser@Fraser·15 Eki

I believe strongly that: 1) The best products that will emerge from this moment are “full stack”, with teams training their own models, and the models & UI informing one another. 2) This requires researchers who care deeply about what’s best for the product, including data

English

263

85.3K

Casey A. Fitzpatrick@caseyfitz·16 Eki

🌠🌠🌠

Contextual AI@ContextualAI

We're proud to be a 2023 #IA40 Intelligent Applications Rising Star winner!

ART

Keşfet

@douwekiela @Muennighoff @charles_irl @hugobowne @sh_reya @VanishingData @elonmusk @BarackObama