Ai2
3.7K posts

Ai2
@allen_ai
Breakthrough AI to solve the world's biggest problems. › Join us: https://t.co/MjUpZpKPXJ › Newsletter: https://t.co/k9gGznstwj
Seattle, WA · Joined September 2015
435 Following · 84.3K Followers
Ai2 retweeted

Really amazing results analyzing what's creative/novel vs. what's copied from Internet data, enabled by the amazing @liujc1998's Infini-gram! infini-gram.io
This is also enabled in @allen_ai's OlmoTrace allenai.org/blog/olmotrace where anyone can find matching n-grams between LLM-generated text and its training data.
Ai2 retweeted

📢 Save the date!
📌 AI Workflows for Food Systems Research: A Demonstration of AutoDiscovery with the Ai2 Asta Team.
📆 June 4, 9:30 – 10:30 am EDT
💬 Ruben Lozano Aguilera & Bodhisattwa Majumder (@allen_ai); Leroy Mwanzia (@Cipotato); @EliotJonesGarc1 (IFPRI).
🎫 ifpri.zoom.us/webinar/regist…
@CGIAR

Ai2 retweeted

Very cool to see these conversations happening! This is what openness enables. The "tool that allows you to trace those n-grams directly to their source," is infinigram, AKA OlmoTrace from @allen_ai, created by @liujc1998. x.com/alexolegimas/s…

We're releasing a dataset of 14K HuggingFace models, datasets, papers, & codebases linked by 51K evaluations, fine-tunings, & references, plus the ArtifactLinker code.
We hope it helps others find SOTA eval results.
💻 Code: github.com/allenai/artifa…
📊 Data: huggingface.co/datasets/lwaek…

@huggingface Using ArtifactLinker, we found cases where a strong model had never been evaluated on a benchmark where it would set, or nearly match, the SOTA.
We also found that newer LLMs like Gemma often lose to older DeBERTa models on natural language inference tasks.

Most models are only evaluated on a fraction of the benchmarks out there.
ArtifactLinker, our new system, predicts which ones would set a new state-of-the-art on benchmarks hosted on @HuggingFace, then runs the evaluation to verify. 🧵


We release open models so they're available to builders working on problems that matter to them. On Global Accessibility Awareness Day, PointCheck is a fitting example.
Read more ↓ allenai.org/blog/global-ac…

Available now in the same sizes as v1: Nano, Tiny, Base. Open weights, open training code.
If you're running v1 and v1.1 works for your task, expect significant speedups during fine-tuning & inference.
🤗 Models: huggingface.co/collections/al…
🔗 Blog: allenai.org/blog/olmoearth…

@GoogleResearch @nvidia We’re releasing the first-phase AIMIP dataset + our analysis of it. We hope to continue AIMIP with future phases that expand its scope & scale.
📘 Learn more in our blog: allenai.org/blog/AIMIP
📊 Paper: arxiv.org/abs/2605.06944
🗂️ Dataset: github.com/ai2cm/AIMIP/tr…

@GoogleResearch @nvidia We also tested the models on harder scenarios, such as a rapidly warming ocean unlike the conditions seen in training.
In those tests, the models diverged much more, showing that generalization remains a major challenge.






