Yuning Mao

36 posts

@yuning_pro

TBD @AIatMeta. Agents, UIGen, Code Generation. Post-training since Llama 2 https://t.co/OUjOWWA8kL

Moon · Joined June 2021
203 Following · 337 Followers
Yuning Mao retweeted
Arena.ai (@arena)
Muse Spark debuts at #7 in the Code Arena - making @AIatMeta the #3 lab right behind @AnthropicAI’s Claude Sonnet 4.6 and @Zai_org’s GLM-5.1, surpassing Gemini-3.1-Pro and GPT-5.4. Code Arena evaluates agentic coding on real-world tasks - building live websites and apps, ranked by users on real workflows. Huge congrats to @AIatMeta on this impressive milestone!
AI at Meta (@AIatMeta)

Introducing Muse Spark, the first in the Muse family of models developed by Meta Superintelligence Labs. Muse Spark is a natively multimodal reasoning model with support for tool-use, visual chain of thought, and multi-agent orchestration. Muse Spark is available today at meta.ai and the Meta AI app. We’re also making it available in private preview via API to select partners, and we hope to open-source future versions of the model. Learn more: go.meta.me/43ea00

Replies 19 · Retweets 33 · Likes 346 · Views 70.7K
Yuning Mao retweeted
Design Arena (@Designarena)
BREAKING: Muse Spark by @Meta is #7 on SVG Arena with an Elo of 1301! This is in the same performance band as Claude Opus 4.6 by @AnthropicAI and GLM 5 Turbo by @Zai_org. Congrats to the @AIatMeta team on the launch!
Replies 3 · Retweets 5 · Likes 156 · Views 13.6K
Yuning Mao retweeted
Design Arena (@Designarena)
BREAKING: Muse Spark by @Meta is #6 overall on Design Arena with an Elo of 1324! This is the single biggest improvement we've seen on Design Arena to date, with a jump of 103 positions and 374 Elo points. Huge congrats to the @Meta team on the launch!
Replies 7 · Retweets 16 · Likes 262 · Views 27.3K
Yuning Mao retweeted
Leon Lin (@LexnLin)
I tested Muse Spark with my nature-themed website oneshot test. @alexandr_wang and team cooked. Prompt is below :)
Replies 8 · Retweets 9 · Likes 145 · Views 18.1K
Yuning Mao retweeted
Flavio Adamo (@flavioAd)
After MORE THAN A YEAR, Meta finally released a model that passes the Hexagon Test and I’m not gonna lie this is weirdly emotional 🥹 sorry guys but history had to be documented!!
AI at Meta (@AIatMeta)

Introducing Muse Spark, the first in the Muse family of models developed by Meta Superintelligence Labs. Muse Spark is a natively multimodal reasoning model with support for tool-use, visual chain of thought, and multi-agent orchestration. Muse Spark is available today at meta.ai and the Meta AI app. We’re also making it available in private preview via API to select partners, and we hope to open-source future versions of the model. Learn more: go.meta.me/43ea00

Replies 23 · Retweets 21 · Likes 732 · Views 169.8K
Yuning Mao retweeted
Chris (@Chrisgpt)
Meta Muse: “Make a flappy bird clone” One shot in only a couple minutes in canvas not bad at all!
Replies 9 · Retweets 12 · Likes 122 · Views 39.9K
Yuning Mao retweeted
Pietro Schirano (@skirano)
The new model from Meta, Muse Spark, is pretty good at converting images to code!
Replies 28 · Retweets 53 · Likes 1.2K · Views 125.2K
Yuning Mao retweeted
Alexandr Wang (@alexandr_wang)
1/ today we're releasing muse spark, the first model from MSL. nine months ago we rebuilt our ai stack from scratch. new infrastructure, new architecture, new data pipelines. muse spark is the result of that work, and now it powers meta ai. 🧵
Replies 729 · Retweets 1.2K · Likes 10.4K · Views 4.5M
Yuning Mao retweeted
Yiqing Xie (@YiqingXieNLP)
Training on issue-solving only does NOT guarantee transfer to other tasks. 🎨Introducing Hybrid-Gym - synthetic training tasks for generalization (hybrid-gym.github.io) +25.4% on SWE-Bench / +7.9% on SWT-Bench / +5.1% on Commit-0 with NO issue-solving / test-gen/... training
Replies 1 · Retweets 23 · Likes 104 · Views 17.3K
Yuning Mao retweeted
Xianjun Yang (@xianjun_agi)
I was laid off by Meta today. As a Research Scientist, my work was just cited by the legendary @johnschulman2 and Nicholas Carlini yesterday. I’m actively looking for new opportunities — please reach out if you have any openings!
Susan Zhang (@suchenzang)

👀

Replies 263 · Retweets 343 · Likes 4.5K · Views 1.8M
Yuning Mao retweeted
Tim Franzmeyer (@frtimlive)
What if LLMs knew when to stop? 🚧 HALT finetuning teaches LLMs to only generate content they’re confident is correct. 🔍 Insight: Post-training must be adjusted to the model’s capabilities. ⚖️ Tunable trade-off: Higher correctness 🔒 vs. More completeness 📝 with @AIatMeta 🧵
Replies 1 · Retweets 12 · Likes 65 · Views 9.6K
Yuning Mao retweeted
Yiqing Xie (@YiqingXieNLP)
How to construct repo-level coding environments in a scalable way? Check out RepoST: an automated framework to construct repo-level environments using Sandbox Testing (repost-code-gen.github.io). Models trained with RepoST data can generalize well to other datasets (e.g., RepoEval)
Replies 3 · Retweets 19 · Likes 87 · Views 10.7K
Yuning Mao retweeted
Xianjun Yang (@xianjun_agi)
📢My New Paper: Diversity-driven Data Selection for Language Model Tuning through Sparse Autoencoder. TLDR: We proposed to use features from SAEs as a measure for data diversity & complexity and proved its effectiveness on data selection for LLM tuning. arxiv.org/pdf/2502.14050
Replies 7 · Retweets 37 · Likes 179 · Views 18.6K
Yuning Mao retweeted
Thomas Wolf (@Thom_Wolf)
Among the most impressive aspects of the Llama 3.1 release is the accompanying research paper! Close to 100 pages of deep knowledge-sharing on LLMs like we haven't seen very often recently. What a treat! It covers everything: pretraining data, filtering, annealing, synthetic data, scaling laws, infrastructures, parallelism, training recipes, post-training adaptation, tool-use, benchmarking, inference strategies, quantization, vision, speech, videos... Mind-blown! Maybe the single paper you can read today to join the field of LLMs from zero right to the frontier. Read it here and feel the open-science: ai.meta.com/research/publi…
Replies 15 · Retweets 249 · Likes 1.1K · Views 76.1K
Yuning Mao retweeted
AI at Meta (@AIatMeta)
Introducing Meta Llama 3: the most capable openly available LLM to date. Today we’re releasing 8B & 70B models that deliver on new capabilities such as improved reasoning and set a new state-of-the-art for models of their sizes. Today's release includes the first two Llama 3 models — in the coming months we expect to introduce new capabilities, longer context windows, additional model sizes and enhanced performance + the Llama 3 research paper for the community to learn from our work. More details ➡️ go.fb.me/i2y41n Download Llama 3 ➡️ go.fb.me/ct2xko
Replies 334 · Retweets 1.4K · Likes 5.6K · Views 1.1M
Yuning Mao retweeted
Mikayel Samvelyan (@_samvelyan)
Introducing 🌈 Rainbow Teaming, a new method for generating diverse adversarial prompts for LLMs via LLMs. It's a versatile tool 🛠️ for diagnosing model vulnerabilities across domains and creating data to enhance robustness & safety 🦺 Co-lead w/ @sharathraparthy & @_andreilupu
Replies 5 · Retweets 44 · Likes 178 · Views 56.4K