Jane Zhang

243 posts

Jane Zhang banner
Jane Zhang

Jane Zhang

@JaneZ901

HBS Researcher/MIT Knight Science Journalism Fellow/Ex-AI reporter for @Technology/Views are my own.

Cambridge, MA Katılım Ekim 2015
209 Takip Edilen691 Takipçiler
Jane Zhang retweetledi
MBZUAI
MBZUAI@mbzuai·
Arabic AI just took a major step forward. Inception, Cerebras, and MBZUAI have released Jais 2, a next-generation Arabic open-weight LLM trained on the richest Arabic-first dataset to date. Built from the ground up with 70 billion parameters, Jais 2 understands Arabic the way it’s truly spoken across dialects, culture, and modern expression. 🔗Discover more - apps coming soon! jaischat.ai/jaischat 🔗 The Model: huggingface.co/collections/in… @Inception_AI @cerebras
English
3
7
15
1.9K
Jane Zhang retweetledi
Eric Xing
Eric Xing@ericxing·
Now you have an alternative to the super popular but unfortunately not so transparent (you have no idea how it was trained, what data was used, is it safe …) base LLMs such as Qwen 2.5 or 3, to build your own reasoning or general purpose LLMs through post-train, SFT, RL, etc. It is 360-open and reproducible.
MBZUAI@mbzuai

Today, we are releasing a new version of K2 (K2-V2), a 360-open LLM built from scratch as a superior base for reasoning adaptation, while still excelling at core LLM capabilities like conversation, knowledge retrieval, and long-context understanding. K2 fills a major gap: highly capable models with no transparency. Instead of releasing only weights, we’re sharing the full training story — dataset recipes, mid-training checkpoints, logs, code, and evaluation tools. That’s 360-open. What’s inside: • 70B dense transformer engineered as a reasoning-enhanced base model • Native 512K context (extendable via RoPE scaling) • Mid-training reasoning phase • Strong tool-use scaffolding What we’re open-sourcing: • 250M+ reasoning traces (math, planning, multi-step logic) • Full pre- & mid-training data compositions • All mid-training checkpoints • Training logs, code, Eval360 Performance: • GPQA-Diamond: 55.1% mid-training → 69.3% after SFT (strongest fully open 70B model) • KK-8 Logic Puzzles: 83% — competitive with DeepSeek-R1 & OpenAI o3-mini-high • ArenaHard V2: 62.1% — close to Qwen3 235B • Outperforms Qwen2.5-72B and approaches Qwen3-235B despite being smaller and fully transparent. 🔗 The Model: bit.ly/3KIYwuo 🔗Technical Report: bit.ly/49V8h2U 🔗Blog: bit.ly/49V7gb6

English
1
12
43
9.3K
Jane Zhang retweetledi
LLM360
LLM360@llm360·
To mark the 2nd anniversary of LLM360, we are proud to release K2-V2: a 70B reasoning-centric foundation model that delivers frontier capabilities. As a push for "360-open" transparency, we are releasing not only weights, but the full recipe: data composition, training code, logs, and intermediate checkpoints. About K2-V2: 🧠 70B params, reasoning-optimized 🧊 512K context window 🔓 "360-Open" (Data, Logs, Checkpoints) 📈 SOTA on olympiad math and complex logic puzzles
LLM360 tweet media
English
2
25
56
21.3K
Jane Zhang
Jane Zhang@JaneZ901·
📢 Hey #NeurIPS2025 attendees + #AI journalists, I’m the US Comms Lead with IFM @mbzuai — the foundation-model engine for the “Stanford of the Middle East”. We’re hosting an invitation-only mixer on Dec 3 in San Diego. If you want to join or grab a 1:1 coffee, DM me!
English
0
0
3
132
Jane Zhang retweetledi
MBZUAI
MBZUAI@mbzuai·
Introducing K2 Think - a breakthrough in advanced AI reasoning. Developed by MBZUAI’s Institute of Foundation Models and @G42ai, K2 Think delivers frontier reasoning performance at a fraction of the size of today’s largest systems. Smaller. Smarter. Open to the world. Available now: K2Think.Ai/K2Think #K2Think #AI #OpenSource #MBZUAI #G42 #Innovation
English
19
83
318
76.8K
Jane Zhang
Jane Zhang@JaneZ901·
“Ask questions” vs “assign tasks”—that’s where Manus differs from OpenAI, Cheung said. Many users only know how to ask questions but struggle to define tasks, says Cheung: “Everyone should learn how to be a boss.” Educating users to delegate is key to unlocking LLM potential.
English
0
0
0
57
Jane Zhang
Jane Zhang@JaneZ901·
Despite divides on the products front, both value user experience. After studying Cursor AI last July, Manus saw non-coders struggle—so their product focused on what's in the right panel of Cursor and hid the code-heavy left, aiming for a simpler interface for everyday users.
English
1
0
0
96
Jane Zhang
Jane Zhang@JaneZ901·
Interesting contrast to Perplexity’s upcoming “Comet” launch: Chinese startup Manus AI started working on an AI browser last March but sunset it after 6 months, per co-founder Tao Cheung.
Jane Zhang tweet media
English
2
0
1
185
Jane Zhang
Jane Zhang@JaneZ901·
2. Srinivas: The future is gonna be more on agents that have web and search and browsing as a foundational element to them, but build on top of that to actually accomplish tasks, not just give you answers.
English
0
0
0
53
Jane Zhang
Jane Zhang@JaneZ901·
1. Srinivas on Perplexity’s competititive advantage: “In AI, I think nobody really has a moat right now. all the models are catching up doing quickly…the moat comes from really, really good product experience and like really fast iteration and customer obsession.”
English
1
0
0
66
Jane Zhang
Jane Zhang@JaneZ901·
Perplexity is going to launch its browser Comet later this month, becoming the latest to join the LLM-powered new generation of search engine battle. Aravind Srinivas, Co-Founder & CEO of Perplexity shared the news during an MIT event on Tuesday. A few other key takeaways:
Jane Zhang tweet media
English
1
0
1
107
Jane Zhang retweetledi
Sarah Zheng
Sarah Zheng@_szheng·
Didi unveiled a new autonomous concept car, complete with a robotic arm in its trunk, in its first big event since it was ordered to delist in the US back in 2021 bloomberg.com/news/articles/… with @JaneZ901
Sarah Zheng tweet media
English
0
1
5
1.8K
Jane Zhang
Jane Zhang@JaneZ901·
The Communist Party's two-year crackdown on the private sector has rattled China's entrepreneurs and venture capitalists. "Beijing can always come after you." Read The Big Take. bloomberg.com/news/articles/… via @business
English
0
0
1
549
Jane Zhang
Jane Zhang@JaneZ901·
A Chinese startup seeking to be the country’s answer to SpaceX is preparing a satellite launch that could beat Elon Musk’s company by relying on a new generation of rocket fuel. bloomberg.com/news/articles/… via @technology
English
0
3
5
0
Jane Zhang retweetledi
Bloomberg Technology
Bloomberg Technology@technology·
Alibaba decided against disclosing sales results for its Singles’ Day event for the first time, after forecasts that the figure may reveal an unprecedented decline trib.al/iNyMKoF
English
0
3
4
0
Jane Zhang retweetledi
Bloomberg Originals
Bloomberg Originals@bbgoriginals·
Google is getting into chips. The crunchy kind that you can eat. They’re part of the marketing push for the latest Pixel 7 phones from the Internet giant. @rumireports explains trib.al/J7DeTHK
English
0
13
35
0
Rebecca Choong Wilkins 钟碧琪
Rebecca Choong Wilkins 钟碧琪@RChoongWilkins·
China-born and Harvard-trained, Yao teaches a prestigious university class that's shaped some of the country’s biggest AI startups, informed government policy and molded a generation of academics. --> Excellent profile of one of China's most influential figures in AI by @JaneZ901
Jane Zhang@JaneZ901

A 75-year-old Harvard graduate is one of the driving forces behind China's AI ambitions, helping to shape some of the country's biggest startups bloomberg.com/news/articles/… via @technology

English
2
9
8
0