Jane Zhang

243 posts

Jane Zhang

@JaneZ901

HBS Researcher/MIT Knight Science Journalism Fellow/Ex-AI reporter for @Technology/Views are my own.

Cambridge, MA Katılım Ekim 2015

209 Takip Edilen691 Takipçiler

Jane Zhang retweetledi

MBZUAI@mbzuai·9 Ara

Arabic AI just took a major step forward. Inception, Cerebras, and MBZUAI have released Jais 2, a next-generation Arabic open-weight LLM trained on the richest Arabic-first dataset to date. Built from the ground up with 70 billion parameters, Jais 2 understands Arabic the way it’s truly spoken across dialects, culture, and modern expression. 🔗Discover more - apps coming soon! jaischat.ai/jaischat 🔗 The Model: huggingface.co/collections/in… @Inception_AI @cerebras

English

1.9K

Jane Zhang retweetledi

Eric Xing@ericxing·5 Ara

Now you have an alternative to the super popular but unfortunately not so transparent (you have no idea how it was trained, what data was used, is it safe …) base LLMs such as Qwen 2.5 or 3, to build your own reasoning or general purpose LLMs through post-train, SFT, RL, etc. It is 360-open and reproducible.

MBZUAI@mbzuai

Today, we are releasing a new version of K2 (K2-V2), a 360-open LLM built from scratch as a superior base for reasoning adaptation, while still excelling at core LLM capabilities like conversation, knowledge retrieval, and long-context understanding. K2 fills a major gap: highly capable models with no transparency. Instead of releasing only weights, we’re sharing the full training story — dataset recipes, mid-training checkpoints, logs, code, and evaluation tools. That’s 360-open. What’s inside: • 70B dense transformer engineered as a reasoning-enhanced base model • Native 512K context (extendable via RoPE scaling) • Mid-training reasoning phase • Strong tool-use scaffolding What we’re open-sourcing: • 250M+ reasoning traces (math, planning, multi-step logic) • Full pre- & mid-training data compositions • All mid-training checkpoints • Training logs, code, Eval360 Performance: • GPQA-Diamond: 55.1% mid-training → 69.3% after SFT (strongest fully open 70B model) • KK-8 Logic Puzzles: 83% — competitive with DeepSeek-R1 & OpenAI o3-mini-high • ArenaHard V2: 62.1% — close to Qwen3 235B • Outperforms Qwen2.5-72B and approaches Qwen3-235B despite being smaller and fully transparent. 🔗 The Model: bit.ly/3KIYwuo 🔗Technical Report: bit.ly/49V8h2U 🔗Blog: bit.ly/49V7gb6

English

9.3K

Jane Zhang retweetledi

LLM360@llm360·5 Ara

To mark the 2nd anniversary of LLM360, we are proud to release K2-V2: a 70B reasoning-centric foundation model that delivers frontier capabilities. As a push for "360-open" transparency, we are releasing not only weights, but the full recipe: data composition, training code, logs, and intermediate checkpoints. About K2-V2: 🧠 70B params, reasoning-optimized 🧊 512K context window 🔓 "360-Open" (Data, Logs, Checkpoints) 📈 SOTA on olympiad math and complex logic puzzles

English

21.3K

Jane Zhang@JaneZ901·2 Ara

📢 Hey #NeurIPS2025 attendees + #AI journalists, I’m the US Comms Lead with IFM @mbzuai — the foundation-model engine for the “Stanford of the Middle East”. We’re hosting an invitation-only mixer on Dec 3 in San Diego. If you want to join or grab a 1:1 coffee, DM me!

English

132

Jane Zhang retweetledi

MBZUAI@mbzuai·9 Eyl

Introducing K2 Think - a breakthrough in advanced AI reasoning. Developed by MBZUAI’s Institute of Foundation Models and @G42ai, K2 Think delivers frontier reasoning performance at a fraction of the size of today’s largest systems. Smaller. Smarter. Open to the world. Available now: K2Think.Ai/K2Think #K2Think #AI #OpenSource #MBZUAI #G42 #Innovation

English

318

76.8K

Jane Zhang@JaneZ901·6 Nis

“Ask questions” vs “assign tasks”—that’s where Manus differs from OpenAI, Cheung said. Many users only know how to ask questions but struggle to define tasks, says Cheung: “Everyone should learn how to be a boss.” Educating users to delegate is key to unlocking LLM potential.

English

Jane Zhang@JaneZ901·6 Nis

Despite divides on the products front, both value user experience. After studying Cursor AI last July, Manus saw non-coders struggle—so their product focused on what's in the right panel of Cursor and hid the code-heavy left, aiming for a simpler interface for everyday users.

English

Jane Zhang@JaneZ901·6 Nis

Interesting contrast to Perplexity’s upcoming “Comet” launch: Chinese startup Manus AI started working on an AI browser last March but sunset it after 6 months, per co-founder Tao Cheung.

English

185

Jane Zhang@JaneZ901·1 Nis

2. Srinivas: The future is gonna be more on agents that have web and search and browsing as a foundational element to them, but build on top of that to actually accomplish tasks, not just give you answers.

English

Jane Zhang@JaneZ901·1 Nis

1. Srinivas on Perplexity’s competititive advantage: “In AI, I think nobody really has a moat right now. all the models are catching up doing quickly…the moat comes from really, really good product experience and like really fast iteration and customer obsession.”

English

Jane Zhang@JaneZ901·1 Nis

Perplexity is going to launch its browser Comet later this month, becoming the latest to join the LLM-powered new generation of search engine battle. Aravind Srinivas, Co-Founder & CEO of Perplexity shared the news during an MIT event on Tuesday. A few other key takeaways:

English

107

Jane Zhang retweetledi

Sarah Zheng@_szheng·13 Nis

Didi unveiled a new autonomous concept car, complete with a robotic arm in its trunk, in its first big event since it was ordered to delist in the US back in 2021 bloomberg.com/news/articles/… with @JaneZ901

English

1.8K

Jane Zhang@JaneZ901·7 Mar

The Communist Party's two-year crackdown on the private sector has rattled China's entrepreneurs and venture capitalists. "Beijing can always come after you." Read The Big Take. bloomberg.com/news/articles/… via @business

English

549

Jane Zhang@JaneZ901·14 Ara

A Chinese startup seeking to be the country’s answer to SpaceX is preparing a satellite launch that could beat Elon Musk’s company by relying on a new generation of rocket fuel. bloomberg.com/news/articles/… via @technology

English

Jane Zhang@JaneZ901·9 Ara

#NewProfilePic

QME

Jane Zhang retweetledi

Bloomberg Technology@technology·11 Kas

Alibaba decided against disclosing sales results for its Singles’ Day event for the first time, after forecasts that the figure may reveal an unprecedented decline trib.al/iNyMKoF

English

Jane Zhang retweetledi

Bloomberg Originals@bbgoriginals·21 Eki

Google is getting into chips. The crunchy kind that you can eat. They’re part of the marketing push for the latest Pixel 7 phones from the Internet giant. @rumireports explains trib.al/J7DeTHK

English

Jane Zhang@JaneZ901·25 Tem

@RChoongWilkins Thanks for sharing!😆

English

Rebecca Choong Wilkins 钟碧琪@RChoongWilkins·25 Tem

China-born and Harvard-trained, Yao teaches a prestigious university class that's shaped some of the country’s biggest AI startups, informed government policy and molded a generation of academics. --> Excellent profile of one of China's most influential figures in AI by @JaneZ901

Jane Zhang@JaneZ901

A 75-year-old Harvard graduate is one of the driving forces behind China's AI ambitions, helping to shape some of the country's biggest startups bloomberg.com/news/articles/… via @technology

English

Jane Zhang@JaneZ901·25 Tem

A 75-year-old Harvard graduate is one of the driving forces behind China's AI ambitions, helping to shape some of the country's biggest startups bloomberg.com/news/articles/… via @technology

English

Jane Zhang retweetledi

Zheping Huang@pingroma·25 Mar

Come join us🔥🔥

Vlad Savov@vladsavov

Twitter Spaces alert: I'll be hosting a chat with @luluyilun, @pingroma and @BradStone on Tencent and the changing face of China's internet economy at 1pm Tokyo/ noon HK / 10pm PT. You're allowed to stay up late / get up early to tune in.

English

Keşfet

@Inception_AI @cerebras @mbzuai @G42ai @business @technology @rumireports @RChoongWilkins