Jiang Bian

38 posts

Jiang Bian banner
Jiang Bian

Jiang Bian

@jbian22

Partner Research Manager at Microsoft Research

Beijing Katılım Ekim 2015
331 Takip Edilen39 Takipçiler
Jiang Bian
Jiang Bian@jbian22·
See Right (Robustness):BiPS enforces perceptual consistency via a bi-directional KL divergence constraint. This aligns predictive distributions between noisy and focused views, effectively mitigating hallucinations caused by visual noise. 🧠
English
1
0
0
61
Jiang Bian
Jiang Bian@jbian22·
🚀 New Research: Efficient & Robust MLLMs via Bi-directional Perceptual Shaping (BiPS) High-res inputs in Multimodal LLMs cause compute bottlenecks & noise sensitivity. Our new framework solves the "redundancy vs. utility" trade-off. See Less, See Right. 🧵👇
English
1
0
0
80
Jiang Bian
Jiang Bian@jbian22·
🏗️ Generative Design Bridging the gap between parametric history and 3D geometry: 🔹 CADMorph: Geometry-Driven Parametric CAD Editing via a Plan-Generate-Verify Loop #GenerativeAI #CAD
English
0
0
0
31
Jiang Bian
Jiang Bian@jbian22·
🧬 AI for Science & Healthcare 🔹 MIRA: Medical Time Series Foundation Model for Real-World Health Data 🔹 Generating Full-field Evolution of Physical Dynamics from Irregular Sparse Observations 🔹 Functional Complexity-adaptive Temporal Tensor Decomposition
English
1
0
1
55
Jiang Bian
Jiang Bian@jbian22·
🚀 We are heading to #NeurIPS2025 in San Diego! Excited to announce my group has 7 accepted papers this year, tackling the frontiers of Agentic AI, AI for Health & Science, and Generative Design. A breakdown of our work 🧵👇 #AI #MachineLearning #MicrosoftResearch
English
1
0
1
78
Jiang Bian
Jiang Bian@jbian22·
Second fix: Flexible, Non-Linear Reasoning. No more rigid, one-way chains! PixelCraft has a "Planner" and an "Image Memory" (a "cognitive whiteboard"). This lets the system adaptively revisit any prior visual step, backtrack from errors, and explore different reasoning branches
English
1
0
0
14
Jiang Bian
Jiang Bian@jbian22·
MLLMs are great, but they're surprisingly bad at reading charts and geometry. A tiny "perceptual slip" can wreck the whole reasoning process. We're thrilled to introduce PixelCraft 👾, a new multi-agent system to solve this.
Jiang Bian tweet media
English
1
0
0
37
Jiang Bian
Jiang Bian@jbian22·
Key Insight: Perfect self-verification isn't required. We frame reasoning as a probabilistic process. As long as the chance of improvement is > chance of degradation, the model can converge to the correct answer. Result: An 8B model beat its 600B teacher on AIME.
English
0
0
0
18
Jiang Bian retweetledi
Hanze Dong
Hanze Dong@hendrydong·
💥Thrilled to share our new work Reinforce-Ada, which fixes signal collapse in GRPO 🥳No more blind oversampling or dead updates. Just sharper gradients, faster convergence, and stronger models. ⚙️ One-line drop-in. Real gains. arxiv.org/html/2510.0499… github.com/RLHFlow/Reinfo…
Hanze Dong tweet media
English
7
24
181
18.9K
Jiang Bian
Jiang Bian@jbian22·
🔑 Under the hood • Grounded latent via a proprio Forward-Dynamics Model → deeper motion understanding • Joint diffusion policy where latent & low-level actions co-evolve → long-horizon reasoning • Superior performance on SIMPLER, LIBERO and gripper & dexterous-hand 🏆
English
0
0
0
27
Jiang Bian
Jiang Bian@jbian22·
🌉 Why it matters 1⃣ Latent actions = mid-level bridge 📷🗣️ ➡️ 🤖, enabling structured planning. 2⃣ Embodiment-agnostic latents unlock cross-robot transfer & ultra-fast adaptation to new hardware. 🔄⚡️
English
1
0
0
17
Jiang Bian
Jiang Bian@jbian22·
The impact is clear: Geometry Forcing substantially improves visual quality and 3D consistency over baseline methods, slashing the FVD score from 364 to 243 on a long-term video generation task. Read the full paper here: arxiv.org/pdf/2507.07982
English
0
0
1
18
Jiang Bian
Jiang Bian@jbian22·
Our solution, Geometry Forcing, aligns the video model’s internal representations with features from a pretrained geometric foundation model (VGGT). We introduce two new objectives, Angular Alignment and Scale Alignment, to enforce geometric consistency during training.
English
1
0
0
24