Delegate Chao Wu

2.7K posts


@Wu4Delegate

Data Scientist, Maryland State Delegate District 9A (Howard and Montgomery Counties). By Authority of Friends to Elect Chao Wu, Treasurer: Xia Chen.

Maryland, USA · Joined June 2015
865 Following · 1.3K Followers
Delegate Chao Wu reposted
Barack Obama @BarackObama
Make a plan to vote today: abigailspanberger.com/vote Then get your friends, family members, neighbors and coworkers to make a plan to vote, too. Because if we do, we will elect @SpanbergerForVA as your next governor and put Virginia on the path to a brighter future.
Delegate Chao Wu @Wu4Delegate
We need to be really careful about a few tech companies using government power to become monopolies, then using that monopoly to grab more money and power. It is destroying the country and the people.
Delegate Chao Wu @Wu4Delegate
Adding a physical-layer boundary (framework) and a Kalman filter to describe the state-space transition would shrink the search space by several orders of magnitude and increase the robustness of a VLM system. Sharing thoughts based on my control theory background.
Martin Ziqiao Ma @ziqiao_ma

So the key concern is: Using large language models to initialize vision-language(-action) models is a tempting trap; it lets us appear to make progress without truly achieving it. Most benchmarks have overwhelmingly focused on reasoning and digital domains, without fundamentally addressing perception, especially mid- and low-level vision. (Credit: Partly inspired by separate conversations with @xiangyue96 and @YutongBAI1002)

As humans, we clearly exhibit pre-linguistic roots in our intuitive physical and psychological understanding, e.g., basic principles like solidity, continuity, and gravity.

After we built GroundHog (arxiv.org/abs/2402.16846) in 2024, I took a moment to reflect on the core issues with VLMs. I can no longer convince myself that simply stacking CLIP and DINO with a few projection layers is the ultimate solution to "tokenize" vision. Vision–language models need a much stronger vision foundation, perhaps a fundamental restart from a vision-centric perspective.

That’s why I stepped away from VLM development for a year to explore alternatives. A paper @TairanHe99 shared in this thread (led by the brilliant @TongPetersb) was especially thought-provoking. But to truly start over, I began looking into 3D foundation models and video diffusion models, setting aside, for now, the possibility of joint vision–language diffusion models. This led me to take the risk of developing 4D-LRM (arxiv.org/abs/2506.18890), aiming to learn 4D priors at scale with absolutely no language prior.

This is only a first step. At some point, I plan to return to VLM engineering. But next time, I hope I have resources to start with a world model first and then unlock the language component on top of it.

Delegate Chao Wu @Wu4Delegate
In our doctor’s office. Beauty is everywhere.
[Image]
Delegate Chao Wu reposted
Python Programming @PythonPr
The AI Agent Staircase
[Image]
Delegate Chao Wu @Wu4Delegate
The AI world is bifurcating and converging. Now Gemini will not generate videos containing “Donald Trump” and MiniMax will not generate videos containing “Xi Jinping”.
Delegate Chao Wu @Wu4Delegate
This is crazy. BALTIMORE (WBFF): Speed cameras on the I-83 Jones Falls Expressway have issued more than $18.5 million in fines in the past three years, but about 80% of the revenue has gone to the camera vendor, Verra Mobility, not the city, according to the Baltimore City Department of Finance.
FOX Baltimore @FOXBaltimore

Speed cameras on the I-83 Jones Falls Expressway have issued more than $18.5 million in fines in the past three years, but about 80% of the revenue has gone to the camera vendor, Verra Mobility — not the city, according to the Baltimore City Department of Finance. bit.ly/3Tu4jVu

Delegate Chao Wu reposted
PyQuant News 🐍 @pyquantnews
High-value skills in 2025 (and beyond).
[Image]
Delegate Chao Wu reposted
Always Keep Learning @AlwaysKeepL
Public Speaking Secrets
[Image]
Delegate Chao Wu @Wu4Delegate
Enjoy the flowers and the spring.
[4 images]