Do Xuan Long

101 posts

Do Xuan Long

Do Xuan Long

@dxlong2000

Student Researcher @Google & CS PhD @NUSingapore | Prev. @amazon, @NTUsg

Katılım Aralık 2021
464 Takip Edilen232 Takipçiler
Do Xuan Long retweetledi
𝐊𝐚𝐦𝐢𝐥 𝐏𝐚𝐰𝐥𝐢𝐤
Badacze z Google Cloud AI Research i National University of Singapore niedawno opublikowali papier o A²RD – systemie do generowania względnie długich materiałów wideo za pomocą AI... i trzeba przyznać, że wygląda to naprawdę przyzwoicie. Demka pokazują pojedyncze sceny trwające nawet około minuty bez typowego dla AI rozpadu wizualnego, a ich jakość jest zaskakująco dobra; w pewnym sensie bardziej przypomina mi to rendery z programów graficznych niż typowy AI slop. Polecam sprawdzić samemu.
Polski
2
2
70
2.3K
Do Xuan Long retweetledi
Tomas Pfister
Tomas Pfister@tomaspfister·
If you’ve tried making AI video >30s, you know the nightmare. Bouncing between tools, manual stitching, and fighting "identity drift" where faces morph every frame. We decided to automate the entire crew. Meet Co-Director. 🧵👇 (1/9)
English
1
1
7
698
Tengxiao Liu
Tengxiao Liu@TengxiaoLiu·
Auto research is on 🔥 We give algorithmic problems (like circle packing) to general coding agents, let it run overnight. 🌙 Agents reach SoTA. But more importantly: we analyze 100+ hours of trajectories to understand how it gets there 🧵
Tengxiao Liu tweet media
English
7
18
63
32.1K
Do Xuan Long retweetledi
Jiefeng Chen
Jiefeng Chen@jiefengchen1·
My team at Google Cloud AI Research is looking for a Student Researcher Intern to dive deep into coding agents. We’re looking for someone who doesn’t just read about agentic workflows but builds them. What we’re looking for: Academic Rigor: Currently pursuing a Ph.D. with a strong publication record. Technical Chops: Excellent coding skills are a must. Agent Experience: If you’ve built or experimented with coding agents (like Claude Code, GEMINI CLI or similar frameworks), we want to talk to you. Come help us push the boundaries of LLM-based software engineering. 🚀 If this sounds like a fit, feel free to DM me or send your CV directly to jiefengc@google.com #Google #AIResearch #CodingAgents #LLMs #MachineLearning
English
10
12
160
14.9K
Jinjie Ni
Jinjie Ni@NiJinjie·
Life update: I’ve joined @GoogleDeepMind as a research scientist to work on ✨gemini scaling and RL, under the leadership of Yi Tay (@YiTayML) and Quoc Le (@quocleix). I feel extremely fortunate to be on the critical path towards AGI and can't wait to help push the frontier of gemini capabilities! 🚀
Jinjie Ni tweet media
English
66
26
1.2K
91.7K
Do Xuan Long retweetledi
Rohan Paul
Rohan Paul@rohanpaul_ai·
New Google paper builds a video generator that improves itself at test time by rewriting the prompt while it runs. It first turns the user prompt into a simple timeline of scenes with duration, characters, actions, environment, camera, sounds, and mood. It then makes several videos and picks the best using head to head comparisons that swap the order to avoid bias. The picker also applies hard penalties for broken physics, random text on screen, extra scene cuts, or voice and music that were not requested. After that, 3 separate judges score the winner on visuals, audio, and context, and a meta judge merges the notes into clear issues. A reasoning agent converts those issues into short prompt edits that keep the user’s intent and target the exact failures. The system repeats this loop, generates new candidates, and keeps the best until further edits stop helping. On single scene and multi scene tests with Veo 3 and Veo 2, it raises visual quality, motion realism, prompt match, and audio quality. Across stronger baselines, it reaches up to 60% pairwise wins, and humans choose its results in 66.4% of trials. ---- Paper – arxiv. org/abs/2510.15831v1 Paper Title: "VISTA: A Test-Time Self-Improving Video Generation Agent"
Rohan Paul tweet media
English
5
13
82
6.5K
Do Xuan Long retweetledi
wing.nus
wing.nus@wing_nus·
🤔 Why do Transformers and Mamba (SSMs) fail differently on long context? 🔎 How do they mix and reshape context across depth? 🚀 No one had a unified, token + layer-level view — until now! 🔗 Paper: arxiv.org/pdf/2510.06640 🧵 👇 More in thread #Transformers #Mamba #NLP
wing.nus tweet media
English
1
3
6
342
Do Xuan Long retweetledi
wing.nus
wing.nus@wing_nus·
Ever wondered *how* language models understand discourse relations 🧠⚡️🔍? We address this long-standing question in our #EMNLP2025 paper: “Discursive Circuits: How Do Language Models Understand Discourse Relations?” By @YisongMiao and @knmnyn #NLProc #Discourse 🧵1/n
English
1
3
6
1K
Do Xuan Long retweetledi
God of Prompt
God of Prompt@godofprompt·
Holy shit...Google just dropped a self-improving video generation agent 🤯 It’s called VISTA, and it literally rewrites its own prompts to make videos better every single generation. No retraining. No fine-tuning. Just pure test-time self-reflection. Here’s how it works: → Breaks your idea into a full scene-by-scene plan → Generates multiple videos → Judges them in a tournament → Then critiques itself visually, audibly, and contextually before trying again Each loop = smarter, sharper, more aligned video. The results? A 60% win rate against SOTA models like Veo 3 and 66.4% human preference. This isn’t just text-to-video. This is video that learns from itself.
God of Prompt tweet media
English
15
17
103
27.8K
yi
yi@agihippo·
What's with the young undergraduates in Singapore these days fomo farming internships? I had zero internships and I still turned out pretty fine.
English
8
1
31
7.2K
yi
yi@agihippo·
@dxlong2000 Google has a nice gym
English
1
0
3
764
yi
yi@agihippo·
a while back NUS offered me a fancy professor title (honorary) but i rejected it because there was no point at all with such a title. but now i realised i could have just taken it so i could book the badminton courts there. damnit.
English
2
0
81
6.9K