Ryosuke Matsuda

380 posts

Ryosuke Matsuda banner
Ryosuke Matsuda

Ryosuke Matsuda

@VolumeisRyo

NLP, Multimodal @ Tohoku NLP Group(@tohoku_NLP) M2 / JPHACKS2024 Finalist / AtCode緑🍵

Beigetreten Eylül 2022
469 Folgt484 Follower
Angehefteter Tweet
Ryosuke Matsuda
Ryosuke Matsuda@VolumeisRyo·
🎉 Excited to share that our paper has been accepted to CVPR 2026 and is now available on arXiv! SLVMEval: Synthetic Meta Evaluation Benchmark for Text-to-Long Video Generation 🔗 arxiv.org/abs/2603.29186 #CVPR2026 #arXiv [1/N]
Ryosuke Matsuda tweet media
English
1
6
35
2.3K
Ryosuke Matsuda retweetet
SkalskiP
SkalskiP@skalskip92·
I'm putting together a list of top CVPR 2026 papers collecting must-see papers with links to code, demos, and posters all in one place basically my notes so I don't miss anything important link: github.com/SkalskiP/top-c…
SkalskiP tweet media
English
6
44
413
20.7K
Ryosuke Matsuda retweetet
Manu Gaur
Manu Gaur@gaur_manu·
Pretrained ViTs like DINOv2 or CLIP are great, but they produce fixed, generic representations that encode the most salient visual concepts (e.g., "cat"). In human vision, prior priming with language changes how people parse an image. We believe visual encoders should do the same 🚨 Introducing Steerable Visual Representations, a new family of visual features you can steer with text towards specific visual concepts.
Manu Gaur tweet media
English
13
133
897
143.2K
Ryosuke Matsuda retweetet
Anthropic
Anthropic@AnthropicAI·
Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing
English
2K
6.7K
43.9K
30.7M
Ryosuke Matsuda
Ryosuke Matsuda@VolumeisRyo·
✅ Our experiments show that humans can reliably identify the better long video, while existing evaluation systems still fall short on 9 out of 10 aspects. Would be happy if you check it out! 🙌 📄 Paper: arxiv.org/abs/2603.29186 🚀 Project Page : slvmeval.github.io [5/N]
English
0
0
0
83
Ryosuke Matsuda
Ryosuke Matsuda@VolumeisRyo·
📊 This gives us a testbed for evaluating whether existing systems can correctly rank long-video pairs when the quality difference should be obvious to people. In other words, SLVMEval tests a fundamental requirement for reliable long-video evaluation. [4/N]
English
1
0
0
98
Ryosuke Matsuda
Ryosuke Matsuda@VolumeisRyo·
🎉 Excited to share that our paper has been accepted to CVPR 2026 and is now available on arXiv! SLVMEval: Synthetic Meta Evaluation Benchmark for Text-to-Long Video Generation 🔗 arxiv.org/abs/2603.29186 #CVPR2026 #arXiv [1/N]
Ryosuke Matsuda tweet media
English
1
6
35
2.3K
Ryosuke Matsuda retweetet
Peter Holderrieth
Peter Holderrieth@peholderrieth·
🚀MIT Flow Matching and Diffusion Lecture 2026 Released (diffusion.csail.mit.edu)! We just released our new MIT 2026 course on flow matching and diffusion models! We teach the full stack of modern AI image, video, protein generators - theory and practice. We include: 📺 Videos: Step-by-step derivations. 📝 Notes: Mathematically self-contained lecture notes 💻 Coding: Hands-on exercises for every component We fully improved last years’ iteration and added new topics: latent spaces, diffusion transformers, building language models with discrete diffusion models. Everything is available here: diffusion.csail.mit.edu A huge thanks to Tommi Jaakkola for his support in making this class possible and Ashay Athalye (MIT SOUL) for the incredible production! Was fun to do this with @RShprints! #MachineLearning #GenerativeAI #MIT #DiffusionModels #AI
Peter Holderrieth tweet media
English
15
394
2.2K
522.7K
Ryosuke Matsuda retweetet
Chieh-Hsin (Jesse) Lai
Chieh-Hsin (Jesse) Lai@JCJesseLai·
Tired to go back to the original papers again and again? Our monograph: a systematic and fundamental recipe you can rely on! 📘 We’re excited to release 《The Principles of Diffusion Models》— with @DrYangSong, @gimdong58085414, @mittu1204, and @StefanoErmon. It traces the core ideas that shaped diffusion modeling and explains how today’s models work, why they work, and where they’re heading. 🧵You’ll find the link and a few highlights in the thread. We’d love to hear your thoughts and join some discussions! ⚡ Stay tuned for our markdown version, where you can drop your comments!
Chieh-Hsin (Jesse) Lai tweet media
English
53
489
2.4K
840.3K
Ryosuke Matsuda
Ryosuke Matsuda@VolumeisRyo·
#NLP2026 にて主著1件の発表をします! LINEヤフーとの共同研究で,先日CVPR2026 mainに採択された内容を発表します! 長尺動画に関するベンチマークを提案していますので,マルチモーダルに興味ある方はぜひ見に来て下さい!📹️ 日時:3/10 (火) 11:15-12:45 場所:C2-01 (C会場 2F大会議室202)
Ryosuke Matsuda tweet media
日本語
0
4
28
1.9K
Ryosuke Matsuda retweetet
David Fan
David Fan@DavidJFan·
[1/9] What happens when you treat vision as a first-class citizen during multimodal pretraining? To find out, we studied the design space of training Transfusion-style models that input and output all modalities, from scratch. Here is what we learned about visual representations, data, world modeling, architecture, and scaling behavior! Paper: arxiv.org/abs/2603.03276 Website: beyond-llms.github.io @TongPetersb, @DavidJFan, @__JohnNguyen__, @ellisbrown, @GaoyueZhou, @JasonQSY, @boyangzheng, @webalorn, @han_junlin, @rob_fergus, @NailaMurray, @gh_marjan, @ml_perception, Nicolas Ballas, @_amirbar, Michael Rabbat, Jakob Verbeek, @LukeZettlemoyer, @koustuvsinha, @ylecun, @sainingxie
English
12
62
301
49.6K
Ryosuke Matsuda retweetet
Haruto Yoshida
Haruto Yoshida@yoshida_NLP·
大規模視覚言語モデル内部におけるダイアグラムの表現を分析した論文が arXiv で公開されました! #NLP2026 でも発表予定です!
Haruto Yoshida@yoshida_NLP

🚀 New paper on arXiv! arxiv.org/abs/2603.02865… 🤔 How do LVLMs internally form representations of nodes and edges? 💡 Node representations form early, whereas edge representations form late. Feedback is welcome! 1/N

日本語
0
4
33
2.5K
Naka!
Naka!@naka_BB5·
@VolumeisRyo すごすぎ!!おめでとう!!!
日本語
1
0
1
165
Ryosuke Matsuda retweetet
Ryosuke Matsuda
Ryosuke Matsuda@VolumeisRyo·
主著論文が #CVPR2026 にmainでAcceptされました!!🎉 @CVPR M1 のうちに国際会議に採択されて,大変光栄に思います. 沢山の助言や指導をして頂き,共著者の感謝申し上げます.🙏
日本語
1
10
75
5.4K
Ryosuke Matsuda retweetet
Haruto Yoshida
Haruto Yoshida@yoshida_NLP·
共著論文が #CVPR2026 に採択されました🎉 M1 で CVPR はすごい!めでたい!
日本語
0
3
40
2.6K
Ryosuke Matsuda retweetet
LINEヤフー Tech
LINEヤフー Tech@lycorptech_jp·
LINEヤフー Tech Blog 🆕 『高性能な日本語マルチモーダル基盤モデル「clip-japanese-base-v2」の公開』 - 日本語特化CLIPを高性能化し公開 - 大規模データ収集と精密フィルタによる精度の底上げ - 知識蒸留によるさらなる精度改善 techblog.lycorp.co.jp/ja/20251218a
日本語
0
49
148
48.3K