Brian B. Moser

51 posts


@bmoser1995

Ph.D., Senior Researcher at German Research Center for Artificial Intelligence.

Kaiserslautern, Germany · Joined April 2018
53 Following · 35 Followers
Pinned Tweet
Brian B. Moser@bmoser1995·
🎉 Our paper “Unlocking Dataset Distillation with Diffusion Models” has been accepted at #NeurIPS 25! We show how to unlock end-to-end dataset distillation through diffusion models by tackling the vanishing gradient problem! 📄 : arxiv.org/abs/2403.03881 #DiffusionModels
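The vanishing-gradient problem the pinned tweet mentions can be illustrated with a toy chain-rule computation. This is a sketch under simplifying assumptions of my own (a fixed linear, mildly contractive step standing in for a denoising step), not the paper's method: backpropagating end-to-end through many iterative steps multiplies one Jacobian per step, and if each step is contractive the product decays geometrically.

```python
import numpy as np

# Toy illustration: end-to-end gradient through k iterative steps is a
# product of k per-step Jacobians. Here each "step" is a fixed linear map
# with spectral norm 0.9 (orthogonal matrix scaled by 0.9), so the
# accumulated gradient shrinks like 0.9**k.
rng = np.random.default_rng(0)
d = 8
Q, _ = np.linalg.qr(rng.normal(size=(d, d)))  # random orthogonal matrix
W = 0.9 * Q                                   # contractive step Jacobian

grad = np.eye(d)                              # seed gradient at the output
norms = []
for step in range(50):                        # 50 "denoising" steps
    grad = W.T @ grad                         # chain rule: one Jacobian per step
    norms.append(np.linalg.norm(grad, 2))     # spectral norm of accumulated grad

# norms[0] is 0.9; norms[-1] is 0.9**50, i.e. the gradient has all but vanished
print(norms[0], norms[-1])
```

With 50 steps the gradient's spectral norm drops by more than two orders of magnitude, which is why naive end-to-end backpropagation through a long diffusion sampling chain is hard.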
Brian B. Moser reposted
Rosinality@rosinality·
Simple vision pretraining by predicting the next step's embedding. The embedding itself is trained along with this objective, with a stop-gradient applied when it is used as a target.
[image]
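The stop-gradient setup described in the tweet can be sketched in a few lines. This is a minimal PyTorch sketch under my own assumptions (toy linear encoder and predictor, squared-error loss), not the authors' actual code: the encoder's output for the next step is detached when used as the target, so gradients reach the encoder only through the prediction branch.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
encoder = nn.Linear(16, 8)    # toy patch encoder (hypothetical stand-in)
predictor = nn.Linear(8, 8)   # predicts the next-step embedding

patch_t = torch.randn(4, 16)      # batch of "current" patches
patch_next = torch.randn(4, 16)   # batch of "next-step" patches

z_t = encoder(patch_t)
target = encoder(patch_next).detach()  # stop-grad: target does not backprop
loss = ((predictor(z_t) - target) ** 2).mean()
loss.backward()

# Gradients reach the encoder only via the online (prediction) branch;
# the target branch is cut by .detach().
print(encoder.weight.grad is not None)  # True
```

The `.detach()` call is what the tweet calls "stop grad": without it, the trivial solution of collapsing both embeddings toward each other would be reachable through the target branch as well.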
Brian B. Moser@bmoser1995·
#Meta just released SAM Audio: Segment Anything, but for sound. It’s actually so cool: isolate a voice/instrument/noise with a prompt. Now imagine Meta Ray-Ban (probably a feature already in the making): choose the person you want to listen to… and hear only them. #AI
AI at Meta@AIatMeta

🔉 Introducing SAM Audio, the first unified model that isolates any sound from complex audio mixtures using text, visual, or span prompts. We’re sharing SAM Audio with the community, along with a perception encoder model, benchmarks and research papers, to empower others to explore new forms of expression and build applications that were previously out of reach. 🔗 Learn more: go.meta.me/568e5d

Brian B. Moser@bmoser1995·
What a transformative experience at #NeurIPS. Lessons learned:
- History rhymes: RL and Meta-Learning are back on the menu.
- Things move fast and are going to move faster. As a scientist, plan your next years carefully! I definitely will.
- Networking is king.
[image]
Brian B. Moser@bmoser1995·
Merging workshop orals and posters in one room feels like a bad idea tbh… #neurips
Miles Cranmer@MilesCranmer·
It's crazy how much a conference's app impacts the overall experience. I miss Whova!
Brian B. Moser@bmoser1995·
@alfcnz Sorry again for the inconvenience! A problem with the printing service caused the right side of the poster to become increasingly smeared, even though we provided a PDF… The associated paper: arxiv.org/abs/2403.03881
[image]
Alfredo Canziani@alfcnz·
Did I just take my glasses off? 😥😥😥
[image]
Brian B. Moser reposted
Peter Richtarik@peter_richtarik·
I am an AC for ICLR 2026. One of the papers in my batch was just withdrawn. The authors wrote a brief response explaining why the reviewers failed at their job. I agree with most of their comments. The authors gave up. They are fed up. Just like many of us. I understand. We pretend the emperor has clothes, but he is naked. Here is the final part of their withdrawal notice. I took the liberty of making it public, to highlight that what we have been doing with AI conference reviews these last few years is, basically, madness.

---

Comment: We thank the reviewers for their time. However, upon reading the reviews for our paper, it became immediately apparent that the four "reject" ratings are not based on good-faith academic disagreement, but on a critical failure to read the submitted paper. The reviews are rife with demonstrably false claims that are directly contradicted by the text. The core justifications for rejection rely on asserting that key components are "missing" when they are explicitly detailed in the manuscript. Some specific examples (several of which are outright fabrications):

- Claim: Harder tasks like GSM8K are missing. Fact: GSM8K results appear in many tables, e.g. Table 2 (Section 4.2) and Appendix G.
- Claim: The method does not use per-layer ranks. Fact: This is the entire point of our method; the reviewer clearly mistook our method for the baselines (Section 2, Table 1).
- Claim: The GP kernel is not specified. Fact: It is specified in Appendix E (Table 6).
- Claim: There is no ablation of the method's three stages. Fact: Section 4.4 ("Ablation Study") and Appendix J are dedicated to this.

Reviewers have a fundamental responsibility to read and evaluate the work they are assigned. The nature of these errors is so fundamental, so systemic in overlooking explicit content, that it goes far beyond what "limited time" or "oversight" can explain. This work has gone through several rounds of revision over the last year. In earlier submissions, the paper usually received borderline or weak-accept scores. Numerous signs strongly suggest that some reviewers are relying entirely on AI tools to automatically generate peer reviews, rather than fulfilling their fundamental responsibility of personally reading and evaluating manuscripts. We strongly protest this. It is a gross disrespect to the authors and a flagrant desecration of the reviewer's duty, and it fundamentally undermines the integrity of the entire peer-review process.

Given that the reviews are not based on the actual content of our paper, we have decided to withdraw the submission. We leave this comment so that future readers of the OpenReview page are aware that the items described as "missing" are already present in the submitted manuscript. The negative reviews for this submission are factually unsound and do not reflect the content of the paper. We cannot and will not accept an assessment that is not based on the work we actually submitted.
Alex Prompter@alex_prompter·
MIT just made vibe coding an official part of engineering 💀

MIT just formalized "Vibe Coding" – the thing you've been doing for months where you generate code, run it, and if the output looks right you ship it without reading a single line. turns out that's not laziness. it's a legitimate software engineering paradigm now.

they analyzed 1000+ papers and built a whole Constrained Markov Decision Process to model what you thought was just "using ChatGPT to code." they formalized the triadic relationship: your intent (what/why) + your codebase (where) + the agent's decisions (how).

which means the shift already happened. you missed it. there was no announcement, no transition period. one morning you woke up writing functions and by lunch you were validating agent outputs and convincing yourself you're still "a developer." but you're not. not in the way you used to be.

here's what actually broke my brain reading this 42-page survey: better models don't fix anything. everyone's obsessing over GPT-5 or Claude 4 or whatever's next, and the researchers basically said "you're all looking at the wrong variable."

success has nothing to do with model capability. it's about context engineering – how you feed information to the agent. it's about feedback loops – compiler errors + runtime failures + your gut check. it's about infrastructure – sandboxed environments, orchestration platforms, CI/CD integration. you've been optimizing prompts while the actual problem is your entire development environment.

they found five models hiding in your workflow and you've been accidentally mixing them without realizing it:
- Unconstrained Automation (you just let it run)
- Iterative Conversational Collaboration (you go back and forth)
- Planning-Driven (you break tasks down first)
- Test-Driven (you write specs that constrain it)
- Context-Enhanced (you feed it your entire codebase through RAG)

most teams are running 2-3 of these simultaneously. no wonder nothing works consistently.

and then the data says everything: productivity losses. not gains. losses. empirical studies showing developers are SLOWER with autonomous agents when they don't have proper scaffolding. because we're all treating this like it's autocomplete on steroids when it's actually a team member that needs memory systems, checkpoints, and governance.

we're stuck in the old mental model while the ground shifted beneath us. the bottleneck isn't the AI generating bad code. it's you assuming it's a tool when it's actually an agent.

What this actually means (and why it matters):
→ Context engineering > prompt engineering – stop crafting perfect prompts, start managing what the agent can see and access
→ Pure automation is a fantasy – every study shows hybrid models win; test-driven + context-enhanced combinations actually work
→ Your infrastructure is the product now – isolated execution, distributed orchestration, CI/CD integration aren't "nice to have" anymore, they're the foundation
→ Nobody's teaching the right skills – task decomposition, formalized verification, agent governance, provenance tracking... universities aren't preparing anyone for this
→ The accountability crisis is real – when AI-generated code ships a vulnerability, who's liable? developer? reviewer? model provider? we have zero frameworks for this
→ You're already behind – computing education hasn't caught up, graduates can't orchestrate AI workflows, the gap is widening daily

the shift happened. you're in it. pretending you're still "coding" is living in denial.
[image]
Brian B. Moser reposted
OguRyu🇩🇪@Oguryu417·
🚀 Excited to present our #CVPR2025 Highlight paper!! 🎨 TKG-DM: Training-free Chroma Key Content Generation with Diffusion Models 📍 Poster #227 @ ExHall D 🗓️ Sat, June 14 | 10:30 am–12:30 pm CDT 🎯 Fore/background separation via latent noise control — no fine-tuning needed!
[image]
AK@_akhaliq·
Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment
[image]
Brian B. Moser reposted
Ko Watanabe@ko_watanabe_en·
Best short paper award!!! #ETRA2025
[two images]
Brian B. Moser reposted
Stanislav Frolov@stfrolov·
Check out PromptMap, presented at IUI'25: a new interaction style for text-to-image models/data that lets users freely explore a vast collection of synthetic prompts through a map-like view with semantic zoom. Paper: arxiv.org/abs/2503.09436 Code: github.com/Bill2462/promp…
[image]