

Guardrails with custom polices are hard for models trained on safety and harm-related datasets. But what if you trained a guardian model on arbitrary rules? Introducing DynaGuard, a guardian model for custom policies: arxiv.org/abs/2509.02563
Bayan Bruss
825 posts

@cbbruss
VP of Applied AI Research @CapitalOne | Adjunct @Georgetown | Simple baselines, practical implementations.


Guardrails with custom polices are hard for models trained on safety and harm-related datasets. But what if you trained a guardian model on arbitrary rules? Introducing DynaGuard, a guardian model for custom policies: arxiv.org/abs/2509.02563





LLaVA-Critic-R1 Your Critic Model is Secretly a Strong Policy Model



100 page prompt is crazy

So excited to join the new @SimonsFdn Simons Collaboration on the Physics of Learning and Neural Computation, to further advance our understanding (mech interp) of learning & reasoning in large networks, including classical deep nets and other bio-inspired network models!

Today we're releasing NVIDIA Nemotron Nano v2 - a 9B hybrid SSM that is 6X faster than similarly sized models, while also being more accurate. Along with this model, we are also releasing most of the data we used to create it, including the pretraining corpus. Links to the models, datasets, and tech report are here: research.nvidia.com/labs/adlr/NVID…








i'm increasingly convinced that "transformative ai" is going to look like an abundance of specialized models for everything from drug design to weather sims to robotics to supply chains, not one agent to rule them all. we're going to need a lot more ai researchers



Francois Chollet says human-level AI is still far off true intelligence means learning new skills fast, but LLMs are very poor at this we're only taking baby steps in adapting to new situations in real time "LLMs might help, but they won't be the core of real intelligence"




What I look for when hiring? EXTREME PARANOIA about code and data


🚨 Olympiad math + AI: We ran Google’s Gemini 2.5 Pro on the fresh IMO 2025 problems. With careful prompting and pipeline design, it solved 5 out of 6 — remarkable for tasks demanding deep insight and creativity. The model could win gold! 🥇 #AI #Math #LLMs #IMO2025

The code and model weights for this paper are finally open! Despite being a little late for releasing them, I hope you will find them useful! Code: github.com/facebookresear… Models: - (ViT-G): huggingface.co/lavoies/llip-v… - (ViT-B): huggingface.co/lavoies/llip-v…