Roger Girgis

16 posts

Roger Girgis

@rogg1111

PhD student @ Mila with Prof Christopher Pal. RS@Torc Robotics

Katılım Mayıs 2015

174 Takip Edilen187 Takipçiler

Roger Girgis retweetledi

Alexandre@alexpiche_·4 Kas

In-flight weight updates have gone from a “weird trick” to a must to train LLMs with RL in the last few weeks. If you want to understand the on-policy and throughput benefits here’s the CoLM talk @DBahdanau and I gave: youtu.be/Z1uEuRKACRs

YouTube

English

143

68.8K

Roger Girgis retweetledi

Luke Rowe@Luke22R·7 Nis

Scenario Dreamer has been accepted at #CVPR2025! Website: …ceton-computational-imaging.github.io/scenario-dream… We train a vectorized latent diffusion model to synthesize high-fidelity driving simulation environments (agents+map). Scenario Dreamer enables fully data-driven closed-loop generative simulation!

GIF

English

424

29.3K

Roger Girgis retweetledi

Luke Rowe@Luke22R·5 Kas

Resharing as CtRL-Sim was accepted at #CoRL2024, which we will present this week in Munich! Also, all the code is now available, so you can start training your own controllable and reactive driving agents: github.com/montrealroboti…

Luke Rowe@Luke22R

How can we generate interesting edge cases to test our autonomous vehicles in simulation? We propose CtRL-Sim, a novel framework for closed-loop behaviour simulation that enables fine-grained control over agent behaviours. 🧵 1/8 arxiv.org/abs/2403.19918

English

1.9K

Roger Girgis retweetledi

Felix Heide@_FelixHeide_·5 Kas

Check out CtRL-Sim later this week at #CoRL2024 in Munich! How can we generate interesting edge cases to test autonomous vehicles in simulation? We propose CtRL-Sim for closed-loop behavior simulation that enables fine-grained control over agent behaviors. CtRL-Sim leverages return-conditioned offline reinforcement learning with exponential tilting for multi-agent closed-loop, reactive, and controllable behavior simulation. CtRL-Sim can modify existing traffic scenarios to generate a wide range of agent behaviors by “tilting” each agent towards good (positive tilting) or bad (negative tilting) driving behaviors. Project Page: torc.ai/knowledge-cent… Work led by Luke Rowe and Roger Girgis with amazing collaborators Anthony Gosselin, Bruno Carrez, Florian Golemo, Liam Paull, and Chris Pal.

English

2.6K

Roger Girgis retweetledi

Alexandre@alexpiche_·3 Tem

Introducing ReSearch: An iterative self-reflection algorithm that enhances LLM's self-restraint abilities: • Encouraging abstention when uncertain • Producing accurate, informative content when confident Result: Significant accuracy boost for Llama2 7B Chat and Mistral 7B! 🚀

GIF

English

100

18.3K

Roger Girgis retweetledi

Ge Ya (Olga) Luo@OOOOLGAluo·18 Haz

We’re thrilled to introduce Ctrl-V! Our new video diffusion model uses 2D and 3D bounding boxes to predict & control object motions in videos. Learn more, read the full report & access our code on our website: oooolga.github.io/ctrl-v.github.…

English

6.6K

Roger Girgis@rogg1111·18 Haz

Check out our awesome new work, CtRL-Sim, a framework for reactive and Controllable behaviour simulation using Factorized Return-conditioned supervised learning.

Luke Rowe@Luke22R

English

141

Roger Girgis retweetledi

Jamie Shotton@Jamie_Shotton·5 Oca

One of the hardest challenges in developing AI for autonomous vehicles is evaluating the performance of our driving models. Why? (A short 🧵on our latest research on multi-agent RL).

English

437

141.7K

Roger Girgis retweetledi

Julien Roy@juleroy13·23 Ağu

Your RL agent is not behaving as expected? Try Direct Behavior Specification via Constrained RL! In our #ICML2022 paper we propose to use a special family of constraints to specify behavior instead of forcing everything into a single reward. Blog is out: tinyurl.com/3z6zy6n2

English

Roger Girgis retweetledi

Simon Guiroy@GuiroySimon·23 Ağu

Our paper (accepted at @CoLLAs_Conf) is out! arxiv.org/abs/2208.02377 We show that Meta-Learning generalization to novel OOD task distributions can be inferred from the neural activation dynamics from a few unlabeled examples, and we propose Activation-Based Early-Stopping (ABE).

GIF

English

Roger Girgis retweetledi

Marco Pavone@drmapavone·13 Ağu

We have open sourced github.com/nvr-avg/trajda…! It's a new, unified interface to many trajectory forecasting datasets, greatly simplifying the process of training and evaluating a forecasting model on multiple motion datasets! @iamborisi @NVIDIADRIVE

English

Roger Girgis retweetledi

Dmitri Dolgov@dmitri_dolgov·1 Haz

According to the @USDOT, someone runs a red light every ~20 min. Proud of the work by our team to make roads safer, like in this situation where the @Waymo driver in fully-auto mode (no human in the driver’s seat) safely reacts to a car blowing through a red light in Phoenix.

GIF

English

147

Roger Girgis@rogg1111·23 Mar

Joint work with @chrisjpal, @felipealcm, @shoddy_robots, @SamiraEKahou, @jimdsouza, @_FelixHeide_, and Martin Weiss at @Mila_Quebec and @Algolux @etsmtl @polymtl. 🚗+🤖s -->😄🙏 🧵4/4

English

Roger Girgis@rogg1111·23 Mar

…And the best thing? With our seed parameter trick, you can train it on a GTX 1080Ti in 3h. No more TPUs and days worth of training. Low computation => easier on the 🌎. 🧵3/4

English

Roger Girgis@rogg1111·23 Mar

Transformers are the go-to model for language, but they can also be used for motion prediction in cars 🚗 & robots 🤖. Our #ICLR2022 spotlight paper presents AutoBots, a fast & effective multi-agent trajectory forecasting Transformer 🧵1/4 arXiv: arxiv.org/abs/2104.00563

GIF

English

124

Keşfet

@DBahdanau @CoLLAs_Conf @iamborisi @NVIDIADRIVE @USDOT @Waymo @chrisjpal @felipealcm