Roger Girgis

16 posts

Roger Girgis

Roger Girgis

@rogg1111

PhD student @ Mila with Prof Christopher Pal. RS@Torc Robotics

Katılım Mayıs 2015
174 Takip Edilen187 Takipçiler
Roger Girgis retweetledi
Alexandre
Alexandre@alexpiche_·
In-flight weight updates have gone from a “weird trick” to a must to train LLMs with RL in the last few weeks. If you want to understand the on-policy and throughput benefits here’s the CoLM talk @DBahdanau and I gave: youtu.be/Z1uEuRKACRs
YouTube video
YouTube
English
1
29
143
68.8K
Roger Girgis retweetledi
Luke Rowe
Luke Rowe@Luke22R·
Scenario Dreamer has been accepted at #CVPR2025! Website: …ceton-computational-imaging.github.io/scenario-dream… We train a vectorized latent diffusion model to synthesize high-fidelity driving simulation environments (agents+map). Scenario Dreamer enables fully data-driven closed-loop generative simulation!
GIF
English
7
72
424
29.3K
Roger Girgis retweetledi
Luke Rowe
Luke Rowe@Luke22R·
Resharing as CtRL-Sim was accepted at #CoRL2024, which we will present this week in Munich! Also, all the code is now available, so you can start training your own controllable and reactive driving agents: github.com/montrealroboti…
Luke Rowe@Luke22R

How can we generate interesting edge cases to test our autonomous vehicles in simulation? We propose CtRL-Sim, a novel framework for closed-loop behaviour simulation that enables fine-grained control over agent behaviours. 🧵 1/8 arxiv.org/abs/2403.19918

English
0
7
15
1.9K
Roger Girgis retweetledi
Felix Heide
Felix Heide@_FelixHeide_·
Check out CtRL-Sim later this week at #CoRL2024 in Munich! How can we generate interesting edge cases to test autonomous vehicles in simulation? We propose CtRL-Sim for closed-loop behavior simulation that enables fine-grained control over agent behaviors. CtRL-Sim leverages return-conditioned offline reinforcement learning with exponential tilting for multi-agent closed-loop, reactive, and controllable behavior simulation. CtRL-Sim can modify existing traffic scenarios to generate a wide range of agent behaviors by “tilting” each agent towards good (positive tilting) or bad (negative tilting) driving behaviors. Project Page: torc.ai/knowledge-cent… Work led by Luke Rowe and Roger Girgis with amazing collaborators Anthony Gosselin, Bruno Carrez, Florian Golemo, Liam Paull, and Chris Pal.
English
0
6
25
2.6K
Roger Girgis retweetledi
Alexandre
Alexandre@alexpiche_·
Introducing ReSearch: An iterative self-reflection algorithm that enhances LLM's self-restraint abilities: • Encouraging abstention when uncertain • Producing accurate, informative content when confident Result: Significant accuracy boost for Llama2 7B Chat and Mistral 7B! 🚀
GIF
English
1
43
100
18.3K
Roger Girgis retweetledi
Ge Ya (Olga) Luo
Ge Ya (Olga) Luo@OOOOLGAluo·
We’re thrilled to introduce Ctrl-V! Our new video diffusion model uses 2D and 3D bounding boxes to predict & control object motions in videos. Learn more, read the full report & access our code on our website: oooolga.github.io/ctrl-v.github.…
English
3
10
40
6.6K
Roger Girgis retweetledi
Jamie Shotton
Jamie Shotton@Jamie_Shotton·
One of the hardest challenges in developing AI for autonomous vehicles is evaluating the performance of our driving models. Why? (A short 🧵on our latest research on multi-agent RL).
English
10
70
437
141.7K
Roger Girgis retweetledi
Julien Roy
Julien Roy@juleroy13·
Your RL agent is not behaving as expected? Try Direct Behavior Specification via Constrained RL! In our #ICML2022 paper we propose to use a special family of constraints to specify behavior instead of forcing everything into a single reward. Blog is out: tinyurl.com/3z6zy6n2
English
0
11
29
0
Roger Girgis retweetledi
Simon Guiroy
Simon Guiroy@GuiroySimon·
Our paper (accepted at @CoLLAs_Conf) is out! arxiv.org/abs/2208.02377 We show that Meta-Learning generalization to novel OOD task distributions can be inferred from the neural activation dynamics from a few unlabeled examples, and we propose Activation-Based Early-Stopping (ABE).
GIF
English
3
25
93
0
Roger Girgis retweetledi
Marco Pavone
Marco Pavone@drmapavone·
We have open sourced github.com/nvr-avg/trajda…! It's a new, unified interface to many trajectory forecasting datasets, greatly simplifying the process of training and evaluating a forecasting model on multiple motion datasets! @iamborisi @NVIDIADRIVE
English
0
7
40
0
Roger Girgis retweetledi
Dmitri Dolgov
Dmitri Dolgov@dmitri_dolgov·
According to the @USDOT, someone runs a red light every ~20 min. Proud of the work by our team to make roads safer, like in this situation where the @Waymo driver in fully-auto mode (no human in the driver’s seat) safely reacts to a car blowing through a red light in Phoenix.
GIF
English
7
26
147
0
Roger Girgis
Roger Girgis@rogg1111·
…And the best thing? With our seed parameter trick, you can train it on a GTX 1080Ti in 3h. No more TPUs and days worth of training. Low computation => easier on the 🌎. 🧵3/4
English
1
0
3
0
Roger Girgis
Roger Girgis@rogg1111·
Transformers are the go-to model for language, but they can also be used for motion prediction in cars 🚗 & robots 🤖. Our #ICLR2022 spotlight paper presents AutoBots, a fast & effective multi-agent trajectory forecasting Transformer 🧵1/4 arXiv: arxiv.org/abs/2104.00563
GIF
English
2
24
124
0