Benjamin Fuhrer

@BenjaminFuhrer

参加日 Ekim 2020

939 フォロー中53 フォロワー

Benjamin Fuhrer@BenjaminFuhrer·9 Tem

I’ll be presenting GBRL next Wednesday at ICML this year. Come chat! #icml2025

AK@_akhaliq

Gradient Boosting Reinforcement Learning Neural networks (NN) achieve remarkable results in various tasks, but lack key characteristics: interpretability, support for categorical features, and lightweight implementations suitable for edge devices. While ongoing efforts aim to address these challenges, Gradient Boosting Trees (GBT) inherently meet these requirements. As a result, GBTs have become the go-to method for supervised learning tasks in many real-world applications and competitions. However, their application in online learning scenarios, notably in reinforcement learning (RL), has been limited. In this work, we bridge this gap by introducing Gradient-Boosting RL (GBRL), a framework that extends the advantages of GBT to the RL domain. Using the GBRL framework, we implement various actor-critic algorithms and compare their performance with their NN counterparts. Inspired by shared backbones in NN we introduce a tree-sharing approach for policy and value functions with distinct learning rates, enhancing learning efficiency over millions of interactions. GBRL achieves competitive performance across a diverse array of tasks, excelling in domains with structured or categorical features.

English

189

Benjamin Fuhrer@BenjaminFuhrer·7 Tem

Heading to #ICML2025 to share our GBRL poster! • Gradient-boosted trees for RL • Strong performance & OOD robustness • CUDA-fast, SB3-ready, lightweight deployment Let’s talk in Vancouver! 📄 arxiv.org/pdf/2407.08250 💻 github.com/NVlabs/gbrl @DalalGal @ChenTessler #RL #AI

English

539

Benjamin Fuhrer@BenjaminFuhrer·27 Eyl

@liimeleemon Congrats! Great work!

English

Quentin Delfosse@liimeleemon·27 Eyl

So happy that our paper Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents (arxiv.org/pdf/2401.05821) has been accepted at NeurIPS 2024! 🎉 If you are wondering why RL agents cannot generalize to new scenarios and how to mitigate it, check it out !

English

3.7K

Benjamin Fuhrer@BenjaminFuhrer·2 Kas

@Yampeleg בוצע.

עברית

Yam Peleg@Yampeleg·2 Kas

הזרוע הצבאית של ההסברה. שמעתם ש: - "ישראל הרגה את האזרחים של עצמה ב7.10?" - "יש לישראל אולפן להפקת סרטונים של חמאס הורג אזרחים?" - "אין עדויות לאונס?" לא יהיה. זה נגמר. המטרה: להשמיד את החשבונות שמפיצים עלינו שקרים. הם צריכים להחסם. דפ"א: דיווח מאסיבי מאלפי חשבונות ישראלים לכל אחד ואחד מהשקרים שהם מפיצים. בלי בוטים. בלי טריקים. רק אנשים אמיתיים. [שחלקם איבדו את המשפחות והחברים שלהם ונמאס להם לשמוע את הזבל הזה כבר] אנחנו מסוגלים לגרום לזה להעלם אם נעבוד ביחד. איך זה עובד? 1. אני אפרסם בכל יום רשימה מסודרת של שקרים. 2. כולנו עוברים עליהם אחד אחד ומדווחים עליהם. 30 שניות ביום. זהו. בלי Community Notes. רק Report. הם צריכים להמחק. --- בנק מטרות - 02.11.2023: 1. "ההרוגים בדרום נקלעו לחילופי אש ונפגעו בטעות:" x.com/partisangirl/s… 2. "שני לוק בכלל בחיים." x.com/partisangirl/s… 3. "חמאס מטפל יפה בקשישים וילדים:" x.com/partisangirl/s… 4. "הרקטות לא אמיתיות:" x.com/shaykhsulaiman… 5. "כנסיות שלא באמת הפצצנו:" x.com/shaykhsulaiman… 6. "האולפן שלנו שבו מפיקים את הסרטונים של חמאס יורה באזרחים:" x.com/shaykhsulaiman… רשאים, אש. --- שתפו את זה בבקשה עם כל העולם ואחותו. אני לא מנסה להגדיל את חשבון הטוויטר שלי. פשוט נשבר לי. ---

עברית

145

13.5K

Benjamin Fuhrer がリツイート

Gal Dalal@DalalGal·26 Haz

We released a multi-agent RL framework for network congestion control with the first public realistic network simulator! github.com/NVlabs/RLCC. Based on the amazing work of @BenjaminFuhrer and @ChenTessler

English

1.8K

Benjamin Fuhrer がリツイート

Gal Dalal@DalalGal·7 Tem

Very excited to present the first #AI-based datacenter network congestion control solution that finally runs on live NICs and beats all other SOTA competitors! @BenjaminFuhrer @tesslerc @YShpigelman blogs.nvidia.com/blog/2022/06/1… #NVIDIA #reinforcementlearning #cloudnetworking

English

Benjamin Fuhrer@BenjaminFuhrer·2 Şub

I just published in @TDataScience Integer-Only Inference for Deep Learning in Native C towardsdatascience.com/integer-only-i…

English

ディスカバー

@DalalGal @ChenTessler @liimeleemon @Yampeleg @YShpigelman @TDataScience @elonmusk @BarackObama