The NetHack Learning Environment

105 posts

The NetHack Learning Environment banner
The NetHack Learning Environment

The NetHack Learning Environment

@NetHack_LE

Official handle for the NetHack Learning Environment (https://t.co/vgI9FU0vn3)

Mazes of Menace انضم Nisan 2021
28 يتبع1.1K المتابعون
The NetHack Learning Environment أُعيد تغريده
Tim Rocktäschel
Tim Rocktäschel@_rockt·
Happy "@NetHack_LE is still completely unsolved" day for those of you who are celebrating it. We released The NetHack Learning Environment (arxiv.org/abs/2006.13760) on this day five years ago. Current frontier models achieve only ~1.7% progression (see balrogai.com). For a recent blog post on what makes it so hard for AI, check out @HenaffMikael's analysis: mikaelhenaff.substack.com/p/first-nethac…
Tim Rocktäschel tweet media
English
3
26
137
26.7K
Mikael Henaff
Mikael Henaff@HenaffMikael·
@dgant @_rockt @NetHack_LE Probably, since you're presumably been able to make progress in the real world which is more complex than NetHack.
English
1
0
1
88
Tim Rocktäschel
Tim Rocktäschel@_rockt·
Great post by @HenaffMikael (after ascending, what an achievement!) on what makes @NetHack_LE so extremely difficult for AI (even LLMs: balrogai.com). "While NetHack is complex in comparison to other RL benchmarks, it still contains only a tiny fraction of the complexity of the real world (its source code is 4.2MB, which provides an upper bound on its Kolmogorov complexity). As long as we can’t reliably solve this game for which we can easily collect lifetimes worth of data, have access to detailed textual resources (and even the underlying source code), and large-scale datasets of human gameplay, I think AGI remains a ways off."
Mikael Henaff@HenaffMikael

A couple bits of news: 1. Happy to share my first (human) NetHack ascension-next step is RL agents :) 2. I wrote a post discussing some @NetHack_LE challenges & how they map to open problems in RL & agentic AI. Still the best RL benchmark imo. mikaelhenaff.substack.com/p/first-nethac…

English
1
9
36
6.2K
The NetHack Learning Environment أُعيد تغريده
Mikael Henaff
Mikael Henaff@HenaffMikael·
A couple bits of news: 1. Happy to share my first (human) NetHack ascension-next step is RL agents :) 2. I wrote a post discussing some @NetHack_LE challenges & how they map to open problems in RL & agentic AI. Still the best RL benchmark imo. mikaelhenaff.substack.com/p/first-nethac…
Mikael Henaff tweet media
English
5
12
60
11.4K
The NetHack Learning Environment أُعيد تغريده
Stephen Oman
Stephen Oman@stephen_oman·
Happy to announce the latest release of @NetHack_LE (version 1.2.0). You can now use the seed function to make the dungeon layout reproducible across training episodes. The in-level interaction and combat is still randomly determined and doesn't impact lower level layouts.
Stephen Oman tweet media
English
1
3
28
7.6K
The NetHack Learning Environment أُعيد تغريده
Mikayel Samvelyan
Mikayel Samvelyan@_samvelyan·
⚔️ MiniHack Updates! ⚔️ 1️⃣ MiniHack 1.0.0 is here! Following popular demand, it now supports the new Gymnasium API and is built on NLE 1.1.0. Huge thanks to @Stephen_Oman (maintainer of @NetHack_LE ) for his outstanding contribution! 🙌
GIF
English
3
12
65
5K
The NetHack Learning Environment أُعيد تغريده
Martin Klissarov
Martin Klissarov@MartinKlissarov·
Can AI agents adapt zero-shot, to complex multi-step language instructions in open-ended environments? We present MaestroMotif, a method for AI-assisted skill design that produces highly capable and steerable hierarchical agents. To the best of our knowledge, it is the first method that, without expert labeled datasets, solves compositional tasks requiring hundreds of steps for completion. All the modules within MaestroMotif are learned from interaction: from the highest level of planning to the lowest-level of sensorimotor control. On the open-ended domain of NetHack, it surpasses existing approaches, including those that are fine-tuned specifically for each task. At the heart of MaestroMotif is the idea that decomposing a task into subtasks significantly helps decision making. MaestroMotif leverages an agent designer's intuition about a domain to identify important skills and describe them in natural language. These short descriptions then get converted into adaptable hierarchical agents through AI feedback and in-context learning. Our paper was recently published at ICLR 2025 and we open-source the whole project including the code, prompts and pre-trained models. Paper: arxiv.org/abs/2412.08542 Code: github.com/mklissa/maestr… NotebookLM Podcast: bit.ly/4jLi6mo This work was done with the amazing @HenaffMikael, @robertarail, @shagunsodhani, Pascal Vincent, @yayitsamyzhang, @pierrelux, Doina Precup, with equal supervision by @MarlosCMachado and @proceduralia. Take a look at the following thread:
English
6
52
201
80.1K
Andrej Karpathy
Andrej Karpathy@karpathy·
I quite like the idea using games to evaluate LLMs against each other, instead of fixed evals. Playing against another intelligent entity self-balances and adapts difficulty, so each eval (/environment) is leveraged a lot more. There's some early attempts around. Exciting area.
León@LeonGuertler

Perfect timing, we are just about to publish TextArena. A collection of 57 text-based games (30 in the first release) including single-player, two-player and multi-player games. We tried keeping the interface similar to OpenAI gym, made it very easy to add new games, and created an online leaderboard (you can let your model compete online against other models and humans). There are still some kinks to fix up, but we are actively looking for collaborators :) If you are interested check out textarena.ai, DM me or send an email to guertlerlo@cfar.a-star.edu.sg Next up, the plan is to use R1 style training to create a model with super-human soft-skills (i.e. theory of mind, persuasion, deception etc.)

English
253
408
5.9K
978.2K
The NetHack Learning Environment أُعيد تغريده
The NetHack Learning Environment أُعيد تغريده
Ethan Mollick
Ethan Mollick@emollick·
@SteveStricklan6 ARC will be solved before Nethack
English
1
4
43
15.4K
Joseph Suarez 🐡
Joseph Suarez 🐡@jsuarez·
@_rockt @PaglieriDavide As fast as nethack is, the main difficulty for RL is that it is too slow. I think we would have seen much more progress if it trained at 500k sps like our new envs
English
2
0
2
368