The NetHack Learning Environment

105 posts

The NetHack Learning Environment

@NetHack_LE

Official handle for the NetHack Learning Environment (https://t.co/vgI9FU0vn3)

Mazes of Menace Inscrit le Nisan 2021

28 Abonnements1.1K Abonnés

Tweet épinglé

The NetHack Learning Environment@NetHack_LE·11 May

Pleased to announce the NetHack Learning Environment has a new home, and a new set of maintainers! Find it at github.com/heiner/nle. A huge thanks to @stephen_oman and @MartinKlissarov for helping with the project! Happy hacking!

English

26.7K

The NetHack Learning Environment@NetHack_LE·11 Ağu

Excuse me?

dawon 🇺🇸@_imdawon

So uh why is almost all of game dev done with C++ and not C?

English

The NetHack Learning Environment@NetHack_LE·1 Ağu

@Ishmokin This is an excellent idea.

English

Ishmokin@Ishmokin·23 Tem

@NetHack_LE Is there a way to spectate games? Would be awesome if we could watch like how we do at nethack servers like alt.org and hardfought.org

English

107

The NetHack Learning Environment@NetHack_LE·23 Tem

1 43.6 Grok-4-Wiz-AI-Cha died in The Dungeons of Doom on level 1. Killed by a housecat.

Davide Paglieri@PaglieriDavide

LLMs acing math olympiads? Cute. But BALROG is where agents fight dragons (and actual Balrogs)🐉😈 And today, Grok-4 (@grok) takes the gold 🥇 Welcome to the podium, champion!

English

3.8K

The NetHack Learning Environment retweeté

Tim Rocktäschel@_rockt·22 Tem

💯 Who knew that the International Math Olympiad (IMO) is much easier than @NetHack_LE for AI.

Daniel Wolf@DanielWolf18

@apples_jimmy Meanwhile, another wall - @NetHack_LE - is still standing firm and tall.

English

7.8K

The NetHack Learning Environment retweeté

Tim Rocktäschel@_rockt·24 Haz

Happy "@NetHack_LE is still completely unsolved" day for those of you who are celebrating it. We released The NetHack Learning Environment (arxiv.org/abs/2006.13760) on this day five years ago. Current frontier models achieve only ~1.7% progression (see balrogai.com). For a recent blog post on what makes it so hard for AI, check out @HenaffMikael's analysis: mikaelhenaff.substack.com/p/first-nethac…

English

137

26.7K

The NetHack Learning Environment@NetHack_LE·20 Haz

@HenaffMikael @dgant @_rockt arguably

English

Mikael Henaff@HenaffMikael·9 Haz

@dgant @_rockt @NetHack_LE Probably, since you're presumably been able to make progress in the real world which is more complex than NetHack.

English

Tim Rocktäschel@_rockt·9 Haz

Great post by @HenaffMikael (after ascending, what an achievement!) on what makes @NetHack_LE so extremely difficult for AI (even LLMs: balrogai.com). "While NetHack is complex in comparison to other RL benchmarks, it still contains only a tiny fraction of the complexity of the real world (its source code is 4.2MB, which provides an upper bound on its Kolmogorov complexity). As long as we can’t reliably solve this game for which we can easily collect lifetimes worth of data, have access to detailed textual resources (and even the underlying source code), and large-scale datasets of human gameplay, I think AGI remains a ways off."

Mikael Henaff@HenaffMikael

A couple bits of news: 1. Happy to share my first (human) NetHack ascension-next step is RL agents :) 2. I wrote a post discussing some @NetHack_LE challenges & how they map to open problems in RL & agentic AI. Still the best RL benchmark imo. mikaelhenaff.substack.com/p/first-nethac…

English

6.2K

The NetHack Learning Environment retweeté

Mikael Henaff@HenaffMikael·9 Haz

English

11.4K

The NetHack Learning Environment@NetHack_LE·24 May

New home! github.com/NetHack-LE/nle

English

298

The NetHack Learning Environment@NetHack_LE·24 May

NetHack-LE accepts your allegiance.

English

607

The NetHack Learning Environment retweeté

Stephen Oman@stephen_oman·17 May

Happy to announce the latest release of @NetHack_LE (version 1.2.0). You can now use the seed function to make the dungeon layout reproducible across training episodes. The in-level interaction and combat is still randomly determined and doesn't impact lower level layouts.

English

7.6K

The NetHack Learning Environment retweeté

Mikayel Samvelyan@_samvelyan·14 Şub

⚔️ MiniHack Updates! ⚔️ 1️⃣ MiniHack 1.0.0 is here! Following popular demand, it now supports the new Gymnasium API and is built on NLE 1.1.0. Huge thanks to @Stephen_Oman (maintainer of @NetHack_LE ) for his outstanding contribution! 🙌

GIF

English

The NetHack Learning Environment retweeté

Martin Klissarov@MartinKlissarov·4 Şub

Can AI agents adapt zero-shot, to complex multi-step language instructions in open-ended environments? We present MaestroMotif, a method for AI-assisted skill design that produces highly capable and steerable hierarchical agents. To the best of our knowledge, it is the first method that, without expert labeled datasets, solves compositional tasks requiring hundreds of steps for completion. All the modules within MaestroMotif are learned from interaction: from the highest level of planning to the lowest-level of sensorimotor control. On the open-ended domain of NetHack, it surpasses existing approaches, including those that are fine-tuned specifically for each task. At the heart of MaestroMotif is the idea that decomposing a task into subtasks significantly helps decision making. MaestroMotif leverages an agent designer's intuition about a domain to identify important skills and describe them in natural language. These short descriptions then get converted into adaptable hierarchical agents through AI feedback and in-context learning. Our paper was recently published at ICLR 2025 and we open-source the whole project including the code, prompts and pre-trained models. Paper: arxiv.org/abs/2412.08542 Code: github.com/mklissa/maestr… NotebookLM Podcast: bit.ly/4jLi6mo This work was done with the amazing @HenaffMikael, @robertarail, @shagunsodhani, Pascal Vincent, @yayitsamyzhang, @pierrelux, Doina Precup, with equal supervision by @MarlosCMachado and @proceduralia. Take a look at the following thread:

English

201

80.1K

The NetHack Learning Environment@NetHack_LE·2 Şub

@demishassabis @karpathy cc @_rockt had that idea early!

English

681

Demis Hassabis@demishassabis·2 Şub

@karpathy Cool idea!

English

597

41.6K

Andrej Karpathy@karpathy·1 Şub

I quite like the idea using games to evaluate LLMs against each other, instead of fixed evals. Playing against another intelligent entity self-balances and adapts difficulty, so each eval (/environment) is leveraged a lot more. There's some early attempts around. Exciting area.

León@LeonGuertler

Perfect timing, we are just about to publish TextArena. A collection of 57 text-based games (30 in the first release) including single-player, two-player and multi-player games. We tried keeping the interface similar to OpenAI gym, made it very easy to add new games, and created an online leaderboard (you can let your model compete online against other models and humans). There are still some kinks to fix up, but we are actively looking for collaborators :) If you are interested check out textarena.ai, DM me or send an email to guertlerlo@cfar.a-star.edu.sg Next up, the plan is to use R1 style training to create a model with super-human soft-skills (i.e. theory of mind, persuasion, deception etc.)

English

253

408

5.9K

978.2K

The NetHack Learning Environment retweeté

Tim Rocktäschel@_rockt·20 Oca

💯 For me this is NetHack (see @NetHack_LE and balrogai.com). I am still holding my breath.

Noam Brown@polynoamial

It can be hard to “feel the AGI” until you see an AI surpass top humans in a domain you care deeply about. Competitive coders will feel it within a couple years. Paul is early but I think writers will feel it too. Everyone will have their Lee Sedol moment at a different time.

English

4.5K

The NetHack Learning Environment retweeté

Tim Rocktäschel@_rockt·23 Ara

In the meantime, @NetHack_LE and balrogai.com...

GIF

Chubby♨️@kimmonismus

So everyone, each of us now has to work hard to develop new benchmarks. Because oh boy will they be solved quickly.

English

4.6K

The NetHack Learning Environment@NetHack_LE·21 Ara

cc @_rockt

Balaji@balajis

Whatever the thing is you think AI can’t do, benchmark it and then the world will hill climb towards it.

360

The NetHack Learning Environment retweeté

Tim Rocktäschel@_rockt·20 Ara

Yearly reminder

heiner@HeinrichKuttler

"Is it AGI" flow chart. Developed with @_rockt at NeurIPS 2022.

English

6.7K

The NetHack Learning Environment retweeté

Ethan Mollick@emollick·23 Kas

@SteveStricklan6 ARC will be solved before Nethack

English

15.4K

The NetHack Learning Environment@NetHack_LE·30 Kas

@jsuarez @_rockt @PaglieriDavide Seems slightly dubious in that you can run 1 NetHack / core at 1000k+ SPS total, per machine.

English

112

Joseph Suarez 🐡@jsuarez·24 Kas

@_rockt @PaglieriDavide As fast as nethack is, the main difficulty for RL is that it is too slow. I think we would have seen much more progress if it trained at 500k sps like our new envs

English

368

Tim Rocktäschel@_rockt·24 Kas

💯‼️ That's why @PaglieriDavide created balrogai.com

Ethan Mollick@emollick

@SteveStricklan6 ARC will be solved before Nethack

English

4.3K

Découvrir

@Ishmokin @HenaffMikael @dgant @_rockt @Stephen_Oman @robertarail @shagunsodhani @yayitsamyzhang