Alexey Skrynnik

83 posts

Alexey Skrynnik banner
Alexey Skrynnik

Alexey Skrynnik

@Tviskaron

Katılım Eylül 2013
51 Takip Edilen36 Takipçiler
Karolis Jucys
Karolis Jucys@Karolis_Ram·
Sad to see that the update made the huge simplification of MineRL even less transparent. Without it, DeepMind’s Dreamer would get ~0 score. Before:”We following prior work and increase the speed at which blocks break” Now:”We follow the block breaking setting of prior work” 1/8
Danijar Hafner@danijarh

🌎 Excited to share a major update of the DreamerV3 agent! A couple of smaller changes, more benchmarks, and substantially improved performance. 👇 Main differences from our earlier preprint:

English
5
9
85
25.7K
Keisuke Okumura
Keisuke Okumura@_kei18·
@Tviskaron No no, 🍯 survived ! Just forgot to include it in the photo. I enjoy lovely gifts l
English
1
0
1
42
Keisuke Okumura
Keisuke Okumura@_kei18·
開けても開けても出てくるマトリョーシカ🪆
Keisuke Okumura tweet media
日本語
1
0
7
707
Mikhail Burtsev
Mikhail Burtsev@MikhailBurtsev·
When senior colleagues join the team at academic race.
Mikhail Burtsev tweet media
English
2
0
5
422
Alexey Skrynnik retweetledi
Negar Arabzadeh
Negar Arabzadeh@NegarEmpr·
What a way to wrap up @IgluContest! Our paper “IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents” accepted to @SIGIRConf including: 1) rich multi-modal dataset 2) A data collection tool 3) An online eval framework #SIGIR2025
English
1
1
20
8K
Alexey Skrynnik
Alexey Skrynnik@Tviskaron·
@_kei18 Thank you, Keisuke :) However, I believe MAPF-GPT can be further improved by RL fine-tuning. P.S. LaCAM is great!
English
1
0
0
92
Keisuke Okumura
Keisuke Okumura@_kei18·
MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale arxiv.org/abs/2409.00134 I like this direction, rather than developing better RL policies for MAPF.
Keisuke Okumura tweet media
English
3
1
6
2.5K
Alexey Skrynnik
Alexey Skrynnik@Tviskaron·
MAPF-GPT performs exceptionally well on unseen instances and outperforms state-of-the-art learnable solvers such as SCRIMP and DCC, having better runtime efficiency. (2/3)
Alexey Skrynnik tweet media
English
1
0
0
87
Alexey Skrynnik
Alexey Skrynnik@Tviskaron·
I’m excited to announce our recent preprint titled MAPF-GPT, a GPT-like model designed for MAPF problems. It is trained using pure imitation learning on trajectories generated by LaCAM. (1/3)
Alexey Skrynnik tweet media
English
1
0
0
80
Alexey Skrynnik retweetledi
ICAPS Conference
ICAPS Conference@ICAPSConference·
PRL (Workshop) @ ICAPS 2024 *CALL* - If you're bridging the gap between AI Planning and Reinforcement Learning, join the PRL workshop - prl-theworkshop.github.io/prl2024-icaps/. Submissions due March 22
English
0
3
5
625
Alexey Skrynnik
Alexey Skrynnik@Tviskaron·
More details: ar5iv.labs.arxiv.org/html/2310.01207. Big thanks to my co-authors Anton Andreychuk, Maria Nesterova, Konstantin Yakovlev, and Aleksandr Panov. The second paper, which I will detail after the camera-ready version, focuses on using Neural MCTS for the LMAPF tasks. (7/7)
English
0
0
0
135
Alexey Skrynnik
Alexey Skrynnik@Tviskaron·
Against centralized MAPF algorithms, Follower excels, especially under a strict 1-second time constraint in scenarios with a large number of agents (over 160 for the Warehouse map). (6/7)
Alexey Skrynnik tweet media
English
1
0
0
168
Alexey Skrynnik
Alexey Skrynnik@Tviskaron·
I'm excited to announce that two of my papers have been accepted at AAAI-2024, delving into lifelong multi-agent pathfinding (LMAPF) using RL and planning techniques. Here's the thread: (1/7).
English
1
0
0
362