Alexey Skrynnik

83 posts

Alexey Skrynnik

@Tviskaron

Katılım Eylül 2013

51 Takip Edilen36 Takipçiler

Alexey Skrynnik@Tviskaron·29 Haz

@FlexAppeall @Karolis_Ram Asynchronous PPO for large-scale experiments, i.e. github.com/alex-petrenko/…

English

Andile@FlexAppeall·28 Haz

@Tviskaron @Karolis_Ram By APPO you mean Augmented PPO? Like this? ojs.aaai.org/index.php/AAAI…

English

Karolis Jucys@Karolis_Ram·1 May

Sad to see that the update made the huge simplification of MineRL even less transparent. Without it, DeepMind’s Dreamer would get ~0 score. Before:”We following prior work and increase the speed at which blocks break” Now:”We follow the block breaking setting of prior work” 1/8

Danijar Hafner@danijarh

🌎 Excited to share a major update of the DreamerV3 agent! A couple of smaller changes, more benchmarks, and substantially improved performance. 👇 Main differences from our earlier preprint:

English

25.7K

Alexey Skrynnik@Tviskaron·9 May

@_kei18 Glad to hear the honey is safe!😊

English

Keisuke Okumura@_kei18·9 May

@Tviskaron No no, 🍯 survived ! Just forgot to include it in the photo. I enjoy lovely gifts l

English

Keisuke Okumura@_kei18·5 May

開けても開けても出てくるマトリョーシカ🪆

日本語

707

Alexey Skrynnik@Tviskaron·11 Nis

@MikhailBurtsev 😄😄😄

QME

Mikhail Burtsev@MikhailBurtsev·11 Nis

When senior colleagues join the team at academic race.

English

422

Alexey Skrynnik retweetledi

Negar Arabzadeh@NegarEmpr·5 Nis

What a way to wrap up @IgluContest! Our paper “IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents” accepted to @SIGIRConf including: 1) rich multi-modal dataset 2) A data collection tool 3) An online eval framework #SIGIR2025

English

Alexey Skrynnik@Tviskaron·13 Mar

Decentralized MAPF? :)

Massimo@Rainmaker1973

Two equally smart Amazon robots

Español

200

Alexey Skrynnik@Tviskaron·23 Oca

I’m happy to share that our paper, POGEMA: A Benchmark Platform for Cooperative Multi-Agent Pathfinding, has been accepted to the ICLR 2025 Conference! arxiv.org/abs/2407.14931 openreview.net/forum?id=6VgwE… See you in Singapore!

English

304

Alexey Skrynnik@Tviskaron·10 Ara

I’m happy to announce that our paper, MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale, has been accepted to the AAAI 2025 Conference! arxiv.org/abs/2409.00134 github.com/CognitiveAISys…

English

193

Alexey Skrynnik@Tviskaron·10 Eyl

@_kei18 Thank you, Keisuke :) However, I believe MAPF-GPT can be further improved by RL fine-tuning. P.S. LaCAM is great!

English

Keisuke Okumura@_kei18·4 Eyl

MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale arxiv.org/abs/2409.00134 I like this direction, rather than developing better RL policies for MAPF.

English

2.5K

Alexey Skrynnik@Tviskaron·6 Eyl

Here are the links to the preprint: arxiv.org/abs/2409.00134 and the source code: github.com/Cognitive-AI-S…. The repository includes training code, pre-trained weights for the 2M, 6M, and 85M models (uploaded to HuggingFace), and a dataset of 1 billion observation/action pairs.

English

Alexey Skrynnik@Tviskaron·6 Eyl

MAPF-GPT performs exceptionally well on unseen instances and outperforms state-of-the-art learnable solvers such as SCRIMP and DCC, having better runtime efficiency. (2/3)

English

Alexey Skrynnik@Tviskaron·6 Eyl

I’m excited to announce our recent preprint titled MAPF-GPT, a GPT-like model designed for MAPF problems. It is trained using pure imitation learning on trajectories generated by LaCAM. (1/3)

English

Alexey Skrynnik retweetledi

ICAPS Conference@ICAPSConference·2 Mar

PRL (Workshop) @ ICAPS 2024 *CALL* - If you're bridging the gap between AI Planning and Reinforcement Learning, join the PRL workshop - prl-theworkshop.github.io/prl2024-icaps/. Submissions due March 22

English

625

Alexey Skrynnik@Tviskaron·9 Ara

More details: ar5iv.labs.arxiv.org/html/2310.01207. Big thanks to my co-authors Anton Andreychuk, Maria Nesterova, Konstantin Yakovlev, and Aleksandr Panov. The second paper, which I will detail after the camera-ready version, focuses on using Neural MCTS for the LMAPF tasks. (7/7)

English

135

Alexey Skrynnik@Tviskaron·9 Ara

Against centralized MAPF algorithms, Follower excels, especially under a strict 1-second time constraint in scenarios with a large number of agents (over 160 for the Warehouse map). (6/7)

English

168

Alexey Skrynnik@Tviskaron·9 Ara

I'm excited to announce that two of my papers have been accepted at AAAI-2024, delving into lifelong multi-agent pathfinding (LMAPF) using RL and planning techniques. Here's the thread: (1/7).

English

362

Keşfet

@FlexAppeall @Karolis_Ram @_kei18 @MikhailBurtsev @IgluContest @SIGIRConf @elonmusk @BarackObama