Sabitlenmiş Tweet

This week's blog post explores methods for incorporating longer-term context in transformers!
Featuring 6 unique approaches:
- Sparse Transformers
- Adaptive Span Transformers
- Transformer-XL
- Compressive Transformers
- Reformer
- Routing Transformer
pragmatic.ml/a-survey-of-me…
English




















