Phillip Rust

50 posts

@rust_phillip

Research Scientist @AIatMeta (FAIR) • PhD @coastalcph

Paris, France · Joined July 2020
601 Following · 388 Followers
Pinned Tweet
Phillip Rust @rust_phillip
Happy to share that our paper on language modelling with pixels has been accepted to ICLR'23 (notable-top-5% / oral) 🎉. Big thanks and congrats to Team-PIXEL @jonasflotz @ebugliarello @esalesk @mdlhx @delliott and looking forward to presenting in Kigali! 🌍 #ICLR2023
Emanuele Bugliarello @ebugliarello

Tired of tokenizers/subwords? Check out PIXEL, a new language model that processes written text as images📸 “Language Modelling with Pixels” 📄 arxiv.org/abs/2207.06991 🧑‍💻github.com/xplip/pixel 🤖huggingface.co/Team-PIXEL/pix… by @rust_phillip @jonasflotz me @esalesk @mdlhx @delliott

Phillip Rust retweeted
Jinpeng Wang @awinyimgprocess
Humans see text — but LLMs don’t. I wrote a short blog post exploring how models can perceive text visually rather than tokenize it: 🔗 csu-jpg.github.io/Blog/people_se… From PIXEL, CLIPPO, VisInContext, VIST to DeepSeek-OCR, this is a quick story of how vision-centric modeling is changing how machines read, and a reflection on some of our own small efforts in the past two years.
Phillip Rust @rust_phillip
I will be presenting this work in person at ACL🇹🇭 this week. Drop by if you'd like to chat! Oral: Today (Monday) 16:30. Poster: Tuesday (Tomorrow) 10:30 - 12:00.
Phillip Rust @rust_phillip

Introducing “Towards Privacy-Aware Sign Language Translation at Scale”. We leverage self-supervised pretraining on anonymized videos, achieving SOTA ASL-to-English translation performance while mitigating risks arising from biometric data. 📄: arxiv.org/abs/2402.09611 🧵(1/9)

Phillip Rust @rust_phillip
For more experiments and all the details, check out our arXiv preprint linked above. We are working on releasing our code and data, so stay tuned! 👨‍💻 🧵(8/9)
Phillip Rust @rust_phillip
Introducing “Towards Privacy-Aware Sign Language Translation at Scale”. We leverage self-supervised pretraining on anonymized videos, achieving SOTA ASL-to-English translation performance while mitigating risks arising from biometric data. 📄: arxiv.org/abs/2402.09611 🧵(1/9)
Phillip Rust retweeted
Tianyu Gao @gaotianyu1350
New preprint "Improving Language Understanding from Screenshots" w/ @zwcolin @AdithyaNLP @danqi_chen. We improve language understanding abilities of screenshot LMs, an emerging family of models that processes everything (including text) via visual inputs arxiv.org/abs/2402.14073
Phillip Rust retweeted
Desmond Elliott @delliott
In PHD: Pixel-Based Language Modeling of Historical Documents with @NadavBorenstein @rust_phillip and @IAugenstein, we apply pixel language models to processing historical documents and to more standard NLP classification tasks too. See it in Poster Session 6 on Sunday 10th.
Phillip Rust retweeted
Desmond Elliott @delliott
In Text Rendering Strategies for Pixel Language Models with @jonasflotz @rust_phillip and @esalesk, we design new text renderers for visual language processing to improve performance or to squeeze the model down to just 22M parameters. See it in Poster Session 2 on Friday 8th.
Phillip Rust retweeted
AI at Meta @AIatMeta
Introducing SeamlessM4T, the first all-in-one, multilingual multimodal translation model. This single model can perform tasks across speech-to-text, speech-to-speech, text-to-text translation & speech recognition for up to 100 languages depending on the task. Details ⬇️
Phillip Rust retweeted
Desmond Elliott @delliott
📢 I am hiring a postdoc to join our project on pixel-based natural language processing. The position is based in Copenhagen 🇩🇰 for 18 months. Applications are due by March 29 employment.ku.dk/faculty/?show=…. Informal inquiries are welcome.
Desmond Elliott @delliott

Thrilled to receive a grant from @VILLUMFONDEN to carry out blue-skies research on tokenization-free NLP veluxfoundations.dk/en/about/proje… I will hire PhDs and postdocs to build up the group, so feel free to reach out. We're starting off with a paper at #ICLR2023 openreview.net/forum?id=FkSp8…
