Phillip Rust

50 posts

Phillip Rust

Phillip Rust

@rust_phillip

Research Scientist @AIatMeta (FAIR) • PhD @coastalcph

Paris, France เข้าร่วม Temmuz 2020
596 กำลังติดตาม391 ผู้ติดตาม
ทวีตที่ปักหมุด
Phillip Rust
Phillip Rust@rust_phillip·
Happy to share our paper on language modelling with pixels has been accepted to ICLR‘23 (notable-top-5% / oral) 🎉. Big thanks and congrats to Team-PIXEL @jonasflotz @ebugliarello @esalesk @mdlhx @delliott and looking forward to presenting in Kigali! 🌍 #ICLR2023
Emanuele Bugliarello@ebugliarello

Tired of tokenizers/subwords? Check out PIXEL, a new language model that processes written text as images📸 “Language Modelling with Pixels” 📄 arxiv.org/abs/2207.06991 🧑‍💻github.com/xplip/pixel 🤖huggingface.co/Team-PIXEL/pix… by @rust_phillip @jonasflotz me @esalesk @mdlhx @delliott

English
9
33
229
34.7K
Phillip Rust รีทวีตแล้ว
Yi Lin Sung
Yi Lin Sung@yilin_sung·
Tough week! I also got impacted less than 3 months after joining. Ironically, I just landed some new RL infra features the day before. Life moves on. My past work spans RL, PEFT, Quantization, and Multimodal LLMs. If your team is working on these areas, I’d love to connect.
Jiaxun Cui 🐿️@cuijiaxun

Meta has gone crazy on the squid game! Many new PhD NGs are deactivated today (I am also impacted🥲 happy to chat)

English
42
58
500
173.5K
Phillip Rust รีทวีตแล้ว
Jinpeng Wang
Jinpeng Wang@awinyimgprocess·
Humans see text — but LLMs don’t. I wrote a short blog post exploring how models can perceive text visually rather than tokenize it: 🔗 csu-jpg.github.io/Blog/people_se… From PIXEL, CLIPPO, VisInContext, VIST to DeepSeek-OCR, this is a quick story of how vision-centric modeling is changing how machines read, and a reflection on some of our own small efforts in the past two years.
English
8
39
216
38.1K
Phillip Rust
Phillip Rust@rust_phillip·
I will be presenting this work in-person at ACL🇹🇭 this week. Drop by if you'd like to chat! Oral: Today (Monday) 16:30 Poster: Tuesday (Tomorrow) 10:30 - 12:00
Phillip Rust@rust_phillip

Introducing “Towards Privacy-Aware Sign Language Translation at Scale” We leverage self-supervised pretraining on anonymized videos, achieving SOTA ASL-to-English translation performance while mitigating risks arising from biometric data. 📄: arxiv.org/abs/2402.09611 🧵(1/9)

English
0
1
21
1.3K
Phillip Rust
Phillip Rust@rust_phillip·
For more experiments and all the details, check out our arXiv preprint linked above. We are working on releasing our code and data, so stay tuned! 👨‍💻 🧵(8/9)
English
1
0
2
261
Phillip Rust
Phillip Rust@rust_phillip·
Introducing “Towards Privacy-Aware Sign Language Translation at Scale” We leverage self-supervised pretraining on anonymized videos, achieving SOTA ASL-to-English translation performance while mitigating risks arising from biometric data. 📄: arxiv.org/abs/2402.09611 🧵(1/9)
Phillip Rust tweet media
English
1
7
20
3.9K
Phillip Rust รีทวีตแล้ว
Tianyu Gao
Tianyu Gao@gaotianyu1350·
New preprint "Improving Language Understanding from Screenshots" w/ @zwcolin @AdithyaNLP @danqi_chen. We improve language understanding abilities of screenshot LMs, an emerging family of models that processes everything (including text) via visual inputs arxiv.org/abs/2402.14073
GIF
English
6
43
186
21.3K
Phillip Rust รีทวีตแล้ว
Desmond Elliott
Desmond Elliott@delliott·
In PHD: Pixel-Based Language Modeling of Historical Documents with @NadavBorenstein @rust_phillip and @IAugenstein, we apply pixel language models to processing historical document and to more standard NLP classification tasks too. See it in Poster Session 6 on Sunday 10th.
Desmond Elliott tweet mediaDesmond Elliott tweet media
English
1
5
21
1.9K
Phillip Rust รีทวีตแล้ว
Desmond Elliott
Desmond Elliott@delliott·
In Text Rendering Strategies for Pixel Language Models with @jonasflotz @rust_phillip and @esalesk, we design new text renderers for visual language processing to improve performance or to squeeze the model down to just 22M parameters. See it in Poster Session 2 on Friday 8th.
Desmond Elliott tweet mediaDesmond Elliott tweet media
English
1
4
15
1.5K
Phillip Rust รีทวีตแล้ว
AI at Meta
AI at Meta@AIatMeta·
Introducing SeamlessM4T, the first all-in-one, multilingual multimodal translation model. This single model can perform tasks across speech-to-text, speech-to-speech, text-to-text translation & speech recognition for up to 100 languages depending on the task. Details ⬇️
English
54
424
1.7K
592.5K
Phillip Rust รีทวีตแล้ว
Desmond Elliott
Desmond Elliott@delliott·
📢 I am hiring a postdoc to join our project on pixel-based natural language processing. The position is based in Copenhagen 🇩🇰 for 18 months. Applications are due by March 29 employment.ku.dk/faculty/?show=…. Informal inquiries are welcome.
Desmond Elliott@delliott

Thrilled to receive a grant from @VILLUMFONDEN to carry out blue-skies research on tokenization-free NLP veluxfoundations.dk/en/about/proje… I will hire Ph.Ds and Postdocs to build up the group so feel free to reach out. We're starting off with a paper at #ICLR2023 openreview.net/forum?id=FkSp8…

English
0
20
32
11.2K