Machine Quest
370 posts
@machine_quest
I doubt therefore I think therefore I exist
Joined May 2022
852 Following · 34 Followers

Pinned Tweet
Machine Quest @machine_quest:
Happy to have been part of this paper. This is my first EACL paper, but it won't be the last. I thank everyone who enabled me to reach this height, especially my supervisors Dr. @ransurangika and Dr. @NisansaDdS.
Quoting Nisansa de Silva @NisansaDdS:
In our upcoming @eaclmeeting paper on the utilization potential of the quality of web-mined corpora, we discuss how you may build better translation models by automatically sorting the training samples and using the top samples. Paper: arxiv.org/abs/2402.07446 1/n
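The sorting idea in the quoted tweet can be sketched in a few lines: score each web-mined sentence pair with some quality metric, sort, and train only on the top fraction. The scorer below is a stand-in length-ratio heuristic for illustration, not the metric used in the paper:

```python
# Hypothetical sketch of quality-based corpus filtering: rank parallel
# sentence pairs by a quality score and keep only the best fraction.

def length_ratio_score(src: str, tgt: str) -> float:
    """Crude quality proxy: pairs with similar word counts score higher."""
    a, b = len(src.split()), len(tgt.split())
    if a == 0 or b == 0:
        return 0.0
    return min(a, b) / max(a, b)

def top_fraction(pairs, score_fn, fraction=0.5):
    """Sort candidate pairs by score (descending) and keep the top fraction."""
    ranked = sorted(pairs, key=lambda p: score_fn(*p), reverse=True)
    keep = max(1, int(len(ranked) * fraction))
    return ranked[:keep]

corpus = [
    ("the cat sat on the mat", "le chat est assis sur le tapis"),
    ("hello", "bonjour tout le monde et bienvenue ici"),  # noisy pair
    ("good morning", "bonjour"),
    ("click here to download", "spam spam spam spam spam spam spam spam"),
]
filtered = top_fraction(corpus, length_ratio_score, fraction=0.5)
```

In practice the score would come from a learned quality estimator rather than a length heuristic; the ranking-and-truncation step stays the same.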

deep Manifold @BetaTomorrow:
These are important points and worth further exploration. Learning complexity arises from data complexity, which is largely rooted in high-order nonlinearity. This is precisely where traditional supercomputing historically hit its limits, and AI systems are now encountering the same fundamental barrier.
Sebastian Raschka @rasbt:
One of the underrated papers this year: "Small Batch Size Training for Language Models: When Vanilla SGD Works, and Why Gradient Accumulation Is Wasteful" (arxiv.org/abs/2507.07101) (I can confirm this holds for RLVR, too! I have some experiments to share soon.)
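The claim in the paper's title can be checked on a toy problem: for plain SGD, accumulating averaged gradients over micro-batches reproduces the large-batch step exactly, so the extra bookkeeping buys nothing that simply stepping on small batches would not. A minimal 1-D least-squares sketch (all numbers and batch sizes here are made up for the demo):

```python
# Toy illustration: a gradient-accumulation step over k equal-size
# micro-batches equals a single SGD step on the combined batch.

def grad(w, batch):
    """Mean gradient of 0.5*(w*x - y)^2 over a batch of (x, y) pairs."""
    return sum((w * x - y) * x for x, y in batch) / len(batch)

data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0), (4.0, 8.0)]
lr, w0 = 0.01, 0.0

# One step on the full batch of 4:
w_large = w0 - lr * grad(w0, data)

# Same step via gradient accumulation over two micro-batches of 2,
# averaging the accumulated gradients before the single update:
acc = (grad(w0, data[:2]) + grad(w0, data[2:])) / 2
w_accum = w0 - lr * acc
```

The equality holds exactly for SGD with equal-size micro-batches; the paper's point is that when small batches already train well, paying memory and latency to emulate a large batch is wasted effort.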
merve @mervenoyann:
Chapter 5 of the Vision Language Models book is out and it's all about pre-training, illustrated and hands-on 🧑🏻‍🎨👩🏻‍🎨👨🏻‍🎨 here's a sneak peek 👀
Edoardo Ponti @PontiEdoardo:
@machine_quest @FSoudan This work takes inspiration from BLT, H-Net, as well as from earlier work, including Dynamic Token Pooling (DTP) aclanthology.org/2023.acl-long.… As far as I know, in DTP we were the first to introduce the idea of end-to-end tokenisation for autoregressive Transformers.
Edoardo Ponti @PontiEdoardo:
Finally, you can count the r's in strawberry and check if 3.11 is higher than 3.9 without tokenisation interfering: Here's Bolmo, a fully open byte-level LLM with latent tokenisation, derived from a SOTA LLM (Olmo 3). Promising on coding and char-level understanding!
Quoting Ai2 @allen_ai:

Introducing Bolmo, a new family of byte-level language models built by "byteifying" our open Olmo 3—and to our knowledge, the first fully open byte-level LM to match or surpass SOTA subword models across a wide range of tasks. 🧵
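The two failure cases named in the tweet are trivial once text is handled at the character and numeric level rather than through subword tokens:

```python
# The two classic tokenizer-induced failures, done directly at the
# character and numeric level (no subword tokens in the way).

word = "strawberry"
r_count = word.count("r")   # character-level count: 3 r's

# Compared as decimal magnitudes, 3.11 < 3.9 -- the answer models
# with subword tokenization often get wrong.
is_higher = 3.11 > 3.9      # False
```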

Edoardo Ponti @PontiEdoardo:
@FSoudan That's the magic of latent tokenisation! The sequence length is shortened in internal layers. So, depending on the compression ratio, the model may be *even faster* than its subword-level counterpart.
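A back-of-the-envelope sketch of that speedup argument: self-attention cost grows roughly quadratically with sequence length, so compressing the internal sequence by a larger ratio than a subword tokenizer achieves makes those layers cheaper. The lengths and ratios below are illustrative assumptions, not figures from the paper:

```python
# Why latent tokenisation can beat a subword model on speed:
# attention cost is O(L^2) in sequence length L, so a compression
# ratio r over bytes cuts that cost by r^2 in the internal layers.

def attention_cost(seq_len):
    """Relative cost of one self-attention layer, O(L^2)."""
    return seq_len ** 2

byte_len = 4096              # raw byte sequence (illustrative)
subword_len = byte_len / 4   # assume ~4 bytes per subword token
latent_len = byte_len / 8    # hypothetical internal compression ratio of 8

speedup_vs_subword = attention_cost(subword_len) / attention_cost(latent_len)
# With these made-up ratios, the latent layers are 4x cheaper than
# the subword model's layers.
```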
Marc Lelarge 🌻 @marc_lelarge:
Ready for tomorrow's lecture: Programming on GPUs
Marc Lelarge 🌻 @marc_lelarge:
Learn GPU programming from the ground up: begin with Numba for low-level control, then progress to Triton to write high-performance kernels in a Python-like language. A hands-on Jupyter notebook gets you started quickly.
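The kernel-programming model those tools teach can be previewed in pure Python: each "thread" computes one output element independently, and Numba CUDA or Triton run that body as a parallel grid instead of the serial loop used here. A hypothetical stand-in, not code from the notebook:

```python
# Flavor of GPU kernel programming in plain Python: the kernel body
# handles exactly one index i; the launcher stands in for a GPU grid.

def vector_add_kernel(i, x, y, out):
    """Body 'executed by thread i': one output element, no loops."""
    out[i] = x[i] + y[i]

def launch(kernel, n, *args):
    """Serial stand-in for launching a grid of n parallel threads."""
    for i in range(n):
        kernel(i, *args)

x = [1.0, 2.0, 3.0, 4.0]
y = [10.0, 20.0, 30.0, 40.0]
out = [0.0] * 4
launch(vector_add_kernel, 4, x, y, out)
```

In Numba the loop disappears: the same body is decorated with `@cuda.jit` and the index comes from the thread grid; Triton works analogously with program instances over blocks.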
Machine Quest @machine_quest:
@marc_lelarge Really looking forward to diving into this over the weekend. I appreciate your open-source policy!
Mor Geva @megamor2:
✨ New course materials: Interpretability of LLMs✨ This semester I'm teaching an active-learning grad course at @TelAvivUni on LLM interpretability, co-developed with my student @dhgottesman. We're releasing the materials as we go, so they can serve as a resource for anyone curious about how LLMs work from the inside: github.com/mega002/llm-in…
Machine Quest @machine_quest:
@elliotarledge This is awesome; such great material for diving into PyTorch as well.
Machine Quest @machine_quest:
@jxmnop All the very best!!! Would love to read your thesis when it's public.
Machine Quest @machine_quest:
@jxmnop @goyal__pramod Wow! This is brilliant. Question: is the "written" column in your spreadsheet the conference date or the arXiv date?
dr. jack morris @jxmnop:
from 2017–2020 i was learning ML. i didn't publish any research (and hadn't yet); i just trained a lot of tiny models and read a paper every single day. i maintained a giant spreadsheet with notes about each paper along with random thoughts. was a great way to learn
Machine Quest retweeted
Pedro Domingos @pmddomingos:
Hinton is no longer afraid of superintelligence.
Machine Quest @machine_quest:
@zouharvi Congrats, this is truly an exceptional achievement. Would love to hear your advice on applying for such prestigious awards.
Vilém Zouhar @ EACL @zouharvi:
Grateful to receive the Google PhD Fellowship! 🙂 I am not secretive about having applied to 4 similar fellowships during my PhD before without success. Still, refining my research statement (part of the application) helped me tremendously in finding out the really interesting…
Quoting Google.org @Googleorg:

🎉 We're excited to announce the 2025 Google PhD Fellows! @GoogleOrg is providing over $10 million to support 255 PhD students across 35 countries, fostering the next generation of research talent to strengthen the global scientific landscape. Read more: goo.gle/43wJWw8

saurabh @saurabhtwq:
lion only care about nanochat for next few days.