Ismail Elezi

609 posts

Ismail Elezi

Ismail Elezi

@Ismail_Elezi

Principal Research Scientist at @Huawei Noah Ark in London (before that @TUM @Nvidia). Views are my own.

Katılım Aralık 2011
293 Takip Edilen507 Takipçiler
Ismail Elezi
Ismail Elezi@Ismail_Elezi·
Massive respect to @AnthropicAI for standing by their principles. The entire fate of humanity might be dependent on not doing stupid things required by people who do not know anything about AI.
English
0
0
2
100
Ismail Elezi
Ismail Elezi@Ismail_Elezi·
Deadline time: 49/56 (87.5%) of reviews submitted. 7/14 papers have 4 reviews. 7/14 papers have 3 reviews. All reviews are atleast ok level. One of the best reviews is from one of the biggest names on the field, which means that yes, you have enough time to write a good review.
English
1
0
0
179
Ismail Elezi
Ismail Elezi@Ismail_Elezi·
Update: 24 hours to go: 36/56 (64%) of reviews submitted. 2/14 papers have 4 reviews. 5/14 papers have 3 reviews. 6/14 papers have 2 reviews. 1/14 papers have 1 review.
English
1
0
0
237
Ismail Elezi
Ismail Elezi@Ismail_Elezi·
Stats for a first time AC in @NeurIPSConf, while in the last 48h of the reviewing deadline. 27/56 (48%) of reviews submitted 1/14 papers has 4 reviews 3/14 papers have 3 reviews 4/14 papers have 2 reviews 6/14 papers have 1 review
English
3
0
2
482
Ismail Elezi retweetledi
Xin Wen
Xin Wen@_xwen_·
Can we decouple semantics from spectrum for image tokenizers? Our answer: the 1D Semanticist tokenizer. We push the burden of photo-realistic image generation to diffusion decoders, and let the tokens focus on the semantic structure. The PCA-like structure is induced by nested CFG (dropout), allowing plausible reconstruction & generation with very few tokens, and coarse-to-fine hierarchy. Thanks to semantic-spectrum decoupling, our tokenizer also achieves strong performance on ImageNet linear probing, indicating potential for unified understanding and generation. To check for more details, see you 1-2 pm TODAY at ExHall D, GMCV Workshop! "Principal Components" Enable A New Language of Images Project Page: visual-gen.github.io/semanticist/ Code: github.com/visual-gen/sem…
Xin Wen tweet media
English
2
5
15
18.3K
Ismail Elezi
Ismail Elezi@Ismail_Elezi·
Really cool project with a simple general solution, while reaching SOTA results, lead by @t_gaintseva.
Tatiana Gaintseva@t_gaintseva

A new paper is out! CASteer: Steering Diffusion Models for Controllable Generation Arxiv link: arxiv.org/abs/2503.09630 Code: github.com/Atmyre/CASteer Diffusion models are powerful, but their generation process can be difficult to control, which poses safety risks (e.g., generating images with nudity/violence). There are many ideas on how to address this, but most of the existing approaches are limited in what issues they can handle and often require additional training. CASteer, on the other hand, is capable of handling broad range of tasks, while being completely training-free! CASteer works by constructing a special steering vector for each cross-attention layer in a diffusion model using prompt pairs that capture dspecific concepts. By adding or subtracting these vectors from the outputs of cross-attention layers during inference, we gain fine-grained control over the entire generation process. We can build steering vectors for any kind of concept, and this allows for broad range of manipulations over images being generated. We can add/remove objects (e.g., apples), alter abstract attributes (e.g., nudity), do style transfer, identity manipulation (switching Leonrado DiCaprio to Keanu Reevs), concept interpolation (going from cat to giragge), and more (see picture). Simplicity of CASteer allows for easy incorporation of it into most of the modern DMs. I would like to thank ChengCheng Ma and @Ismail_Elezi , who provided invaluable assistance in this project, as well as my university supervisors: Ziquan Liu, Martin Benning, Gregory Slabaugh and Jiankang Deng. Hope to have further great collaborations!

English
0
1
4
233
Ismail Elezi
Ismail Elezi@Ismail_Elezi·
@PrannayKaul @JiankangDeng And on a fun note, 4 years after completing the computer vision holy trinity (CVPR, ICCV, ECCV), finally completed the machine learning conference trinity (NeurIPS, ICML, ICLR).
English
0
0
4
722
Ismail Elezi
Ismail Elezi@Ismail_Elezi·
First accepted paper of the year: "From Attention to Activation: Unraveling the Enigmas of Large Language Models" has been accepted to ICLR 2025. The most educative paper I have co-wrote, it strengthens some claims known in the community, it opposes others, ...
Ismail Elezi tweet mediaIsmail Elezi tweet media
English
1
0
8
1.1K
Ismail Elezi
Ismail Elezi@Ismail_Elezi·
#CVPR has done a very bad job in paper assignment this time for me. Out of 4 papers I am reviewing, only one has something to do with me, and even that is related to 2-4 years old papers I have. Not sure if I was unlucky, or the matching system has gone downhill.
English
0
0
1
240
Ismail Elezi
Ismail Elezi@Ismail_Elezi·
Max Tegmark talk at #NeurIPS2024 was funny, obvious, entertaining and concerning at the same time. The best AI talk I have heard in a very long time.
Ismail Elezi tweet media
English
0
0
1
248
Ismail Elezi
Ismail Elezi@Ismail_Elezi·
We offer long internships (6+ months), competitive salaries, an office in the center of London, and a very diverse group (very gender-balanced, researchers from 8 countries working on a wide range of topics).
English
0
0
0
96
Ismail Elezi
Ismail Elezi@Ismail_Elezi·
have topic match (VLLMs, LLMs, multimodality learning, or diffusion) and are interested in doing an internship at Huawei Research Center in London, please write to me and let’s have a chat in the conference.
English
1
0
0
126
Ismail Elezi
Ismail Elezi@Ismail_Elezi·
I am very happy to attend NeurIPS in Vancouver when together with @Miles12Roy we will be presenting our VeLora paper on Thu 12 Dec 4:30 p.m. PST — 7:30 p.m. PST.
English
1
0
4
207