Xuefei Ning

@FiNingm

Katılım Şubat 2015

165 Takip Edilen34 Takipçiler

Xuefei Ning retweetledi

Xihui Liu@XihuiLiu·9 Ara

Excited to share our recent work GenMAC: a multi-agent collaboration framework for compositional text-to-video generation. Project page: karine-h.github.io/GenMAC/ Paper: arxiv.org/pdf/2412.04440 huggingface.co/papers/2412.04… @KaiyiHUANG84276 @yukun6414 @FiNingm @lin_zinan

English

11.5K

Xuefei Ning retweetledi

Zinan Lin@lin_zinan·1 Ara

[𝗜𝗻𝘁𝗲𝗿𝗻 𝗛𝗶𝗿𝗶𝗻𝗴]We are hiring a [𝐒𝐩𝐫𝐢𝐧𝐠 𝟐𝟎𝟐𝟓] [𝐟𝐮𝐥𝐥-𝐭𝐢𝐦𝐞] intern working on 𝗣𝗿𝗶𝘃𝗮𝘁𝗲 𝗘𝘃𝗼𝗹𝘂𝘁𝗶𝗼𝗻. If you are interested, please apply here jobs.careers.microsoft.com/global/en/job/… and send me an email: zinanlin at microsoft dot com

Zinan Lin@lin_zinan

#ICML2024 Spotlight <Differentially Private Synthetic Data via Foundation Model APIs 2: Text> was accepted at ICML 2024 as Spotlight! Unfortunately, we are not attending ICML in person. But feel free to reach out to us if you are interested! Paper: arxiv.org/abs/2403.01749

English

Xuefei Ning retweetledi

Zinan Lin@lin_zinan·24 Ara

🚀 Image AR models (𝗩𝗔𝗥 & 𝗟𝗹𝗮𝗺𝗮𝗚𝗲𝗻) can be distilled to 𝗢𝗡𝗘 step (up to 𝟮𝟭𝟴𝘅 𝗳𝗮𝘀𝘁𝗲𝗿) for the first time! See 𝑫𝒊𝒔𝒕𝒊𝒍𝒍𝒆𝒅 𝑫𝒆𝒄𝒐𝒅𝒊𝒏𝒈 ↓ 𝗪𝗲𝗯𝘀𝗶𝘁𝗲: imagination-research.github.io/distilled-deco… 𝗣𝗮𝗽𝗲𝗿: arxiv.org/abs/2412.17153 huggingface.co/papers/2412.17… (1/n)

English

Xuefei Ning retweetledi

AK@_akhaliq·14 Haz

DiTFastAttn Attention Compression for Diffusion Transformer Models Diffusion Transformers (DiT) excel at image and video generation but face computational challenges due to self-attention's quadratic complexity. We propose DiTFastAttn, a novel post-training compression

English

142

17.4K

Xuefei Ning retweetledi

Zinan Lin@lin_zinan·1 Ağu

Thank @_akhaliq for featuring Skeleton-of-Thoughts! Check out the recorded demo: sites.google.com/view/sot-llm/h… It is just a start--more work to do towards a usable tool. But we genuinely believe in the potential of this data-driven direction to make LLMs more efficient and powerful!

AK@_akhaliq

Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding paper page: huggingface.co/papers/2307.15… This work aims at decreasing the end-to-end generation latency of large language models (LLMs). One of the major causes of the high generation latency is the sequential decoding approach adopted by almost all state-of-the-art LLMs. In this work, motivated by the thinking and writing process of humans, we propose "Skeleton-of-Thought" (SoT), which guides LLMs to first generate the skeleton of the answer, and then conducts parallel API calls or batched decoding to complete the contents of each skeleton point in parallel. Not only does SoT provide considerable speed-up (up to 2.39x across 11 different LLMs), but it can also potentially improve the answer quality on several question categories in terms of diversity and relevance. SoT is an initial attempt at data-centric optimization for efficiency, and reveal the potential of pushing LLMs to think more like a human for answer quality.

English

407

Xuefei Ning retweetledi

Zinan Lin@lin_zinan·1 Ağu

Thank @omarsar0 for featuring Skeleton-of-Thoughts! Check out the recorded demo: sites.google.com/view/sot-llm/h… It is just a start--more work to do towards a usable tool. But we genuinely believe in the potential of this data-driven direction to make LLMs more efficient and powerful!

elvis@omarsar0

Skeleton-of-Thought: LLMs can do parallel decoding Interesting prompting strategy which firsts generate an answer skeleton and then performs parallel API calls to generate the content of each skeleton point. Reports quality improvements in addition to speed-up of up to 2.39x. Big deal given how costly in terms of latency some tasks are. This a great paper to rethink the necessity of sequential decoding of current LLMs. arxiv.org/abs/2307.15337

English

8.1K

Xuefei Ning retweetledi

Zinan Lin@lin_zinan·27 Tem

#ICML Accelerate 𝐒𝐭𝐚𝐛𝐥𝐞 𝐃𝐢𝐟𝐟𝐮𝐬𝐢𝐨𝐧 by 𝟐𝐱 through a new perspective! Visit poster #102 on Thursday, July 27th, at 10:30 am. [𝐏𝐚𝐩𝐞𝐫] OMS-DPM: Optimizing the Model Schedule for Diffusion Probabilistic Models arxiv.org/abs/2306.08860 (1/3)

English

407

Xuefei Ning retweetledi