

HKUNLP

@hkunlp2020
We are a group of researchers working on natural language processing in the Department of Computer Science at the University of Hong Kong.









DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation. Apple introduces DiffuCoder, a 7B diffusion LLM (dLLM) trained on 130B tokens of code. The authors also propose coupled-GRPO, a diffusion-native RL training framework. Decoding in dLLMs differs from autoregressive decoding in two ways: 1) dLLMs can decide how causal their generation should be without relying on semi-AR decoding; 2) increasing the sampling temperature diversifies not only token choices but also the order in which tokens are generated.










We are kicking off a series of seminars at @hkunlp2020. @siyan_zhao will give a talk titled "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning" on ⏰ Friday 5.9 at 11am HKT (Thursday 5.8 at 8pm PDT). Link to talk: hku.zoom.us/j/97925412724?…