Zhiyuan Liu@NUS

34 posts

Zhiyuan Liu@NUS

@acharkq

Postdoc at NUS | AI for Science | Multimodal & Generative Models (Diffusion & AR) | PhD from NUS

Singapore Katılım Temmuz 2017

239 Takip Edilen115 Takipçiler

Sabitlenmiş Tweet

Zhiyuan Liu@NUS@acharkq·4 Kas

✨How can an LM understand 2D molecular graphs 🧬? 📢 In this EMNLP2023 work, we propose MolCA, a multi-modal LM that can interpret 2D molecular graphs. paper: huggingface.co/papers/2310.12… demo: 8b8760bb1ba284ef54.gradio.live 👇(1/4)

English

21.1K

Zhiyuan Liu@NUS retweetledi

Rob Tang 🦞@XiangruTang·6d

🦞 Excited to announce Claw4S Conference!!! A new kind of AI4Science conference where you submit skills, not papers. Instead of static PDFs, you submit a SKILL.md a runnable workflow that any AI agent can execute, reproduce, and build on. Deadline: Apr 5, 2026 Prize pool: $50,200!!! 👉 claw.stanford.edu With @lecong and @Charles_Y_Wu

English

221

29.7K

Zhiyuan Liu@NUS retweetledi

Tencent HY@TencentHunyuan·5 Mar

One static model does not fit all😭 We just dropped our latest work: Functional Neural Memory. Instead of static models, we generate custom "parameters" for every single input. ✅Prompt your model anytime ✅Instant personalization ✅Better instruction following ✅Flexible & dynamic memory (w/o memory bank✌️) (🧵1/6)

English

141

331

67.2K

Zhiyuan Liu@NUS@acharkq·29 Eyl

@mengyer One crucial dilemma when choosing to stay in academia

English

147

Mengye Ren@mengyer·28 Eyl

If I have a brilliant research idea, I should a) Try it out with the help of AI in a few days OR b) Spend a month to write a grant, possibly get rejected in 6 months or get to be the luckiest top 10% to get funded in a year just to recruit one student to start from ground zero.

English

134

16.6K

Zhiyuan Liu@NUS@acharkq·19 Eyl

@JohnJumperSci A bit late to this post, but the mission of using LLMs for scientific discovery resonates deeply with my own research focus. Your team's work is pioneering. Should any similar roles open up, I would be thrilled to be considered. My portfolio is here: acharkq.github.io

English

John Jumper@JohnJumperSci·8 May

We are expanding our efforts in LLM-based scientific discovery and are hiring for several roles. Come join my team and work with us on the future of natural language scientific AI! job-boards.greenhouse.io/deepmind/jobs/… job-boards.greenhouse.io/deepmind/jobs/… See RS and RE roles above.

English

240

62.3K

Zhiyuan Liu@NUS@acharkq·16 Eyl

@PanLiangming @PKU1898 Big congrats!

English

411

Liangming Pan@PanLiangming·16 Eyl

Life update: I've joined the School of Computer Science at Peking University @PKU1898 as an Assistant Professor! I'm looking for Ph.D./intern/visiting researchers for my new research group. If you are interested in NLP and LLM, check my research at liangmingpan.bio

English

311

24.9K

Zhiyuan Liu@NUS retweetledi

Rob Tang 🦞@XiangruTang·14 Ağu

🧬✨Excited to introduce CellForge: Agentic Design of Virtual Cell Models - the first fully autonomous AI system for single-cell perturbation modeling! 🌟 This is what the future of computational biology looks like - AI scientists designing AI models! 🚀 What makes it special: - Zero human intervention: From raw data → optimized models → executable code - Multi-agent collaboration: 5 specialized AI experts working together - Cross-modal: Works with scRNA-seq, scATAC-seq, CITE-seq 🔬 CellForge doesn't just pick existing models - it designs novel architectures through collaborative AI reasoning, then writes, tests & refines production-ready code automatically. 📊 Validated on 6 datasets (gene knockouts, drugs, cytokines) - consistently outperforms task-specific SOTA methods like scGPT & GEARS. Paper: arxiv.org/pdf/2508.02276 Code: github.com/gersteinlab/Ce…

English

214

6.1K

Zhiyuan Liu@NUS retweetledi

Rob Tang 🦞@XiangruTang·14 Ağu

🗓️ AI4Science BoF Session at #ACL2024 📅 Date: August 14 🕥 Time: 10:30 AM - 12:00 PM 📍 Location: Room Lotus 10, 22nd floor Agenda: 🎤 Keynote by @CarlEdwards_NLP: 30 mins 🎤 Keynote by @YinFang1105: 30 mins 🗣️ Panel Discussion: 30 mins with @CarlEdwards_NLP, Dr. @YinFang1105, & Dr. @acharkq. Don't miss this engaging session! #LLMs #Science #ACL24

English

Zhiyuan Liu@NUS retweetledi

Yaorui SHI@shiyaorui·16 Haz

🚀 Can LMs plan experimental procedures of chemical reactions? #ACL2024 Findings! We present 🧪ReactXT🧪, an LM that can output step-by-step actions to execute chemical reactions, retrosynthesis and molecule captioning. 🏠Paper: huggingface.co/papers/2405.14… 👇(1/6)

English

199

Zhiyuan Liu@NUS@acharkq·21 Mar

We have also achieved improved performances on tasks of molecule-text retrieval and molecule captioning.

English

Zhiyuan Liu@NUS@acharkq·21 Mar

By tuning and evaluating on 3D-MoIT, we demonstrate that 3D-MoLM can be used to predict quantum chemistry of molecules, like HOMO, LUMO, and H-L Gap. Specifically, 3D-MoLM shows comparable performances to its 3D molecule encoder.

English

103

Zhiyuan Liu@NUS@acharkq·21 Mar

Many thanks to @_akhaliq for sharing our ICLR 2024 work! We have a live demo at: 5392def3bf3be3f70d.gradio.live Code is at: github.com/lsh0520/3D-MoLM 📢 We present 3D-MoLM, a multi-modal Language Model that intergrate the power of Llama-2 and UniMol for 3D molecule understanding.

AK@_akhaliq

Towards 3D Molecule-Text Interpretation in Language Models Language Models (LMs) have greatly influenced diverse domains. However, their inherent limitation in comprehending 3D molecular structures has considerably constrained their potential in the biomolecular domain. To

English

1.7K

Zhiyuan Liu@NUS@acharkq·23 Kas

@EduardoSlonski @ylecun @DrJimFan @RichardSSutton I also agree with the new architecture, especially considering the senatorial data, audio, and video data requires more effective processors rather than gluing everything on LMs

English

1.4K

Eduardo Slonski@EduardoSlonski·23 Kas

1) We use a lot of data. You’re forgetting the huge amount of video, audio and sensorial data we receive all the time. Not to mention the encoded “instructions” from DNA. We’re not trained from scratch and our output is much more general than that of LLMs 2) I agree with you about new architectures

English

138

63.3K

Jim Fan@DrJimFan·23 Kas

It’s pretty obvious that synthetic data will provide the next trillion high-quality training tokens. I bet most serious LLM groups know this. The key question is how to SUSTAIN the quality and avoid plateauing too soon. The Bitter Lesson by @RichardSSutton continues to guide AI development: there’re only 2 paradigms that scale indefinitely with compute: Learning & Search. It’s true in 2019 at the time of writing, true today, and I bet will hold true till the day we solve AGI. incompleteideas.net/IncIdeas/Bitte…

English

139

288

2.5K

1.6M

Zhiyuan Liu@NUS@acharkq·19 Kas

@BlancheMinerva @GoogleAI sorry, but it seems the upload date is 2022 instead of 2023…

English

1.8K

Stella Biderman@BlancheMinerva·19 Kas

TIL: @GoogleAI's 1.6T parameter mixture-of-experts encoder-decoder model is available under an Apache 2.0 license! Trained on public data too.

English

485

136.8K

Zhiyuan Liu@NUS@acharkq·16 Kas

@XueFz @m__dehghani @YangYou1991 @AixinSG @GoogleAI Congrats! Fuzhao!

Català

127

Fuzhao Xue (Frio)@XueFz·16 Kas

Super thrilled to announce that I've been awarded the 2023 Google PhD Fellowship! Enormous gratitude to my wonderful mentors/advisors who championed my application: @m__dehghani, @YangYou1991, @AixinSG, and to all my incredible collaborators. A heartfelt thanks to @GoogleAI and @Google for their generous support. Excited for this journey ahead! 🚀 #GooglePhDFellowship

Google AI@GoogleAI

In 2009, Google created the PhD Fellowship Program to recognize and support outstanding graduate students pursuing exceptional research in computer science and related fields. Today, we congratulate the recipients of the 2023 Google PhD Fellowship! goo.gle/3PYfLXl

English

255

65.4K

Zhiyuan Liu@NUS@acharkq·13 Kas

@jw2yang4ai Very interesting! We should reserve the term ‘visual prompt’ specifically to inserting texts/marks into images 🤔🤔

English

Jianwei Yang@jw2yang4ai·13 Kas

Very good probing test! Seems GPT-4V still faces difficulties to understand the layout of collated images. But with a little cue (i.e., a mark), it can do much better. Like we showed in our Set-of-Mark Prompting work, visual prompt is much better than text to convey visual hints.

Xin Eric Wang@xwang_lk

The famous "Chihuahua or Muffin" problem in computer vision is considered solved by GPT-4V on social media. But really? The answer is NO. GPT-4V cannot reason well about the same images in the original "Chihuahua or Muffin" grid when they are in a different layout. I experimented by rearranging the same images from the classic 4x4 grid into a different layout. First, GPT-4V does not directly recognize the content in details and miscounts the number of images. Then, when being asked about the third image on the top row, GPT-4V misrecognizes a Chihuahua as a muffin. So the "Chihuahua or Muffin" has not been solved yet. But how can GPT-4V work so well on the original image? My guess is that since that image is everywhere, GPT-4V was very likely to be trained on it and memorize its labels.

English

12.2K

Zhiyuan Liu@NUS@acharkq·9 Kas

@_akhaliq Great work indeed! I have found a demo at the website: next-chatv.github.io

English

277

Zhiyuan Liu@NUS@acharkq·9 Kas

@XueFz I get it now. The purpose is to have the strongest 7b lm without budget constraints, considering lower inference cost can offset training cost. It then makes more sense to distill a 70B lm. Consider GPT turbo is 20b, I wonder if they do the same distillation from a larger model?

English

Fuzhao Xue (Frio)@XueFz·8 Kas

Why there are few Open Distlled LLMs so far? Any difficulty? Or no benefit observed? I mean using LLaMA-70B to pretrain a 7B model or so.. Not the SFT-style distillation.

English

12.8K

Zhiyuan Liu@NUS@acharkq·8 Kas

@alignment_lab @XueFz @ontocord I agree, forwarding a 70B model can be too expensive to train a 7B model.

English

123

Alignment Lab AI@alignment_lab·8 Kas

it would be very expensive to get that much distillation data from llama 70b and even more to pretrain on it! i have absolutely been doing everything in my power to make it work anyways though with @ontocord if youre interested, maybe you can revitalize the group! open source though!

English

651

Keşfet

@lecong @Charles_Y_Wu @mengyer @JohnJumperSci @PanLiangming @PKU1898 @CarlEdwards_NLP @YinFang1105