jzb
@rucjzb
57 posts
Joined May 2017
373 Following · 36 Followers
jzb reposted
Graham Neubig @gneubig
How can we vibe code while still maintaining code quality? Over the past year, I've shifted 95% of my development from manually writing code to using coding agents. I wrote this blog on some tricks I learned to work successfully with agents: all-hands.dev/blog/vibe-codi…
4 replies · 36 reposts · 180 likes · 67.9K views
jzb reposted
Graham Neubig @gneubig
How far are we from having competent AI co-workers that can perform tasks as varied as software development, project management, administration, and data science? In our new paper, we introduce TheAgentCompany, a benchmark for AI agents on consequential real-world tasks.
19 replies · 144 reposts · 829 likes · 126.6K views
jzb reposted
Mustafa Suleyman @mustafasuleyman
Today we’re launching our new Copilot experience. I truly believe we can deliver a calmer, more helpful and supportive era of technology, with a Copilot that is now more intuitive, more personalized, and secure. Learn more, download, and enjoy. At Microsoft AI, we are creating an AI companion for everyone. This is the first step.
81 replies · 160 reposts · 906 likes · 309.9K views
jzb reposted
Zhiqing Sun @EdwardSun0909
🌟Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision 🌟 arxiv.org/abs/2403.09472 How can we keep improving AI systems when their capabilities surpass those of human supervisors? (1/n)
7 replies · 58 reposts · 256 likes · 106.7K views
jzb reposted
Srini Iyer @sriniiyer88
New paper! How to train LLMs to effectively answer questions on new documents? Introducing *pre-instruction-tuning* - instruction-tuning *before* continued pre-training — significantly more effective than traditional instruction-tuning after PT. arxiv.org/abs/2402.12847
1 reply · 33 reposts · 145 likes · 17.3K views
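The contrast this tweet draws is purely about the ordering of training phases. A minimal sketch of the two recipes, where `finetune` is a hypothetical stand-in that only records which phase ran, not a real training step:

```python
def finetune(model, data_name):
    # Hypothetical stand-in for a real training phase: it just records
    # the order in which the model saw each kind of data.
    return model + [data_name]

def standard_pipeline():
    # Conventional recipe: continued pre-training first, then instruction tuning.
    model = []
    model = finetune(model, "continued pre-training on new docs")
    model = finetune(model, "instruction tuning")
    return model

def pre_instruction_tuning():
    # The paper's recipe: instruction-tune *before* continued pre-training.
    model = []
    model = finetune(model, "instruction tuning")
    model = finetune(model, "continued pre-training on new docs")
    return model

print(standard_pipeline())
print(pre_instruction_tuning())
```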
jzb @rucjzb
@saizhang0 My question is "Is it normal to have clusters with >100 8*A100 nodes in academia"? 😀
1 reply · 0 reposts · 4 likes · 5.5K views
Sai Zhang @saizhang0
Someone is running jobs on 128 GPU nodes (~1000 A100) during holidays. Is this a normal PhD life?
85 replies · 56 reposts · 1K likes · 624.7K views
jzb reposted
Zora Wang @ZhiruoW
Everyone is using RAG, but most of the retrieved context is noisy! 🚨 Introducing FilCo: “Learning to Filter Context for Retrieval-Augmented Generation” TL;DR: Get rid of the irrelevant content using FilCo, and you'll get better outputs. Preprint: arxiv.org/abs/2311.08377
6 replies · 56 reposts · 284 likes · 51.1K views
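As a rough illustration of the filtering idea: FilCo itself learns the filter model, but a toy lexical-overlap heuristic (with made-up example passages) is enough to show the shape of "drop irrelevant context before generation":

```python
def filter_context(query, passages, threshold=0.2):
    """Toy stand-in for a learned context filter: keep only retrieved
    passages whose lexical overlap with the query clears a threshold."""
    q_tokens = set(query.lower().split())
    kept = []
    for p in passages:
        p_tokens = set(p.lower().split())
        overlap = len(q_tokens & p_tokens) / max(len(q_tokens), 1)
        if overlap >= threshold:
            kept.append(p)
    return kept

passages = [
    "Marie Curie won the Nobel Prize in Physics in 1903.",
    "The weather in Paris is mild in spring.",
]
print(filter_context("Which prize did Marie Curie win?", passages))
```

Only the prize passage survives; the generator then conditions on the filtered context rather than everything the retriever returned.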
jzb reposted
Zhiqing Sun @EdwardSun0909
🚀 Can RLAIF fully replace RLHF to align language models from scratch, enhancing both their alignment and capabilities? SALMON introduces a principle-following reward model in the realm of self-alignment, using just 6 ICL exemplars and 31 principles to outperform LLaMA-2-Chat!
5 replies · 90 reposts · 296 likes · 100K views
jzb reposted
Chunting Zhou @violet_zct
How do you turn a language model into a chatbot without any user interactions? We introduce LIMA: a LLaMA-based model fine-tuned on only 1,000 curated prompts and responses, which produces shockingly good responses. * No user data * No model distillation * No RLHF
27 replies · 222 reposts · 1K likes · 179.8K views
jzb reposted
Luyu Gao @luyu_gao
[1/4] Large language models (LLMs) tend to hallucinate, especially when generating long outputs. We present active retrieval augmented generation, in which an LLM actively decides when and what to retrieve throughout the generation process.
8 replies · 75 reposts · 484 likes · 131K views
jzb reposted
John Nay @johnjnay
Active LLM Retrieval Augmented Generation
- Iteratively uses a prediction of the upcoming sentence to anticipate future content, which is then used as a query to retrieve relevant docs and regenerate the sentence
- Superior or competitive on 4 long-form generation tasks
arxiv.org/abs/2305.06983
6 replies · 72 reposts · 393 likes · 133K views
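The loop these two tweets describe can be sketched as follows. The `lm` and `retriever` passed in here are deterministic stubs invented for illustration, not the paper's models: the point is only the control flow of generate, check confidence, retrieve with the tentative sentence, regenerate.

```python
# Sketch of an active-retrieval generation loop (FLARE-style): produce a
# tentative next sentence; if the model is unsure, use that sentence as a
# search query and regenerate it with retrieved evidence in context.
def generate_with_active_retrieval(lm, retriever, prompt,
                                   max_steps=5, conf_threshold=0.6):
    context, output = prompt, []
    for _ in range(max_steps):
        sentence, confidence = lm(context)  # tentative sentence + confidence
        if sentence is None:                # model signals it is done
            break
        if confidence < conf_threshold:
            docs = retriever(sentence)      # tentative sentence is the query
            sentence, confidence = lm(context + " " + " ".join(docs))
        output.append(sentence)
        context = context + " " + sentence
    return " ".join(output)

# Deterministic stand-ins for a language model and a retriever.
def stub_lm(context):
    if "evidence:" in context:
        return ("Joe Biden attended the University of Delaware.", 0.9)
    if "University" in context:
        return (None, 1.0)                  # nothing more to say
    return ("Joe Biden attended a university.", 0.3)

def stub_retriever(query):
    return ["evidence: Joe Biden attended the University of Delaware."]

print(generate_with_active_retrieval(stub_lm, stub_retriever,
                                     "Where did Joe Biden go to school?"))
```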
jzb reposted
Qian Liu @sivil_taram
🔥 If data is the oil, have we exhausted the mine? Introducing our latest work 🎉: "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning" 💪 💡 TL;DR: Generate tons of training examples via symbolic tasks to boost data quantity for instruction tuning 🚀 1/3
2 replies · 21 reposts · 90 likes · 14.4K views
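One way to picture "symbolic tasks as a data mine" is a generator whose output labels come from a symbolic executor rather than human annotators. This toy version mints arithmetic triples; the field names and the arithmetic task are illustrative, not the paper's (which centers on tasks like SQL execution):

```python
import random

def make_symbolic_examples(n, seed=0):
    """Toy generator in the spirit of symbolic-task instruction tuning:
    mint (instruction, input, output) triples whose labels come from a
    symbolic executor (here, Python arithmetic), so data is cheap to scale."""
    rng = random.Random(seed)
    examples = []
    for _ in range(n):
        a, b = rng.randint(1, 99), rng.randint(1, 99)
        examples.append({
            "instruction": "Compute the expression.",
            "input": f"{a} + {b}",
            "output": str(a + b),   # label produced by execution, not annotation
        })
    return examples

for ex in make_symbolic_examples(2):
    print(ex)
```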
jzb reposted
Jinlan Fu @JinlanFu
Can the text evaluator be customized for different/new evaluation aspects without training? Our GPTScore achieves customized, multi-faceted, and training-free evaluation using the emergent abilities of PLMs, e.g., instruction following and in-context learning. Paper: arxiv.org/pdf/2302.04166…
2 replies · 10 reposts · 47 likes · 11.8K views
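The training-free scoring idea reduces to an aggregate of token log-probabilities for the candidate text under an LM prompted with an aspect instruction. In this sketch the LM call is replaced by precomputed per-token log-probs, so the numbers are made up:

```python
def gptscore(token_logprobs):
    """GPTScore-style aggregate: average per-token log-probability of a
    candidate text under an LM conditioned on an aspect instruction
    (e.g., "Is this response fluent?"). Higher is better; no training."""
    return sum(token_logprobs) / len(token_logprobs)

# Hypothetical log-probs for a fluent vs. a disfluent candidate.
fluent_logprobs = [-0.2, -0.1, -0.3]
disfluent_logprobs = [-2.0, -1.5, -2.5]
print(gptscore(fluent_logprobs), gptscore(disfluent_logprobs))
```

Swapping the instruction in the prompt is what makes the same scorer cover different evaluation aspects.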
jzb reposted
Graham Neubig @gneubig
Retrieving information for QA is critical for reliability, but training separate retriever/reader models is cumbersome. At #EMNLP2022 we present Retrieval as Attention (ReAtt), a single retrieve/read model that is effective, generalizable, and adaptable arxiv.org/abs/2212.02027 🧵
1 reply · 26 reposts · 127 likes
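A toy picture of "retrieval as attention": rank documents by the softmax attention weight the question places on them. The vectors here are invented stand-ins; ReAtt uses the model's actual cross-attention scores inside a single retrieve-and-read transformer.

```python
import math

def retrieval_as_attention(query_vec, doc_vecs):
    """Rank documents by attention weight: softmax over query-document
    dot products, treating documents as keys the question attends to."""
    scores = [sum(q * d for q, d in zip(query_vec, dv)) for dv in doc_vecs]
    m = max(scores)                          # stabilize the softmax
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

weights = retrieval_as_attention([1.0, 0.0], [[0.9, 0.1], [0.1, 0.9]])
print(weights)  # first document gets the larger weight
```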
jzb reposted
Uri Alon @urialon1
📢 New Paper: Program-aided Language models Prompting methods such as chain-of-thought (@_jasonwei) employ LLM for decomposing the problem into steps *and* solving each step. Instead, PaL decomposes the problem into *programmatic* steps and solves using a Python interpreter. 1/4
13 replies · 99 reposts · 497 likes
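The PaL recipe in miniature: the model writes the reasoning steps as code, and the interpreter does the arithmetic. Here the "generated" program is hard-coded rather than sampled from an LLM, and the `answer` variable convention is an illustrative choice:

```python
# Hypothetical program text; in PaL this would be generated by an LLM
# prompted with few-shot examples of programmatic reasoning steps.
program = """
roger_balls = 5
cans = 2
balls_per_can = 3
answer = roger_balls + cans * balls_per_can
"""

def run_pal_program(program_text):
    """Execute a generated program in a fresh namespace and return its
    `answer`, so computation is done by the interpreter, not the LM."""
    namespace = {}
    exec(program_text, namespace)
    return namespace["answer"]

print(run_pal_program(program))  # 11
```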
jzb reposted
AI at Meta @AIatMeta
(1/4) Introducing EditEval, an instruction-based evaluation suite leveraging high-quality existing & new datasets for automatic evaluation of editing capabilities. At present, comprehensive eval of editing capabilities (i.e., fixing wrong info or reorganizing text) is lacking.
6 replies · 88 reposts · 524 likes
jzb reposted
Chunting Zhou @violet_zct
I'm excited to share our work on a new sequence modeling architecture called Mega: Moving Average Equipped Gated Attention. Mega achieves SOTA results on multiple benchmarks, including NMT, Long Range Arena, language modeling, ImageNet and raw speech classification.
8 replies · 56 reposts · 364 likes
jzb reposted
Graham Neubig @gneubig
MEGA is a new method for modeling long sequences based on the surprisingly simple technique of taking the moving average of embeddings. Excellent results, outperforming strong competitors such as S4 on most tasks! Strongly recommend that you check it out: arxiv.org/abs/2209.10655
[Quoting Chunting Zhou @violet_zct's Mega announcement, shown above]
2 replies · 29 reposts · 230 likes
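The core primitive, an exponential moving average taken over a sequence of embeddings, fits in a few lines. This sketch uses plain Python lists and a single scalar decay; MEGA's actual EMA is multi-dimensional, learned, and combined with gated attention.

```python
def ema_over_embeddings(embeddings, alpha=0.5):
    """Exponential moving average over a sequence of embedding vectors:
    each output mixes the current embedding with the running state,
    giving every position a smoothed summary of its history."""
    smoothed = []
    state = [0.0] * len(embeddings[0])
    for emb in embeddings:
        state = [alpha * x + (1 - alpha) * s for x, s in zip(emb, state)]
        smoothed.append(state)
    return smoothed

seq = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
print(ema_over_embeddings(seq))
```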
jzb reposted
Google DeepMind @GoogleDeepMind
Internship applications are now open! This year we have opportunities across various teams and offices 🌎🌍 Apply today via dpmd.ai/internships and learn more about the experience below ⬇️ #DeepMindInterns
Quoting Google DeepMind @GoogleDeepMind:
Former intern turned intern mentor, @reverettai, describes his journey to DeepMind, sharing tips and advice for aspiring DeepMinders. Learn more about #internships and #LifeAtDeepMind on our blog: dpmd.ai/RichardQA-TW
24 replies · 119 reposts · 408 likes
jzb reposted
Graham Neubig @gneubig
Happy to announce that I've formed a company, Inspired Cognition (inspiredco.ai) together with @stefan_fee and @odashi_en! Our goal is to make it easier and more efficient to build AI systems (particularly NLP) through our tools and expertise. 1/2
12 replies · 47 reposts · 438 likes