Daniel Justus

179 posts

Daniel Justus

@Daniels_Data

Research scientist at @graphcoreai. Working to make #AI more efficient.

London 参加日 Aralık 2017

490 フォロー中157 フォロワー

Daniel Justus がリツイート

Dobrik Georgiev@DobrikG·20 Şub

Query-based KG RAG is finally SOTA. 🚀 The results: 📈 +16-23% gains ⚡ Up to 167x faster processing 🧠 Inductive (works with unseen graphs/relations) 🎯 Zero-shot A joint work with our factuality team within @GCResearchTeam (1/X)🧵

English

9.8K

Daniel Justus がリツイート

Graphcore Research@GCResearchTeam·21 Kas

How does the structure of a Knowledge Graph influence model accuracy in #DrugDiscovery? Our comprehensive study with @AstraZeneca on the effects of graph topology on Knowledge Graph Completion models has just been published in Bioinformatics! Learn more in the paper and the blog post below! 👇

English

377

Daniel Justus がリツイート

Graphcore Research@GCResearchTeam·19 Kas

🚨 Graphcore is hiring AI Research Interns! 🚨 Join us to work at the intersection of hardware and AI and help shape the future of AI systems. Whether you're excited about efficient inference, large-scale training, or advancing frontier-model capabilities, we’ve got cutting-edge projects for you to dive into. Interested? Apply below 👇

English

972

Daniel Justus がリツイート

Graphcore Research@GCResearchTeam·6 Kas

Our picks for October’s Papers of the Month are here. Out of 49 shortlisted papers, we spotlight 4 that stand out for their clever ideas on making #LLMs faster, smarter, and more efficient! 📊 First up, Grouped Lattice Vector Quantisation introduces a novel technique for a fine-grained post-training quantisation of LLMs, retaining good performance even at low bit widths. 🌫️ In Planned Diffusion, @danielmisrael and colleagues combine autoregressive and diffusion models. While the autoregressive model creates a scaffold and plan, the diffusion model fills the gaps, achieving extremely low-latency text generation. 🤔 Is your LLM overthinking it? Rethinking Thinking addresses the problem of lengthy reasoning chains by bounding their thinking space and gradually distilling their thoughts, speeding up reasoning without losing depth. 🕸️ Finally, When Structure Doesn’t Help compares techniques for how LLMs read text attributed graphs. The results are rather surprising: sometimes, too much structure can hurt. Check out our summaries 👇

English

733

Daniel Justus がリツイート

Graphcore Research@GCResearchTeam·16 Eki

LLM using too many reasoning tokens? 😕 Generation slow? 🐌 Or simply too many steps before EOS? 🪜🪜🪜 Douglas Orr (@douglasahorr), our beloved research scientist, has got you covered! He will tell you the remedies to all of the above in the shortest time possible. Registration link in the 🧵 below! (Special thanks to @CodeWordsAI and @join_ef)

English

418

Daniel Justus がリツイート

Graphcore Research@GCResearchTeam·7 Tem

It's time for June's Papers of the Month! This time, we cover: ➡️Why Gradients Rapidly Increase Near the End of Training ➡️ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries ➡️Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation 🧵

English

388

Daniel Justus がリツイート

Graphcore Research@GCResearchTeam·4 Haz

As we hurtle into the summer, it’s time for May’s Papers of the Month! This month, we cover Parallel Scaling Laws for Language Models, Alpha Evolve, Soft Thinking and Spurious Rewards! 🧵

English

216

Daniel Justus がリツイート

Graphcore Research@GCResearchTeam·22 May

Our latest work uses theory from the '50s to figure out how to design weight quantisation formats for LLM inference. It's called Optimal Formats for Weight Quantisation and has just hit arXiv. 1/6

English

429

Daniel Justus がリツイート

Graphcore Research@GCResearchTeam·8 May

It's time for April's Papers of the Month! This month, we cover: ➡️ Motion Prompting: Controlling Video Generation with Motion Trajectories ➡️ Inference-Time Scaling for Generalist Reward Modeling ➡️ M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models! 🧵

English

359

Daniel Justus がリツイート

Graphcore Research@GCResearchTeam·7 Nis

Spring is here and so is Papers of the Month! In this March edition, we cover Transformers without Normalisation, Compute Optimal Scaling of Skills, Overtrained Language Models Are Harder to Fine-Tune, and Multi-Domain Distribution Learning for De Novo Drug Design! 🧵

English

558

Daniel Justus がリツイート

Graphcore Research@GCResearchTeam·6 Mar

February might have been the shortest month, but it wasn’t short of papers! In this edition of Papers of the Month, we cover Distillation Scaling Laws, Matryoshka Quantisation, ParetoQ, and Scaling Test-Time Compute with Latent Reasoning! 🧵

English

422

Daniel Justus がリツイート

Graphcore Research@GCResearchTeam·8 Oca

Each month our team writes up summaries and analysis of our favourite ML papers. For December we cover: The Byte Latent Transformer, Large Concept Models, Memory Layers & Phi-4 — all grouped under the title "Spend Your FLOPs Wisely". Here's what we made of them 🧵

English

629

Daniel Justus がリツイート

Graphcore Research@GCResearchTeam·2 Oca

We've written an interactive deep dive on Llama 3.2 Vision, alongside a full plain-PyTorch implementation (link in 🧵) Here's an attention head from the vision encoder in action - the implicit segmentation is quite impressive!

GIF

English

130

9.6K

Daniel Justus がリツイート

Graphcore@graphcoreai·11 Kas

Join us in creating the next generation of AI compute. We've just announced the creation of 75 new jobs at Graphcore. Check out the opportunities at graphcore.ai/jobs

English

1.8K

Daniel Justus がリツイート

Graphcore Research@GCResearchTeam·1 Eki

Our Papers of the Month for September is now live! We cover: - LLM self-correction via RL - Trillion-token FP8 training - SOAP (Shampoo + Adam) - Generative models for crystals All framed in terms of "proper conditioning" (🧵) graphcore-research.github.io/papers-of-the-…

English

4.3K

Daniel Justus がリツイート

Graphcore Research@GCResearchTeam·3 Eyl

Our Papers of the Month for August is now live! This time we're digging in to: Spectra, Scaling LLM Test-Time Compute, and Training Language Models on the Knowledge Graph 🧵 graphcore-research.github.io/papers-of-the-…

English

525

Daniel Justus がリツイート

Josef Dean@JosefNDean·20 Ağu

Sure matplotlib is cool, but what if I want to load my loss curves into the 2006 hit Flash game LineRider?

English

802

6.3K

438.2K

Daniel Justus がリツイート

Graphcore Research@GCResearchTeam·8 Ağu

We've written a roundup of ICML and the papers we found interesting. For all those keen on sparsity, speculative sampling and schnitzel... graphcore-research.github.io/posts/icml/

English

275

Daniel Justus がリツイート

Graphcore Research@GCResearchTeam·6 Ağu

Out latest edition of Papers of the Month is now available! This month we give our take on: Scaling Exponents, Million Expert MOE, Vocabulary Scaling Laws and RAG vs Long Contexts 🧵 graphcore-research.github.io/papers-of-the-…

English

9.9K

Daniel Justus がリツイート

Luka Ribar@luka_ribar·23 Tem

Excited to present our SparQ Attention paper tomorrow at @icmlconf ! If you're not around to chat to us in person, check out the recent blog graphcore-research.github.io/graphcore-rese… written by Luke explaining our method for speeding up long-sequence transformer inference!

English

141

ディスカバー

@GCResearchTeam @AstraZeneca @danielmisrael @douglasahorr @CodeWordsAI @join_ef @icmlconf @elonmusk