Hans-Peter Zorn

4K posts

Hans-Peter Zorn banner
Hans-Peter Zorn

Hans-Peter Zorn

@data_hpz

Head of AI @inovexgmbh #nlproc

Karlsruhe, Germany Katılım Nisan 2015
9.6K Takip Edilen9.6K Takipçiler
Hans-Peter Zorn
Hans-Peter Zorn@data_hpz·
How do you handle context loss when your AI agent works with large specifications? Tulla is my experimental open-source implementation of Semantic SDD. It is research, beware. But I am open for feedback. no warranties. github.com/hpzorn/tulla 6/6
English
0
0
1
74
Hans-Peter Zorn
Hans-Peter Zorn@data_hpz·
SDD: hand the construction crew a bunch of textfiles and hope they remember page 42. Semantic SDD: feed them verified instructions for the specific brick they're holding. 5/6
English
1
0
1
67
Hans-Peter Zorn
Hans-Peter Zorn@data_hpz·
Spec-Driven Development is the current trend: write a detailed spec, let the AI code. But there the fundamental flaw. 1/6
English
1
1
1
215
Hans-Peter Zorn retweetledi
David Pfau
David Pfau@pfau·
In retrospect, I probably spent too much time grumbling over shifting standards for what counts as "AGI" and not enough time focusing on the massive tidal wave of AI coming straight at us.
English
13
9
218
19.1K
Hans-Peter Zorn retweetledi
Lukas Ziegler
Lukas Ziegler@lukas_m_ziegler·
When robots take the night shift shopping spree! 🛍️ Robots navigate through dm-drogerie markt Deutschland stores at night to create a digital replica of the store's layout, known as a "digital twin." Developed Ubica Robotics GmbH, these autonomous robots scan shelves to provide real-time information about item positions, pricing, stock gaps, and store layouts. 🏪 This data serves multiple purposes, such as improving staff routes, enhancing inventory management, and informing the creation of planograms for more efficient store layouts. It combines digital twin with robotics and it's really cool use case. What are your thoughts? ~~ ♻️ Join the weekly robotics newsletter, and never miss any news → ziegler.substack.com
English
37
81
552
64.8K
Hans-Peter Zorn retweetledi
AI Coffee Break with Letitia
AI Coffee Break with Letitia@AICoffeeBreak·
How do LLMs pick the next word? They don’t choose words directly: they only output word probabilities. 📊 Greedy decoding, top-k, top-p, min-p are methods that turn these probabilities into actual text. In this video, we break down each method and show how the same model can sound dull, brilliant, or unhinged – just by changing how it samples.
AI Coffee Break with Letitia tweet media
English
1
8
39
2.4K
Hans-Peter Zorn retweetledi
Joel Grus 🤠
Joel Grus 🤠@joelgrus·
@growing_daniel unlike claude code, junior devs learn from their mistakes instead of having to be corrected for the same things over and over and over again until the sun burns out
Scenic Oaks, TX 🇺🇸 English
8
4
326
10.9K
Hans-Peter Zorn retweetledi
Keyon Vafa
Keyon Vafa@keyonV·
Can an AI model predict perfectly and still have a terrible world model? What would that even mean? Our new ICML paper formalizes these questions One result tells the story: A transformer trained on 10M solar systems nails planetary orbits. But it botches gravitational laws 🧵
English
210
989
6.7K
1.4M
Hans-Peter Zorn retweetledi
Rohan Paul
Rohan Paul@rohanpaul_ai·
This is really BAD news of LLM's coding skill. ☹️ The best Frontier LLM models achieve 0% on hard real-life Programming Contest problems, domains where expert humans still excel. LiveCodeBench Pro, a benchmark composed of problems from Codeforces, ICPC, and IOI (“International Olympiad in Informatics”) that are continuously updated to reduce the likelihood of data contamination.
Rohan Paul tweet media
English
100
304
2K
460.8K
Hans-Peter Zorn retweetledi
Omar Khattab
Omar Khattab@lateinteraction·
After ~6 years of building these types of architectures (starting with BERT, eg see Baleen), I think calling these multi-agent systems is a distraction. This is just software. Happens to be AI software. It doesn’t seem so complicated once you internalize it’s just a program.
Omar Khattab tweet media
Anthropic@AnthropicAI

New on the Anthropic Engineering blog: how we built Claude’s research capabilities using multiple agents working in parallel. We share what worked, what didn't, and the engineering challenges along the way. anthropic.com/engineering/bu…

English
40
120
1.3K
228.6K
Hans-Peter Zorn retweetledi
prayingforexits 🏴‍☠️
Bro how was the show Silicon Valley so consistently 10 years ahead of its time 🤣
English
189
2K
27.6K
1.5M
Hans-Peter Zorn retweetledi
Bojan Tunguz
Bojan Tunguz@tunguz·
Bojan Tunguz tweet media
ZXX
69
243
3.3K
221.8K
Hans-Peter Zorn retweetledi
Frank Hutter
Frank Hutter@FrankRHutter·
The data science revolution is getting closer. TabPFN v2 is published in Nature: nature.com/articles/s4158… On tabular classification with up to 10k data points & 500 features, in 2.8s TabPFN on average outperforms all other methods, even when tuning them for up to 4 hours🧵1/19
Frank Hutter tweet media
English
36
243
1.4K
263K