Fabian Schreiber

2.6K posts

Fabian Schreiber

Fabian Schreiber

@Fabschreiber

Medical Scientist @Charité, AI Sepsis researcher previous BI, IBM, EBI, Sanger

Berlin, Germany Katılım Şubat 2009
409 Takip Edilen329 Takipçiler
elvis
elvis@omarsar0·
That’s important. Like when developing a skill the best approach I have found to optimize the thing is to test it on actual tasks. And improve it progressively. Indexing everything all at once might be a trap like you said. Like with everything I feel like the test and verification are important for it to be of any use.
English
1
0
5
933
elvis
elvis@omarsar0·
A few notes on how to get started with building LLM Knowledge Bases. @karpathy popularized it but most people don't know where to start. Everyone should be creating LLM Wikis. Live session tomorrow. Shared a repo example and a Skill coming soon. academy.dair.ai/blog/how-to-bu…
elvis tweet media
English
19
26
196
20K
David Dao
David Dao@dwddao·
I created a team of coding agents that were productively building and coordinating with each other. However, once I added a "manager" agent, it started pinging the engineers every 10 seconds for product feedback, interrupting them in their flow 😂 I don't know what is more real
David Dao tweet mediaDavid Dao tweet media
English
1
1
11
524
Rosa Fernández
Rosa Fernández@Rosamygale·
Second day of our workshop @IBE_Barcelona learning about genome annotation with BRAKER, TSEBRA, GALBA, Tiberius & more with the one and only Katharina Hoff! 🧬🧩🪱
Rosa Fernández tweet media
English
2
3
33
1.3K
Fabian Schreiber retweetledi
Philipp Schmid
Philipp Schmid@_philschmid·
This came unexpected! @OpenAI released Swarm, a lightweight library for building multi-agent systems. Swarm provides a stateless abstraction to manage interactions and handoffs between multiple agents and does not use the Assistants API. 🤔 How it works: 1️⃣ Define Agents, each with its own instructions, role (e.g., "Sales Agent"), and available functions (will be converted to JSON structures). 2️⃣ Define logic for transferring control to another agent based on conversation flow or specific criteria within agent functions. This handoff is achieved by simply returning the next agent to call within the function. 3️⃣ Context Variables provide initial context and update them throughout the conversation to maintain state and share information between agents. 4️⃣ Client run() initiate and manage the multi-agent conversation. It needs an initial agent, user messages, and context and returns a response containing updated messages, context variables, and the last active agent. Insights: 🔄 Swarm manages a loop of agent interactions, function calls, and potential handoffs. 🧩 Agents encapsulate instructions, available functions (tools), and handoff logic. 🔌 The framework is stateless between calls, offering transparency and fine-grained control. 🛠️ Swarm supports direct Python function calling within agents. 📊 Context variables enable state management across agent interactions. 🔄 Agent handoffs allow for dynamic switching between specialized agents. 📡 Streaming responses are supported for real-time interaction. 🧪 The framework is experimental. Maybe to collect feedback? 🔧 Flexible and works with any OpenAI client, e.g., Hugging Face TGI or vLLM-hosted models.
Philipp Schmid tweet media
English
64
328
2.4K
259.8K
Fabian Schreiber
Fabian Schreiber@Fabschreiber·
@HamelHusain timely AF! I have a customer meeting next week and want to think about evals right from the start
English
0
0
0
170
Hamel Husain
Hamel Husain@HamelHusain·
I've condensed my advice about how to think about LLM evals into this 15 min talk. I follow the same process on every project to improve AI products. We show how we use evals to create a data flywheel to move from demo to production-ready products.
Hamel Husain tweet media
AI Engineer@aiDotEngineer

How to construct domain-specific LLM evaluation systems A talk by @HamelHusain, LLM consultant at Parlance Labs, and Emil Sedgh, CTO of @rechathq youtube.com/watch?v=eLXF0V…

English
5
31
265
35.1K
Fabian Schreiber retweetledi
Vaibhav (VB) Srivastav
Vaibhav (VB) Srivastav@reach_vb·
Open Source AI/ML is on fire today! 🔥 Multilingual (29) Qwen 2.5 just dropped w/ 128K context too! The 72B rivals Llama 3.1 405B and beats Mistral Large 2 (123B) ⚡ > Trained on an extensive dataset containing up to 18 trillion tokens > It surpasses its predecessor, Qwen2, with significantly higher scores on MMLU (85+), HumanEval (85+), and MATH (80+) benchmarks > Excels in instruction following, generating lengthy texts (over 8K tokens), and understanding structured data like tables. It also shows significant progress in generating structured outputs, particularly JSON. > Supports over 29 languages, including major global languages, and can handle up to 128K tokens, with a text generation capacity of 8K tokens. They release specialised models as well: 1. Qwen2.5: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B 2. Qwen2.5-Coder: 1.5B, 7B, and 32B on the way 3. Qwen2.5-Math: 1.5B, 7B, and 72B. Kudos to @Alibaba_Qwen team for shipping high quality model checkpoints! 🐐
Vaibhav (VB) Srivastav tweet media
English
5
48
253
69K
Fabian Schreiber retweetledi
Ben Klieger
Ben Klieger@benklieger·
Inspired by the new o1 model, I hacked together g1, powered by Llama-3.1 on @GroqInc. It uses reasoning chains to solve problems. It solves the Strawberry problem ~70% of the time, with no fine tuning or few shot techniques. A thread 🧵 (with GitHub repo!)
Ben Klieger tweet mediaBen Klieger tweet media
English
55
125
763
260.6K
Kai Kupferschmidt
Kai Kupferschmidt@kakape·
Wir haben gestern Abend noch eine kurze Folge @pandemiapodcast aufgenommen, um über die aktuellen Entwicklungen bei #mpox zu sprechen. Die kommt dann Anfang der nächsten Woche. Aber zur Vorbereitung hab ich mir noch einmal unsere mpox-Folge von Ende Juni angehört und ganz ehrlich da ist eigentlich alles drin. Also hört euch das ruhig (nochmal) an: podcasts.apple.com/de/podcast/pan…
Deutsch
2
12
62
6K
Dan Jackson
Dan Jackson@northumbriana·
Pleased to report that much of Northern England looks absolutely idyllic this evening.
Dan Jackson tweet media
English
30
78
2.5K
87.5K
Fabian Schreiber
Fabian Schreiber@Fabschreiber·
@katharina_hoff Sorry to hear that! I fully support you, especially the concept of family friendly meetings. I mean, even those 1% have families, don't they? Keep fighting, it will be worth it!
English
0
0
0
98
Fabian Schreiber
Fabian Schreiber@Fabschreiber·
@koehrsen_will +1 for VsCode's debugging tools. They make debugging so easy (as compared to jupyter notebooks).
English
0
0
1
23
Fabian Schreiber retweetledi
Slava Bobrov
Slava Bobrov@slava__bobrov·
Brain cells vs viruses vs bacteria. Microorganisms size comparison: #neuroscience
English
17
364
1.2K
78.7K
Erik Sonnhammer
Erik Sonnhammer@eriksonnhammer·
Vårt projekt "Spatially and temporally resolved gene regulation networks to understand liver cancer formation" var ett av de 224 som fick anslag från @Cancerfonden! Tack alla som skänker gåvor till forskningen och gör detta möjligt! #tillsammansmotcancer #cancerfonden
Svenska
1
0
3
151
Daniel Wefers
Daniel Wefers@DanielWefers·
Wer hat noch nicht?
Daniel Wefers tweet media
Deutsch
8
0
11
2.6K
Miguel Robitzky
Miguel Robitzky@miguelrausa·
möchte hier noch jemand zu bluesky rübergerettet werden?
Deutsch
28
1
20
3.7K
𝙧𝙤𝙩𝙚𝙧 𝙥𝙖𝙣𝙙𝙚𝙧
Solltet ihr einen Code für die andere App haben, oder noch einen suchen, schreibt es mal in die Kommentare. Hat letztes Mal gut geklappt und vielleicht kriegen wir so noch ein paar Leute rüber 😊 Löscht bitte euren Kommentar, wenn ihr euren Code habt/losgeworden seid
Deutsch
2.2K
91
1.1K
562.7K
Fabian Schreiber retweetledi
Jeremy Howard
Jeremy Howard@jeremyphoward·
Here, in full directly on Twitter, is "A Hackers' Guide to Language Models". This 90 minute tutorial is designed to be the one place I point coders at when they ask "hey, tell me everything I need to know about LLMs!" It covers both @OpenAI models and open source ones in depth.
English
78
1K
6K
776.7K