Dr. Daniel Bender

8.9K posts

Dr. Daniel Bender banner
Dr. Daniel Bender

Dr. Daniel Bender

@drdanielbender

Yes, I'm obsessed with 🦞 @OpenClaw. No, I won't blindly trust it ◆ I teach you the way of responsible AI: your data, your rules ◆ PhD in computer science ◆ Dad

Germany 🇩🇪 Katılım Ocak 2022
1.3K Takip Edilen5K Takipçiler
Sabitlenmiş Tweet
Dr. Daniel Bender
Dr. Daniel Bender@drdanielbender·
Ever wondered how model quantization (FP16, Q8, Q4) *really* affects performance? There's an analogy that makes the trade-offs crystal clear... and it involves something you might drink. 😉🍺 Kudos to @jtdavies for this brilliant comparison. 🙏 See the image for the full explanation! 👇
Dr. Daniel Bender tweet media
English
1
2
17
3K
Dr. Daniel Bender
Dr. Daniel Bender@drdanielbender·
The AI event in Cologne mentioned at the end of the space: x.com/WolframRvnwlf/…
Wolfram Ravenwolf@WolframRvnwlf

I'm speaking at AIDev 6 in Cologne on 2 June about WolfBench.ai and why one score is not enough for evaluating AI agents. Agent performance depends on more than the model: harnesses, tools, task design, reliability, and real-world failure modes matter. A leaderboard number alone won't tell you whether an agent will actually survive contact with production. Excited to discuss practical agent evals – and to hear @jphme on secure online agent deployment. Registration is free but limited. Link in comments.

English
0
0
2
58
Dr. Daniel Bender
Dr. Daniel Bender@drdanielbender·
The Google I/O demo at creating a stylized video from a video input and guiding images looked outstanding. It was stated that the model is available from today world-wide in the Gemini App. So far, it is not available for me in Germany. 🥲
Google DeepMind@GoogleDeepMind

We’re dropping Gemini Omni: our first step towards a model that can create anything from anything - starting with video. It combines Gemini’s intelligence with our generative media systems - representing a leap forward in world understanding, multimodality, and editing 🧵

English
0
0
0
181
Dr. Daniel Bender
Dr. Daniel Bender@drdanielbender·
@karpathy @IrenaCronin Super nice, looking forward to what you will push forward. That is a big win for Anthropic. Congrats to them for hiring one the most recognized AI masterminds!
English
0
0
0
67
Andrej Karpathy
Andrej Karpathy@karpathy·
Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.
English
7.1K
9.8K
128.4K
18.4M
Dr. Daniel Bender
Dr. Daniel Bender@drdanielbender·
@NousResearch Is the video created with the Hermus video skill? Looks amazing! How much work was it to create the video?
English
0
0
0
65
Nous Research
Nous Research@NousResearch·
Hermes Agent v0.14.0 - “The Foundation Release” Changelog below
English
219
439
4.4K
575.9K
Dr. Daniel Bender
Dr. Daniel Bender@drdanielbender·
The hard question is no longer how to build it. It’s whether we should build it. For teams, agreeing on what to build is becoming the bottleneck in the age of AI agents. For solo developers, that bottleneck is smaller. You can just decide. That might be one of the underrated advantages of building alone right now.
GIF
English
0
0
3
152
Dr. Daniel Bender
Dr. Daniel Bender@drdanielbender·
@Scobleizer As much as I love local AI, for coding I still prefer to work with the best available model as a single mistake can cost you hours in time and millions in tokens.
English
1
0
2
123
Dr. Daniel Bender
Dr. Daniel Bender@drdanielbender·
If you enjoy working and playing with AI models - and you're near Cologne, Germany - AIDEV 6 on June 2nd is the place to be!👇 I still have great memories of my first AIDEV 3. It was where I met @WolframRvnwlf and @jtdavies in person for the first time, and we've stayed in touch ever since. I will be there and can't wait for another great event!
Wolfram Ravenwolf@WolframRvnwlf

I'm speaking at AIDev 6 in Cologne on 2 June about WolfBench.ai and why one score is not enough for evaluating AI agents. Agent performance depends on more than the model: harnesses, tools, task design, reliability, and real-world failure modes matter. A leaderboard number alone won't tell you whether an agent will actually survive contact with production. Excited to discuss practical agent evals – and to hear @jphme on secure online agent deployment. Registration is free but limited. Link in comments.

English
0
1
4
336
Dr. Daniel Bender
Dr. Daniel Bender@drdanielbender·
@WolframRvnwlf Thats cool! I will be there and look forward to getting the latest insights from your evaluations.
English
0
0
1
28
Wolfram Ravenwolf
Wolfram Ravenwolf@WolframRvnwlf·
I'm speaking at AIDev 6 in Cologne on 2 June about WolfBench.ai and why one score is not enough for evaluating AI agents. Agent performance depends on more than the model: harnesses, tools, task design, reliability, and real-world failure modes matter. A leaderboard number alone won't tell you whether an agent will actually survive contact with production. Excited to discuss practical agent evals – and to hear @jphme on secure online agent deployment. Registration is free but limited. Link in comments.
Wolfram Ravenwolf tweet media
English
2
0
6
708
Dr. Daniel Bender
Dr. Daniel Bender@drdanielbender·
Prepare to get organized. Personal AI gets practical when it takes the chaos you already have in your inboxes, notes, documents, and photos and turns it into the next actions that actually matter. The best version of this is agent-first and tool-agnostic.
GIF
English
0
0
2
155
Dr. Daniel Bender retweetledi
Wolfram Ravenwolf
Wolfram Ravenwolf@WolframRvnwlf·
There's FOMO: Fear Of Missing Out. There's FOMAT: Fear Of Missing Agent Time. And then there's FOMUT - the next level of agent neurosis: Fear Of Missing Unused Tokens. > That moment when the limit resets while unused tokens remain - and a precious resource simply evaporates. 💨
Wolfram Ravenwolf tweet media
English
1
1
8
504
Dr. Daniel Bender
Dr. Daniel Bender@drdanielbender·
@JeremyNguyenPhD I have two Hermes agents and one of them is running only local models. Currently I am using Qwen 3.6 27B, but the models are evolving so fast that this will likely change quite soon again.
English
0
0
1
30
Dr. Daniel Bender
Dr. Daniel Bender@drdanielbender·
OpenAI just announced they are shutting down 20+ models between July and October. GPT-4, GPT-4o, o1, o3-mini, and many others will stop working. If you built anything on these, you have three months to rewrite and re-test everything. This is what building on quicksand looks like.
GIF
English
2
1
2
292
Dr. Daniel Bender
Dr. Daniel Bender@drdanielbender·
@Tance_Essence @omarsar0 You can already do some quite helpful stuff with a Mac with +16 GB of system memory. But yes, with a high end computer you would be able to run better models locally.
English
0
0
0
24
elvis
elvis@omarsar0·
You don't have to choose between either. It's best to use a combination of them. My advice is to learn how to use a few of these models in different harnesses. Learn to combine their strengths. Open-weight models are just as good these days. Give yourself the flexibility.
elvis tweet media
English
16
6
40
7.2K
Dr. Daniel Bender
Dr. Daniel Bender@drdanielbender·
If you're using Hermes Agent, take a closer look at the built-in tools and skills. There are 100+ built in, and you definitely won't need all of them. Deactivate what you don't use with `hermes skills config` and `hermes tools list/deactivate`.
Dr. Daniel Bender tweet media
English
0
0
1
176
Dr. Daniel Bender
Dr. Daniel Bender@drdanielbender·
@icreatelife Fully agree, Kris! I just try to share the message that there are privacy-friendly options if needed. But yeah, it is crazy what is possible today, be it with cloud or local models.
English
0
0
0
14
Kris Kashtanova
Kris Kashtanova@icreatelife·
@drdanielbender I do have a computer :) and run models locally but most people I have don't. It's good to have a lot of options.
English
1
0
1
13