David Mezzetti

2.2K posts

@DavidMezzetti

Founder @neumll | Creator of TxtAI

Washington DC Metro 🇺🇸 Joined February 2013
35 Following · 558 Followers

David Mezzetti @DavidMezzetti
One common take I've seen on the LiteLLM breach is this: pin your dependencies to a specific version. Yes, this would fix this specific issue but the vast majority of security risks are found and patched over time. You're likely more vulnerable if you don't upgrade vs if you do.

David Mezzetti retweeted
NeuML @neumll
This 970K parameter model retains 98% of the performance of the original 110M medical embeddings model. huggingface.co/NeuML/biomedbe…

David Mezzetti retweeted
NeuML @neumll
TxtAI has embeddings databases, pipelines, agents and workflows. One little-known but powerful feature of TxtAI is that it can export any of its functionality as an OpenAI endpoint. Check out this example for now. github.com/neuml/txtai/bl…

David Mezzetti @DavidMezzetti
It's always a good idea to rotate your keys on a recurring basis and/or have an expiration date to force this.
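
As a rough illustration of the expiration idea, here is a minimal sketch that flags keys past a maximum age. The `ApiKey` record, its field names and the 90-day window are all hypothetical, made up for this example and not tied to any particular secrets manager.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass
class ApiKey:
    """Hypothetical record for a stored credential (illustration only)."""

    name: str
    created: datetime
    max_age: timedelta

    def expires(self) -> datetime:
        # The key is considered expired once it is older than max_age
        return self.created + self.max_age

    def needs_rotation(self, now: datetime) -> bool:
        return now >= self.expires()


def keys_to_rotate(keys, now=None):
    """Return the names of all keys that are past their expiration date."""
    now = now or datetime.now(timezone.utc)
    return [key.name for key in keys if key.needs_rotation(now)]


# Example: a 90-day rotation policy (assumed value)
now = datetime(2025, 1, 1, tzinfo=timezone.utc)
keys = [
    ApiKey("pypi-token", now - timedelta(days=100), timedelta(days=90)),
    ApiKey("hf-token", now - timedelta(days=10), timedelta(days=90)),
]
print(keys_to_rotate(keys, now))
```

A check like this could run on a schedule, so that rotation is forced rather than left to memory.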

David Mezzetti @DavidMezzetti
An automated publishing workflow is convenient but is it necessary? Thinking about the LiteLLM compromise yesterday, a manual step to push to PyPI certainly could have helped. Why? Because the API token wouldn't have been stored on a server.

David Mezzetti @DavidMezzetti
PSA for anyone who uses LiteLLM and installed a version this morning: luckily the package is now quarantined, but it's a good idea to rotate your tokens if you did use it. github.com/BerriAI/litell…

David Mezzetti @DavidMezzetti
A simple JEPA world model in 15M parameters! This is exactly the kind of thinking we need, not just brute-forcing the problem with more GPUs. It's clear that efficiency is not good for everyone though. arxiv.org/abs/2603.19312…

David Mezzetti @DavidMezzetti
💬 Talk is much cheaper than action. AI is amazing in many ways but it has trade-offs like anything else.

David Mezzetti retweeted
NeuML @neumll
Our PubMedBERT embeddings model is the most downloaded open model for medical vector embeddings. Over 1M downloads this month! huggingface.co/NeuML/pubmedbe…

David Mezzetti retweeted
NeuML @neumll
One of the most accessed notebooks on TxtAI's GitHub page covers the Semantic Graph. TxtAI can automatically build a graph of related nodes using its vector similarity model. Learn more here. github.com/neuml/txtai/bl…
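
To make the general idea concrete, here is a minimal sketch of building a graph from vector similarity. It is not TxtAI's implementation: the toy three-dimensional vectors, the `build_graph` helper and the `min_score` threshold are all assumptions for illustration. Nodes are linked when their cosine similarity clears the threshold.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def build_graph(vectors, min_score=0.8):
    """Link every pair of nodes whose similarity is at least min_score."""
    graph = {node: [] for node in vectors}
    nodes = list(vectors)
    for i, left in enumerate(nodes):
        for right in nodes[i + 1:]:
            if cosine(vectors[left], vectors[right]) >= min_score:
                # Undirected edge: record it on both nodes
                graph[left].append(right)
                graph[right].append(left)
    return graph


# Toy vectors standing in for real embeddings (assumed values)
vectors = {
    "flu symptoms": [0.9, 0.1, 0.0],
    "influenza signs": [0.8, 0.2, 0.0],
    "stock prices": [0.0, 0.1, 0.9],
}
graph = build_graph(vectors)
print(graph)
```

With these toy values, the two flu-related nodes end up connected and the finance node stays isolated.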

David Mezzetti @DavidMezzetti
Knowing what to build is much more important than the how. The best question an engineer can ask is "Did you think of doing this instead of that?" So regardless of the technical stack, that part of the process is still an important human task.

David Mezzetti @DavidMezzetti
Local AI might not always be the easiest way to do it but it's the only way you can guarantee control over your stack. Sending your every thought and request to AI Vendor APIs isn't the best idea for you long term.

David Mezzetti retweeted
NeuML @neumll
Deep dive into the new TxtAI Agents Toolkit. It's worth the watch if you care about Local AI. youtube.com/watch?v=RDNaFX…

David Mezzetti @DavidMezzetti
Vector search works well for non-exhaustive relevance-driven search. What if you need to run an exhaustive search? Straight keyword search with term expansion could be the best solution. There is no one-size-fits-all retrieval strategy.
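
As a sketch of what exhaustive keyword search with term expansion could look like, the example below scans every document and returns all matches, rather than a ranked top-k cutoff. The `synonyms` table, the sample documents and the helper names are hypothetical; a real expansion source might be a thesaurus or a domain ontology.

```python
import re


def tokenize(text):
    """Lowercase a string and split it into a set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def expand(terms, synonyms):
    """Add known synonyms for each query term."""
    expanded = set(terms)
    for term in terms:
        expanded.update(synonyms.get(term, ()))
    return expanded


def exhaustive_search(query, documents, synonyms):
    """Return the index of every document containing any expanded term."""
    terms = expand(tokenize(query), synonyms)
    return [i for i, text in enumerate(documents) if terms & tokenize(text)]


# Hypothetical data for illustration
documents = [
    "Aspirin reduces fever",
    "Acetaminophen for pyrexia in children",
    "Quarterly revenue report",
]
synonyms = {"fever": {"pyrexia"}}

print(exhaustive_search("fever", documents, synonyms))
```

Every document is checked, so nothing that matches an expanded term is left out, which is the property a top-k vector search does not guarantee.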