Michael Chenetz

3.4K posts

Michael Chenetz banner
Michael Chenetz

Michael Chenetz

@mchenetz

Host of #TechNOut #podcast | #futurist | #AI | #cloudnative | #contentcreator | #guitar

Philly Katılım Eylül 2009
3.7K Takip Edilen1.6K Takipçiler
Michael Chenetz
Michael Chenetz@mchenetz·
Built and open-sourced SPLAI: a distributed AI execution fabric that treats AI workloads like Kubernetes treats containers. What makes it different: ✅ Distributed-first execution across CPU/GPU/edge workers ✅ DAG-based planning + policy-aware scheduling ✅ OpenAI-compatible API layer for drop-in integration ✅ Centralized model prefetch to workers (including Hugging Face) ✅ Optional auto-install of missing models on workers at request time ✅ End-to-end observability, retries, leases, and audit trails ✅ Runs local, on VMs, or on Kubernetes with Helm This is for teams that want production AI execution without being locked into centralized GPU stacks. If you’re building serious AI pipelines and want infra-aware orchestration, I’d love feedback from operators, platform engineers, and AI teams. github.com/mchenetz/SPLAI #AIInfrastructure #MLOps #PlatformEngineering #Kubernetes #OpenSource #LLMOps #DistributedSystems #DevOps #SPLAI
Michael Chenetz tweet media
English
0
0
0
90
Michael Chenetz
Michael Chenetz@mchenetz·
One of the coolest business cards i ever got was that time i met @stevewoz ! I love the metal business card!
Michael Chenetz tweet media
English
0
0
3
253
Darren Shepherd
Darren Shepherd@ibuildthecloud·
@mchenetz @llama_index @langchain It's a terrible terrible rabbit hole. Lang Chain is the starter project to get your feet wet and so that you can abandon it and start doing real work. What you really need at the end of the day is the OpenAI Chat Completion API and maybe the assistant API.
English
1
0
1
33
Darren Shepherd
Darren Shepherd@ibuildthecloud·
@mchenetz @llama_index What gptscript is really good at is really interacting with systems, not data. But data is often a system. And that's really the best way to do it. So use vector DBs to create a search API, once you have that the next step (do RAG) is trivial.
English
2
0
2
102
Darren Shepherd
Darren Shepherd@ibuildthecloud·
@mchenetz @llama_index There's nothing really. We were doing that but then abandoned it. It's a good question why and I don't have a great answer. What are you trying to accomplish?
English
1
0
0
38
Michael Chenetz
Michael Chenetz@mchenetz·
Great news for one of the tools i use the most for #AI development with local models!
ollama@ollama

Ollama 0.2 is here! Concurrency is now enabled by default. ollama.com/download This unlocks 2 major features: Parallel requests Ollama can now serve multiple requests at the same time, using only a little bit of additional memory for each request. This enables use cases such as: - Handling multiple chat sessions at the same time - Hosting code completion LLMs for your team - Processing different parts of a document simultaneously - Running multiple agents at the same time Run multiple models Ollama now supports loading different models at the same time. This improves several use cases: - Retrieval Augmented Generation (RAG): both the embedding and text completion models can be loaded into memory simultaneously. - Agents: multiple versions of an agent can now run simultaneously - Running large and small models side-by-side Models are automatically loaded and unloaded based on requests and how much GPU memory is available.

English
0
0
1
225
Michael Chenetz
Michael Chenetz@mchenetz·
Ever wonder what the "Temperature" setting does in Generative AI? It can significantly impact your output. Check out the chart below to understand what the values mean. Essentially, it determines how creative the output should be. #AI #GenerativeAI #Technology #Innovation
Michael Chenetz tweet media
English
0
0
0
127
Michael Chenetz
Michael Chenetz@mchenetz·
I had so much fun chatting with Carlos Santana on #CloudUnfiltered! Carlos has such an amazing view on where the industry is going and always keeps his eye on the future. Check out the latest episode to here this really engaging conversation! @OutshiftByCisco #EKS @AWS #devops #platformengineering #AI
Carlos Santana@csantanapr

🌟 I’m thrilled to share my recent podcast interview on Cloud Unfiltered with @mchenetz In this episode, we explored: - The need for standardized platforms to streamline cloud-native development. - How Platform Engineering bridges the gap between feature engineers and infrastructure teams. - The innovative CNOE project, aiming to simplify the creation and management of internal developer platforms. It’s a deep dive into the practical aspects and strategic importance of Platform Engineering, especially for organizations leveraging AWS and EKS. 📺 Watch to the full interview on youtube: youtube.com/watch?v=N_TcGE… 🎙️ Listen to the interview on your podcast player: cloudunfiltered.substack.com/p/the-world-of…

English
1
1
3
390
Robert Sirchia
Robert Sirchia@robertsirc·
Yes emergency dental work needed today.
Robert Sirchia tweet media
English
8
0
15
976
Michael Chenetz
Michael Chenetz@mchenetz·
Created a new #opensource #AI tool for converting your videos into audio, transcribing them and then auto generating whatever you want from #OpenAI. #podcast @OpenAI This was created out of frustration when i would have to do all these things manually. Now i have a batch script that runs that does the following with this tool: 1. Converts video to audio 2. Transcribes 3. Creates Youtube Description File Monitor: TBD Web Interface: TBD Docker Image: TBD Self Hosted Transcribe: TBD More to come! Released in the next week or so.
Michael Chenetz tweet media
English
0
0
0
184
Anaïs Urlichs
Anaïs Urlichs@urlichsanais·
Let me brag: I did all this myself: painting wall & ceiling, cutting wood panels and applying wallpaper -- every guest using the WC is going to be amazed 💁🏻‍♀️
Anaïs Urlichs tweet mediaAnaïs Urlichs tweet media
English
7
1
59
3.5K