

TechMD
2.4K posts

@TechMDAI
🔬 MD fueled by a deep passion for Medicine & Tech. 🌐 Exploring the frontiers of VR/AR/MR & BCI. 🤖 AI enthusiast.



Some of you noticed limits drained faster in Codex, we root caused it to an optimization that we rolled back that had an impact on cache hit rates when compacting across long running sessions. We fixed this and have now reset usage limits for all accounts. Enjoy the weekend.

CODEX LIMITS ARE FIXED!


got vllm studio running from my mac mini against the dgx spark today added hermes as a selectable agent runtime, wired it through an openai compatible bridge, fixed lan access, and imported my local model zoo into launchable recipes 21 models detected 12 vllm recipes 9 llama.cpp gguf recipes this is the kind of workflow i want locally mac mini as the control surface dgx spark as the inference box hermes as the operator layer vllm studio as the model dashboard no cs degree, just building the stack piece by piece with ai as the teacher




We are making our discount permanent! 🎉 Enjoy building with DeepSeek-V4-Pro and bring your innovative ideas to life! 🚀