EchoFox

1.1K posts

EchoFox

@FoxEcho8

On a comeback

Katılım Temmuz 2022

1.4K Takip Edilen77 Takipçiler

EchoFox retweetledi

hopecore@dailyhopecores·5d

ZXX

5.1K

39.8K

476.1K

EchoFox retweetledi

Uncle Ruckus@Emarged·5d

“Until death, all defeat is psychological."

scar@imfat

Can a 29-year-old start all over again?

English

606

67.6K

395.7K

16.5M

EchoFox retweetledi

kipp@sulfuroxideseer·6d

Anarch97@anarch97

Edits are probably the most important art form right now

ZXX

123

10.3K

113.8K

2.3M

EchoFox retweetledi

JT@jiratickets·6d

karpathy pulling up to the office for his first day on the research team

MTS@MTSlive

SITUATION DETECTED: Andrej @Karpathy has joined Anthropic.

English

723

14.7K

879.4K

EchoFox@FoxEcho8·19 May

Damn

Andrej Karpathy@karpathy

Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

English

EchoFox retweetledi

Kitten 🐈@kitten_beloved·12 May

This is like one of those audio journals you find on a corpse in a sci-fi horror game

kache@yacineMTB

At my company we stopped doing code reviews. There's no point now

English

111

39.8K

804.4K

EchoFox retweetledi

des@dotnetschizo·9 May

sorry bro i’ll make the bios interface using react next time

Bealyread@bealyread

The worst bios interface award goes to

English

207

2.1K

45.6K

929.4K

EchoFox retweetledi

LaurieWired@lauriewired·7 May

@kayleecodez hate to say it, but everyone that rejects kubernetes inevitably ends up recreating it from first principles lol

English

1.2K

240K

EchoFox retweetledi

ani@anirudhbv_ce·8 May

We finally know why LLMs hallucinate. It's not the model. It's the geometry. @OpenAI text-embedding-3-large: 91/3072 dimensions do real work. @GeminiApp gemini-embedding-001: 80/3072 dimensions do real work. ~97% of your vector database is mathematically empty. Your RAG system is retrieving from noise. @ashwingop and I present "The Geometry of Consolidation" - a proof that RAG compression has a hard floor no algorithm can beat, set by a single spectral number your embedding model cannot escape. Every hallucination your RAG pipeline produces? This is why. Paper + results: github.com/niashwin/geome…

English

148

458

3.7K

272.1K

EchoFox retweetledi

gaurav@gaxrav·4 May

adulting is basically arriving at the same truths as your father, but from first principles.

English

106

3.2K

30.8K

1.3M

EchoFox retweetledi

Nand@n@nandantwts·4 May

Kill Them With Your Success And Bury Them With Your Smile Vijay took it seriously 🫡

English

845

6.5K

117.3K

EchoFox retweetledi

SemiAnalysis@SemiAnalysis_·16 Nis

to be clear, NVIDIA is NOT a car

English

195

2.7K

280.7K

EchoFox@FoxEcho8·7 Nis

@sriniksv Trying to msg you on twitter regarding a suggestion and can't seem to do it. Could you msg me?

English

Srinivas@sriniksv·6 Nis

Mohan Reddyᴿᴱᴮᴱᴸᵂᴼᴼᴰ@MohanReddy92380

Today 5000 steps only

ZXX

1.5K

EchoFox@FoxEcho8·6 Nis

@livingdevops What do u think is a good course or resource for MLOps? I have been looking for them online.

English

Akhilesh Mishra@livingdevops·5 Nis

Everyone tells you companies use Kubernetes for MLOps. Nobody shows you how. MLOps on Kubernetes follows the same pattern as everything else. You solve one problem, and then the next. You have a model that works on your laptop. > You need to train it on real data at scale. >Your laptop has 16GB RAM, and the dataset is 200GB. >Training locally is not an option. So you run training as a Kubernetes Job. > A Job spins up a pod, runs training to completion, and terminates. > You get GPU nodes for training and release them when done. > You are not paying for idle GPU capacity. But training one model takes hours. > You need to run 50 experiments with different hyperparameters. > Running them one by one means waiting days for results. So you run parallel Jobs. > 50 pods are training simultaneously. > Each has different parameters. > Results come back in hours, not days. But now you have 50 trained models and no idea which one performed best. > You have no record of what parameters produced what result. > Next week, nobody remembers what worked. So you add experiment tracking. > MLflow running on Kubernetes. Every training job automatically logs parameters, metrics, and artifacts. > You always know which model came from which experiment. But your best model is sitting in an S3 bucket doing nothing. > It needs to serve predictions to your application. Spinning up a Flask app manually on an EC2 machine is neither repeatable nor scalable. So you deploy the model as a Kubernetes Deployment behind a Service. > Your model server runs as a container. > It scales with HPA when prediction requests increase. > It restarts automatically when it crashes. But your model gets stale. > Real-world data drifts from training data over time. > Predictions start degrading, and nobody notices until users complain. So you add monitoring. > Your model server emits prediction metrics to Prometheus. > Grafana dashboards show prediction distribution over time. > Data drift triggers an alert before accuracy degrades in production. But fixing drift means retraining. > Retraining manually means someone has to remember how to do it all. > Pull fresh data, run the job, evaluate the model, and deploy it. > That is four steps where humans make mistakes. So you build a pipeline. > Kubeflow Pipelines or Argo Workflows on Kubernetes. > Fresh data arrives, retraining triggers automatically. > New model evaluated against the old one, better model promoted to production, bad model gets discarded automatically. > Nobody touches it manually. That full loop is what MLOps on Kubernetes actually means.

English

3.9K

EchoFox@FoxEcho8·29 Mar

LOL!!

pH@pHequals7

this is what i think is going on inside anthropic whenever claude faces an outage and they nerf tf out of opus 4.6

QST

EchoFox retweetledi

SweetDee@RealUnsweetDee·22 Mar

Ummm…pretty sure you can tho…

doomer@uncledoomer

i dont know which of you needs to hear this, but you cant change the outcome of the situation by monitoring it

English

378

4.8K

70.9K

1.4M

EchoFox@FoxEcho8·22 Mar

@ConsciousRide Any resource?

English

12.1K

Akshay Shinde@ConsciousRide·22 Mar

As an AI engineer. Please learn: - Python (deeply - it is still king in 2026) - Core ML/DL (transformers, attention, backprop, optimization, loss functions) - Frameworks (PyTorch 2.x / JAX - pick one deeply; understand both eventually) - Model architectures (LLMs, diffusion, multimodal, MoE basics) - Fine-tuning & PEFT (LoRA/QLoRA, adapters, full fine-tune trade-offs) - Data pipelines (cleaning, augmentation, tokenization, dataloaders, streaming) - Evaluation (benchmarks, perplexity, BLEU/ROUGE/BERTScore, human eval, RAGAS) - Serving & inference (vLLM, TGI, TorchServe, ONNX, TensorRT, quantization) - Prompt engineering + RAG + agents + tool calling patterns - MLOps (tracking experiments, versioning models/data, monitoring drift)

SumitM@SumitM_X

As a backend engineer. Please learn: - System Design (scalability, microservices) -APIs (REST, GraphQL, gRPC) -Database Systems (SQL, NoSQL) -Distributed Systems (consistency, replication) -Caching (Redis, Memcached) -Security (OAuth2, JWT, encryption) -DevOps (CI/CD, Docker, Kubernetes) -Performance Optimization (profiling, load balancing) -Cloud Services (AWS, GCP, Azure) -Monitoring (Prometheus, Grafana) Pick up a language.. Stop jumping from one language to the other

English

243

2.2K

149.4K

EchoFox retweetledi