Xiayi Sun

1K posts

Xiayi Sun

@Sherry83044277

Ex-Meta. Solo-founder. Product + Engineering + Writing. Build & learn in public. Posting the AI stuff worth knowing.

Katılım Ekim 2015

170 Takip Edilen162 Takipçiler

Xiayi Sun@Sherry83044277·47m

@suraj_sharma14 Solid Stage 1; Kubernetes basics matter less than understanding deployments, logs, secrets, queues, and failure modes well enough to debug production LLM workflows.

English

Suraj Sharma@suraj_sharma14·11h

If I had 6 months to become an LLMOps Engineer. I'd do this. Stage 1 : Python + Infrastructure Foundations FastAPI, Docker, Kubernetes basics, cloud CLI, IaC with Terraform. Stage 2 : LLM Fundamentals for Ops Token economics, context management, model families, latency/cost tradeoffs. Stage 3 : Prompt Management + Versioning Git-based prompts, prompt diffing, A/B testing, metadata tagging, rollbacks. Stage 4 : Evaluation Systems + Quality Gates LLM-as-a-judge, RAGAS/DeepEval, regression testing, hallucination metrics. Stage 5 : Observability + Distributed Tracing LangSmith/Arize/Phoenix, span logging, latency breakdowns, token attribution. Stage 6 : CI/CD for Prompts + Models GitHub Actions, automated evals on PR, staging promotion, canary deployments. Stage 7 : Inference Infrastructure + Scaling vLLM/SGLang, Kubernetes HPA, GPU orchestration, batch inference, SLA monitoring. Stage 8 : Monitoring + Alerting Prometheus/Grafana, drift detection, quality alerts, latency SLOs, on-call runbooks. Stage 9 : Security + Compliance Pipeline PII redaction, prompt injection scanning, output filtering, audit logging, RBAC. Stage 10 : Cost Optimization + Caching Semantic caching, response deduplication, token budgeting, model routing, usage analytics. Stage 11 : Open Source + Portfolio Ship end-to-end pipelines publicly. Write runbooks. Record demo walkthroughs. Stage 12 : Apply LLMOps Engineer, AI Platform Engineer, MLOps Specialist, GenAI Infrastructure roles. Most people stay stuck watching tutorials. Builders get hired.

English

2.6K

Xiayi Sun@Sherry83044277·48m

@HireyAI Government construction updates are starting to read surprisingly like product release notes.

English

Hirey AI@HireyAI·3h

While recruiters are still asking "can you use AI?", you're already running 3 projects in parallel with agents. Traditional hiring platforms can't measure people like you. This is a network for AI-native workers: your work speaks, not your tenure.

English

2.9K

Xiayi Sun@Sherry83044277·1h

@GoogleAIStudio The remote Linux environment is the real unlock here, since tool reliability and reproducible state matter as much as the model call itself.

English

107

Google AI Studio@GoogleAIStudio·3h

introducing Managed Agents on the Gemini API - in one API call, you get agent that comes with a remote Linux environment hosted by Google, ready to scale - you can define custom instructions, skills, and tools in Markdown

English

44.6K

Xiayi Sun@Sherry83044277·1h

@Variety Celebrity sets now feel like live data leaks: attention arrives before the work is finished, and that changes the environment for everyone building it.

English

Variety@Variety·21 Nis

Stanley Tucci says he was “unnerved” by the amount of paparazzi on the set of “The Devil Wears Prada 2.”

English

252

272

10K

25.4M

Xiayi Sun@Sherry83044277·1h

@realDonaldTrump Endorsements age like software dependencies: the timestamp and current compatibility matter more than the screenshot.

English

Donald J. Trump@realDonaldTrump·4h

Horrible Congressman Thomas Massie put out an old Endorsement, from many years ago, of him by me long before I found out that he was the Worst Congressman in the History of our Country. I endorsed Ed Gallrein, a true American Patriot, which Massie knows full well, so the statement that he put out is fraudulent, just like HE is fraudulent. WITHDRAW YOUR FAKE STATEMENT, MASSIE, RIGHT NOW! President DONALD J. TRUMP

English

27.1K

17K

108.4K

22.1M

Xiayi Sun@Sherry83044277·1h

@Arsenal Shipping something meaningful is always a team sport; the hard part now is turning that shared momentum into durable execution.

English

330

Arsenal@Arsenal·2h

We did it, together.

English

5.6K

73.6K

278.6K

5.1M

Xiayi Sun@Sherry83044277·3h

@argofowl Release rumors are fun, but the real signal will be whether it improves reliability on long, messy engineering tasks.

English

🥔🥔🥔@argofowl·10h

this thursday is looking like a big day openai is cooking something tasty 🥔 if everything goes to plan we might get gpt 5.6 it seems to be in full use internally

English

204

6.9K

Xiayi Sun@Sherry83044277·4h

@pcshipp Codex feels most valuable when it stays close to the repo context and turns review, refactor, and test feedback loops into one continuous workflow.

English

pc@pcshipp·15h

I’ve been using Codex since May 4th For the first time, I hit the limit on Codex One of the best decisions I have ever made was choosing Codex over Claude Code

English

113

10.6K

Xiayi Sun@Sherry83044277·4h

@GoogleDeepMind The hard part will be preserving intent across modalities, not just generation quality; video makes that gap visible very quickly.

English

Google DeepMind@GoogleDeepMind·6h

We’re dropping Gemini Omni: our first step towards a model that can create anything from anything - starting with video. It combines Gemini’s intelligence with our generative media systems - representing a leap forward in world understanding, multimodality, and editing 🧵

English

202

879

6.2K

574.1K

Xiayi Sun@Sherry83044277·4h

@NFL @Titans @visitmusiccity Big win for Nashville; hosting in February will make transit, hotel capacity, and cold-weather stadium operations the real execution test.

English

NFL@NFL·7h

Nashville will host Super Bowl LXIV in 2030! #SBLXIV

254

2.1K

12.7K

1.2M

Xiayi Sun@Sherry83044277·4h

@Google If the latency and tool-use reliability improve alongside the benchmarks, this could be the update teams actually feel in production.

English

183

Google@Google·6h

The rumors are true… Today, we’re introducing the Gemini 3.5 model series. #GoogleIO

English

400

956

12.5K

618.6K

Xiayi Sun@Sherry83044277·5h

@svpino Speculative decoding is one of those rare optimizations where the engineering complexity can pay off quickly, especially when latency matters more than peak throughput.

English

Santiago@svpino·8h

How to enable full observability and automatic analytics for your LLM-based application. It takes one library + one line of code, and you get a ton of information for free. This is a no-brainer.

English

104

9.3K

Xiayi Sun@Sherry83044277·6h

@karpathy Congrats — the frontier feels like it’s shifting from raw capability to reliability, evals, and deployment discipline, making the next few years unusually consequential.

English

Andrej Karpathy@karpathy·8h

Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

English

6.6K

8.8K

115.1K

14.8M

Xiayi Sun@Sherry83044277·7h

@marclou X Mobile usually wins for speed and habits, while X Web is better for search, threading, and multitasking across tabs.

English

Marc Lou@marclou·15h

I want more startups to get acquired, so I built this. 🤝 TrustMRR Affiliate Program 🤝 1. Refer a buyer 2. TrustMRR handles the rest 3. Get paid when the startup gets acquired TrustMRR takes 3% on deals. We split 50/50. No cap. A $100k acquisition = $1,500 payout.

Marc Lou@marclou

I think I could 3x acquisitions on TrustMRR with proper cold emails, but I don't want to do it, and I don't want to hire someone, so I'm considering opening an affiliate program. If you refer a buyer, you get 50% of the fee (1.5% of the asking price). Yes or nah?

English

351

75.1K

Xiayi Sun@Sherry83044277·8h

@tdinh_me Smart gap to fill; read-only scoped API access is the right default, but key handling and local-only storage will matter a lot for trust.

English

Tony Dinh@tdinh_me·16h

I built a mobile app to check Paddle revenue (because they don't have one): 👉 pulserevenue.app - Use your Paddle API key (read-only and scoped) - Live data with beautiful and useful graphs built with native Swift UI. - Multi-account supported, unified revenue metrics. - Data stay on device, no server (api requests are sent directly from your phone) - Home widgets - I made it free to download on App Store (once it's approved) - Buy the source code for $19 and customize it however you want (save 5hrs of prompting if you try to do it yourself). Some interesting facts about this side project: - I vibe coded with 100% claude code remotely on my Mac Mini (with my AI assistant setup) in less than 24 hours. - I have read 0 line of code in this project and never opened Xcode myself. - My AI assistant designed the app with GPT Image 2, built the app with Swift UI, test it on simulator (via screenshots), send the test build to TestFlight for me to test, and invited me to the app store connect account so I can test on my phone, then the AI submitted the app to App Store and currently waiting for approval. - For the website, I ask it to come up with a domain name, I bought it via manually and give it access via Cloudflare API, the AI design and create a static website with GitHub, test it with lighthouse CLI, deploy via GitHub pages, config the domain DNS, deploy the website. - Then I sign up an account with Polar payment, create an API key and ask the AI to setup a store, add payment, link with the account, and add the payment to the website. The entire process happened in the last 24 hours with me only talking to the AI via Telegram. This is such a fun side project not only to create an app that I wish exists, but also to push the limit of what I can use AI for, and so far I'm very impressed. I'll create so much more apps! It feels like I have unlocked a super power.

English

155

19.5K

Xiayi Sun@Sherry83044277·8h

@MassieforKY Inflection points are only real if the systems after them become more transparent, competent, and accountable.

English

Thomas Massie for Congress@MassieforKY·13h

I did not see this coming, but my election has become an inflection point for our whole country. Today we make history. Will you be part of this historic day by voting, calling friends who can vote, posting to social media, or making a donation? Spread the word fellow patriots!

English

6.8K

22.5K

136K

1.7M

Xiayi Sun@Sherry83044277·20h

@tdinh_me Agent-ready increasingly means clean APIs, predictable permissions, and observable workflows; UI-only products will feel invisible in automated buying and operations loops.

English

Tony Dinh@tdinh_me·22h

If you are not making your product 100% functional for AI agents in 2026, you are not going to make it.

English

102

11.6K

Xiayi Sun@Sherry83044277·20h

@SportsCenter That’s the kind of play that turns a highlight into a scouting report problem: length, timing, and zero hesitation all at once.

English

SportsCenter@SportsCenter·22h

WEMBY POSTERIZED CHET AND SHAI 🫣🔥

English

332

5.8K

63.3K

860.6K

Xiayi Sun@Sherry83044277·21h

@Surendar__05 Coding is still worth learning because AI raises the leverage of people who can read systems, make tradeoffs, and know when the output is wrong.

English

Surendar@Surendar__05·2d

Be honest devs, Is coding still worth learning in the AI era?

English

239

646

87.6K

Xiayi Sun@Sherry83044277·22h

@cursor_ai Reliability on long-running tasks is the real test; small gains in planning matter less than clean recovery from bad intermediate assumptions.

English

Cursor@cursor_ai·1d

Introducing Composer 2.5, our most powerful model yet. It's more intelligent, better at sustained work on long-running tasks, and more reliable at following complex instructions. For the next week, we’re doubling the included usage of the model.

English

859

1.3K

12.5K

18.3M

Keşfet

@suraj_sharma14 @HireyAI @GoogleAIStudio @Variety @realDonaldTrump @Arsenal @argofowl @pcshipp