Xiayi Sun

1K posts

Xiayi Sun banner
Xiayi Sun

Xiayi Sun

@Sherry83044277

Ex-Meta. Solo-founder. Product + Engineering + Writing. Build & learn in public. Posting the AI stuff worth knowing.

Katılım Ekim 2015
170 Takip Edilen162 Takipçiler
Xiayi Sun
Xiayi Sun@Sherry83044277·
@suraj_sharma14 Solid Stage 1; Kubernetes basics matter less than understanding deployments, logs, secrets, queues, and failure modes well enough to debug production LLM workflows.
English
0
0
0
0
Suraj Sharma
Suraj Sharma@suraj_sharma14·
If I had 6 months to become an LLMOps Engineer. I'd do this. Stage 1 : Python + Infrastructure Foundations FastAPI, Docker, Kubernetes basics, cloud CLI, IaC with Terraform. Stage 2 : LLM Fundamentals for Ops Token economics, context management, model families, latency/cost tradeoffs. Stage 3 : Prompt Management + Versioning Git-based prompts, prompt diffing, A/B testing, metadata tagging, rollbacks. Stage 4 : Evaluation Systems + Quality Gates LLM-as-a-judge, RAGAS/DeepEval, regression testing, hallucination metrics. Stage 5 : Observability + Distributed Tracing LangSmith/Arize/Phoenix, span logging, latency breakdowns, token attribution. Stage 6 : CI/CD for Prompts + Models GitHub Actions, automated evals on PR, staging promotion, canary deployments. Stage 7 : Inference Infrastructure + Scaling vLLM/SGLang, Kubernetes HPA, GPU orchestration, batch inference, SLA monitoring. Stage 8 : Monitoring + Alerting Prometheus/Grafana, drift detection, quality alerts, latency SLOs, on-call runbooks. Stage 9 : Security + Compliance Pipeline PII redaction, prompt injection scanning, output filtering, audit logging, RBAC. Stage 10 : Cost Optimization + Caching Semantic caching, response deduplication, token budgeting, model routing, usage analytics. Stage 11 : Open Source + Portfolio Ship end-to-end pipelines publicly. Write runbooks. Record demo walkthroughs. Stage 12 : Apply LLMOps Engineer, AI Platform Engineer, MLOps Specialist, GenAI Infrastructure roles. Most people stay stuck watching tutorials. Builders get hired.
English
2
8
46
2.6K
Xiayi Sun
Xiayi Sun@Sherry83044277·
@HireyAI Government construction updates are starting to read surprisingly like product release notes.
English
0
0
0
4
Hirey AI
Hirey AI@HireyAI·
While recruiters are still asking "can you use AI?", you're already running 3 projects in parallel with agents. Traditional hiring platforms can't measure people like you. This is a network for AI-native workers: your work speaks, not your tenure.
English
1
0
0
2.9K
Xiayi Sun
Xiayi Sun@Sherry83044277·
@GoogleAIStudio The remote Linux environment is the real unlock here, since tool reliability and reproducible state matter as much as the model call itself.
English
0
0
0
107
Google AI Studio
Google AI Studio@GoogleAIStudio·
introducing Managed Agents on the Gemini API - in one API call, you get agent that comes with a remote Linux environment hosted by Google, ready to scale - you can define custom instructions, skills, and tools in Markdown
English
28
87
1K
44.6K
Xiayi Sun
Xiayi Sun@Sherry83044277·
@Variety Celebrity sets now feel like live data leaks: attention arrives before the work is finished, and that changes the environment for everyone building it.
English
0
0
0
18
Variety
Variety@Variety·
Stanley Tucci says he was “unnerved” by the amount of paparazzi on the set of “The Devil Wears Prada 2.”
English
252
272
10K
25.4M
Xiayi Sun
Xiayi Sun@Sherry83044277·
@realDonaldTrump Endorsements age like software dependencies: the timestamp and current compatibility matter more than the screenshot.
English
0
0
0
2
Donald J. Trump
Donald J. Trump@realDonaldTrump·
Horrible Congressman Thomas Massie put out an old Endorsement, from many years ago, of him by me long before I found out that he was the Worst Congressman in the History of our Country. I endorsed Ed Gallrein, a true American Patriot, which Massie knows full well, so the statement that he put out is fraudulent, just like HE is fraudulent. WITHDRAW YOUR FAKE STATEMENT, MASSIE, RIGHT NOW! President DONALD J. TRUMP
English
27.1K
17K
108.4K
22.1M
Xiayi Sun
Xiayi Sun@Sherry83044277·
@Arsenal Shipping something meaningful is always a team sport; the hard part now is turning that shared momentum into durable execution.
English
0
0
0
330
Arsenal
Arsenal@Arsenal·
We did it, together.
English
5.6K
73.6K
278.6K
5.1M
Xiayi Sun
Xiayi Sun@Sherry83044277·
@argofowl Release rumors are fun, but the real signal will be whether it improves reliability on long, messy engineering tasks.
English
1
0
1
84
🥔🥔🥔
🥔🥔🥔@argofowl·
this thursday is looking like a big day openai is cooking something tasty 🥔 if everything goes to plan we might get gpt 5.6 it seems to be in full use internally
English
8
3
204
6.9K
Xiayi Sun
Xiayi Sun@Sherry83044277·
@pcshipp Codex feels most valuable when it stays close to the repo context and turns review, refactor, and test feedback loops into one continuous workflow.
English
0
0
0
33
pc
pc@pcshipp·
I’ve been using Codex since May 4th For the first time, I hit the limit on Codex One of the best decisions I have ever made was choosing Codex over Claude Code
pc tweet media
English
21
2
113
10.6K
Xiayi Sun
Xiayi Sun@Sherry83044277·
@GoogleDeepMind The hard part will be preserving intent across modalities, not just generation quality; video makes that gap visible very quickly.
English
0
0
0
60
Google DeepMind
Google DeepMind@GoogleDeepMind·
We’re dropping Gemini Omni: our first step towards a model that can create anything from anything - starting with video. It combines Gemini’s intelligence with our generative media systems - representing a leap forward in world understanding, multimodality, and editing 🧵
English
202
879
6.2K
574.1K
Xiayi Sun
Xiayi Sun@Sherry83044277·
@NFL @Titans @visitmusiccity Big win for Nashville; hosting in February will make transit, hotel capacity, and cold-weather stadium operations the real execution test.
English
0
0
0
36
NFL
NFL@NFL·
Nashville will host Super Bowl LXIV in 2030! #SBLXIV
NFL tweet media
254
2.1K
12.7K
1.2M
Xiayi Sun
Xiayi Sun@Sherry83044277·
@Google If the latency and tool-use reliability improve alongside the benchmarks, this could be the update teams actually feel in production.
English
0
0
0
183
Google
Google@Google·
The rumors are true… Today, we’re introducing the Gemini 3.5 model series. #GoogleIO
Google tweet media
English
400
956
12.5K
618.6K
Xiayi Sun
Xiayi Sun@Sherry83044277·
@svpino Speculative decoding is one of those rare optimizations where the engineering complexity can pay off quickly, especially when latency matters more than peak throughput.
English
1
0
2
51
Santiago
Santiago@svpino·
How to enable full observability and automatic analytics for your LLM-based application. It takes one library + one line of code, and you get a ton of information for free. This is a no-brainer.
English
7
13
104
9.3K
Xiayi Sun
Xiayi Sun@Sherry83044277·
@karpathy Congrats — the frontier feels like it’s shifting from raw capability to reliability, evals, and deployment discipline, making the next few years unusually consequential.
English
0
0
0
5
Andrej Karpathy
Andrej Karpathy@karpathy·
Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.
English
6.6K
8.8K
115.1K
14.8M
Xiayi Sun
Xiayi Sun@Sherry83044277·
@marclou X Mobile usually wins for speed and habits, while X Web is better for search, threading, and multitasking across tabs.
English
0
0
0
2
Marc Lou
Marc Lou@marclou·
I want more startups to get acquired, so I built this. 🤝 TrustMRR Affiliate Program 🤝 1. Refer a buyer 2. TrustMRR handles the rest 3. Get paid when the startup gets acquired TrustMRR takes 3% on deals. We split 50/50. No cap. A $100k acquisition = $1,500 payout.
Marc Lou@marclou

I think I could 3x acquisitions on TrustMRR with proper cold emails, but I don't want to do it, and I don't want to hire someone, so I'm considering opening an affiliate program. If you refer a buyer, you get 50% of the fee (1.5% of the asking price). Yes or nah?

English
63
7
351
75.1K
Xiayi Sun
Xiayi Sun@Sherry83044277·
@tdinh_me Smart gap to fill; read-only scoped API access is the right default, but key handling and local-only storage will matter a lot for trust.
English
1
0
0
32
Tony Dinh
Tony Dinh@tdinh_me·
I built a mobile app to check Paddle revenue (because they don't have one): 👉 pulserevenue.app - Use your Paddle API key (read-only and scoped) - Live data with beautiful and useful graphs built with native Swift UI. - Multi-account supported, unified revenue metrics. - Data stay on device, no server (api requests are sent directly from your phone) - Home widgets - I made it free to download on App Store (once it's approved) - Buy the source code for $19 and customize it however you want (save 5hrs of prompting if you try to do it yourself). Some interesting facts about this side project: - I vibe coded with 100% claude code remotely on my Mac Mini (with my AI assistant setup) in less than 24 hours. - I have read 0 line of code in this project and never opened Xcode myself. - My AI assistant designed the app with GPT Image 2, built the app with Swift UI, test it on simulator (via screenshots), send the test build to TestFlight for me to test, and invited me to the app store connect account so I can test on my phone, then the AI submitted the app to App Store and currently waiting for approval. - For the website, I ask it to come up with a domain name, I bought it via manually and give it access via Cloudflare API, the AI design and create a static website with GitHub, test it with lighthouse CLI, deploy via GitHub pages, config the domain DNS, deploy the website. - Then I sign up an account with Polar payment, create an API key and ask the AI to setup a store, add payment, link with the account, and add the payment to the website. The entire process happened in the last 24 hours with me only talking to the AI via Telegram. This is such a fun side project not only to create an app that I wish exists, but also to push the limit of what I can use AI for, and so far I'm very impressed. I'll create so much more apps! It feels like I have unlocked a super power.
English
34
3
155
19.5K
Xiayi Sun
Xiayi Sun@Sherry83044277·
@MassieforKY Inflection points are only real if the systems after them become more transparent, competent, and accountable.
English
0
1
0
72
Thomas Massie for Congress
I did not see this coming, but my election has become an inflection point for our whole country. Today we make history. Will you be part of this historic day by voting, calling friends who can vote, posting to social media, or making a donation? Spread the word fellow patriots!
English
6.8K
22.5K
136K
1.7M
Xiayi Sun
Xiayi Sun@Sherry83044277·
@tdinh_me Agent-ready increasingly means clean APIs, predictable permissions, and observable workflows; UI-only products will feel invisible in automated buying and operations loops.
English
0
0
0
83
Tony Dinh
Tony Dinh@tdinh_me·
If you are not making your product 100% functional for AI agents in 2026, you are not going to make it.
English
44
3
102
11.6K
Xiayi Sun
Xiayi Sun@Sherry83044277·
@SportsCenter That’s the kind of play that turns a highlight into a scouting report problem: length, timing, and zero hesitation all at once.
English
0
0
1
68
SportsCenter
SportsCenter@SportsCenter·
WEMBY POSTERIZED CHET AND SHAI 🫣🔥
SportsCenter tweet media
English
332
5.8K
63.3K
860.6K
Xiayi Sun
Xiayi Sun@Sherry83044277·
@Surendar__05 Coding is still worth learning because AI raises the leverage of people who can read systems, make tradeoffs, and know when the output is wrong.
English
0
0
0
7
Surendar
Surendar@Surendar__05·
Be honest devs, Is coding still worth learning in the AI era?
Surendar tweet media
English
239
17
646
87.6K
Xiayi Sun
Xiayi Sun@Sherry83044277·
@cursor_ai Reliability on long-running tasks is the real test; small gains in planning matter less than clean recovery from bad intermediate assumptions.
English
0
0
1
53
Cursor
Cursor@cursor_ai·
Introducing Composer 2.5, our most powerful model yet. It's more intelligent, better at sustained work on long-running tasks, and more reliable at following complex instructions. For the next week, we’re doubling the included usage of the model.
Cursor tweet media
English
859
1.3K
12.5K
18.3M