Erik

604 posts

Erik

@try__working

The Critical Importance of Kindness

Shanghai Tham gia Nisan 2013

493 Đang theo dõi150 Người theo dõi

Tweet ghim

Erik@try__working·15 Kas

ZXX

Erik@try__working·3h

Did anyone think that uh 300 meters of scrollable chat is not good UX and not good information storage?

English

Erik@try__working·4h

@binarybits @deanwball There's no existential risk. LLMs are not AI. The only danger is people misunderstanding the technology, hyping it up, and tricking people into using it for things it cannot do. Which is what you are doing.

English

Timothy B. Lee@binarybits·11h

This is the correct view of existential risk from AI, and I'm glad @deanwball sees the same connection to Hayek's thinking that I do.

English

236

14.9K

Erik@try__working·4h

@burkeholland stop prompting the model. use a workflow skill. github.com/doubleuuser/rl…

English

Burke Holland@burkeholland·8h

We cannot continue to let AI write code the way we currently are. It’s way too good at cutting corners. I think hooks in the Copilot CLI might be the answer here. And I think we may only need two of them. preToolUse - the gate: Code gets linted or the model doesn't get to save the file. Model doesn't get to commit until all tests pass. No exceptions. The AI literally cannot move forward without passing basic muster. postToolUse - the feedback loop: Run your type checker after every edit (tsc --noEmit, whatevs). The output goes straight back into the model's context. It sees the errors. It fixes them. Loop restarts and continues until code is actually correct. No rationalizing about why it’s ok for some dumb error to persist. The model cannot write a file, pass a type check, or commit without actually doing what was asked. It’s time we held models to the same basic standards we hold for ourselves.

English

5.4K

Erik@try__working·5h

@rabi_guha Brilliant stuff. I'll look into migrating to OpenUI.

English

Rabi Shanker Guha@rabi_guha·15h

since we launched 2 weeks ago: - shadcn support - HeroUI v3 support - React Native support - react-email support - Vercel AI SDK integration - Langgraph integration Bring your own Design System. Bring your own framework. OpenUI handles the rest

Rabi Shanker Guha@rabi_guha

We built a better alternative to json-render & A2UI Up to 3x faster rendering & 67% fewer tokens. Far fewer broken outputs. We've been shipping Generative UI interfaces to 10,000+ developers for the past year. We know where JSON-based rendering breaks in production. So we built a new format - one that works with how models actually generate, not against it. And today we're open sourcing it: openui [dot] com

English

4.4K

Erik@try__working·9h

@skywind3000 Write your own emails and meeting notes.

English

LIN WEI@skywind3000·17h

AI adoption：现在我写完邮件点 "Enhance with AI." 然后 AI 重写了我的邮件，把里面 "Can we meet Thursday?" 换成了 "I'd love to explore the possibility of finding a mutually convenient time to align on this." 这到底是提效了还是降效？我把改写结果截图发公司的 ai-wins 群收到了同事们一堆大拇指和火箭的称赞表情，但是我并没给他们看我更精炼的原稿，这就是我们公司的 adoption

Peter Girnus 🦅@gothburz

My company rolled out AI tools 11 months ago. Since then, every task I do takes longer. I am not allowed to say this out loud. Not because there is a policy. There is no policy. There is something worse than a policy. There is enthusiasm. There is a Slack channel called #ai-wins where people post screenshots of AI outputs with captions like "this just saved me an hour." There is a VP who opens every all-hands with "the companies that adopt fastest win." There is a Director who renamed his team from Operations to Intelligent Operations. There is a peer review question that now asks: "How have you leveraged AI tools to enhance your workflow this quarter?" If the answer is "I haven't, because I was faster before," that is a career decision. So I leverage. Emails. Before the tools, I wrote emails. This took the amount of time it takes to write an email. I did not measure it. Nobody measured it. The email got written and sent and it was fine. Now I write the email. Then I highlight the text and click "Enhance with AI." The AI rewrites my email. It replaces "Can we meet Thursday?" with "I'd love to explore the possibility of finding a mutually convenient time to align on this." I read the rewrite. I delete the rewrite. I send my original email. This takes 4 minutes instead of 2. The 2 extra minutes are the enhancement. I do this 11 times a day. That is 22 minutes I spend each day rejecting improvements to sentences that were already finished. In #ai-wins I posted a screenshot of the rewrite. I did not post the part where I deleted it. 23 people reacted with the rocket emoji. That is adoption. Meetings. We have an AI notetaker in every meeting now. It joins automatically. It records. It transcribes. It summarizes. After each meeting I receive a 3-paragraph summary of the meeting I just attended. I read the summary. This takes 3 minutes. I was in the meeting. I know what happened. I am reading a machine's account of something I experienced firsthand. Sometimes the account is wrong. Last Tuesday it attributed a comment about Q3 revenue to me. My manager made that comment. I spent 4 minutes correcting the transcript. Before the notetaker, I did not spend 7 minutes after each meeting correcting a robot's memory of something I personally witnessed. I attend 11 meetings a week. That is 77 minutes per week supervising a transcription nobody requested. I mentioned this once. My manager said "think about the people who weren't in the meeting." The people who weren't in the meeting do not read the summaries. I checked. The read receipts show single-digit opens. The summaries exist not because they are useful but because they are there. I read them for the same reason. Documents. I write a weekly status update. Before the tools, this took 10 minutes. I typed what happened. I sent it. My manager skimmed it. The system worked. Now I open the AI writing assistant. I give it my bullet points. It produces a draft. The draft says "Significant progress was achieved across multiple workstreams." I did not achieve significant progress across multiple workstreams. I updated a spreadsheet and sent 4 emails. I rewrite the draft to say what actually happened. Then I run my rewrite through the grammar tool. It suggests I change "done" to "completed" and "next week" to "in the forthcoming period." I click Ignore 9 times. Then I send the version I would have written in 10 minutes. The process now takes 30. I have been doing this every week for 11 months. I have added 20 minutes to a task that did not need 20 more minutes. I call this efficiency. I have been calling it efficiency for 11 months. That is what efficiency means now. It means the additional time you spend to arrive at the same outcome through a longer process. Nobody has questioned this definition. I have not offered it for review. I kept a log once. 2 weeks. Every task, timed. Before-AI and after-AI. The after number was larger in every case. Every single one. Not by a little. The range was 40 to 200 percent. I deleted the log. I deleted it because it was a document that said, in plain numbers, that the AI tools make me slower. And a document like that has no place in a company where AI adoption is a strategic priority. I could not send it to my manager. He championed the rollout. I could not post it in #ai-wins. I could not raise it in a meeting because the notetaker would transcribe it and the summary would read "[Name] expressed concerns about AI tool efficacy" and that summary would be the first one anyone actually reads. So I do what everyone does. I use the tools. I spend the extra time. I post in #ai-wins. I write "leveraged AI to streamline weekly reporting" in my review and my manager gives me a 4 out of 5 for innovation. I have innovated nothing. I have added steps to processes that were already finished. I have made simple things longer and labeled the difference with words that used to mean something. Every week in #ai-wins someone posts a screenshot. And 20 people react with the rocket emoji. And nobody posts the part where they deleted the output and did the task themselves. Nobody posts the revert. Nobody posts the before-and-after timer. Nobody will. Because "I was better at my job before the AI tools" is a sentence that cannot be said out loud in any company that has decided AI is the future. Every company has decided AI is the future. So we leverage. Quietly. Adding steps. Calling them optimization. Getting slightly less done, slightly more slowly, with slightly more steps, and reporting it as progress. My yearly review is next month. There is a new section this year. "AI Impact Assessment." It asks me to quantify the hours saved by AI tools per week. I will write a number. The number will be positive. It will not be true. But the AI writing assistant will help me phrase it convincingly. That is the one thing it does well.

中文

4.7K

Erik@try__working·10h

@geoffreylitt that's what recursive workflows are for: github.com/doubleuuser/rl…

English

Geoffrey Litt@geoffreylitt·16h

A missing tool for agentic refactoring: better diffs for moved code “Ok, looks like this new file is pretty much that old file… but did it randomly mutate on the way?”

English

2.8K

Erik@try__working·11h

@encoredotdev Can we get this for VSCode and a Cloudflare integration?

English

Encore@encoredotdev·19h

we flambeéd check out the interactive demo at encore.dev

English

252

Erik@try__working·11h

@techeconomyana I will make sure to never use Claude

English

高级分析师@techeconomyana·20h

定一个金标准：我不和没有用过Claude的人做爱。

中文

44K

Erik@try__working·20h

@realNyarime Phil Schiller? What an idiot. On stage talking about USB ports.

English

奶昔🥤@realNyarime·1d

面对Neo的接口库克是否会想起2012年那场遥远的WWDC

中文

227

70.5K

Erik@try__working·20h

@alreadydawn Absolutely terrible post. 1. Nobody ever wanted to go to Jinan. Of course it's empty. 2. Wrong. You have to wait 10, sometimes 40 minutes to get a Didi during commuting hours. 3. Coco Park has never been a hot mall

English

240

alreadydawn@alreadydawn·22h

I'm seeing more and more instances of demand not keeping up with supply in China: * Hotels are abundant, even at random cities like Jinan, where rates are amazingly low. You can get a decent place at below 20 usd per night. The ones I've stayed at so far seem to lack customers in general. The hotel I stayed in Yangzhou has a coffee/drinks bar that has been unmanned for quite some time and just collecting dust. * Lines of taxis waiting for riders outside of HSR stations. Some pointed out that this has been the case for years, but the number of potential riders does not match the sheer number of cars. Even at 3rd tier cities at night where few people are out and about the station, the taxi lines stretch on forever like the Great Wall. (I get that taxis around the world have been severely impacted by ride-hailing softwares like Uber and Didi, but these great wall-like taxi lines are nuts to witness.) * Malls outside of the hot ones in Tier 1 cities (ie Coco Park in Shenzhen) are quiet. The one I'm next to here in Qingdao by their Olympics sailing center is nearly deserted, with most storefronts closed. I'm sure there are many reasons for this supply/demand problem (as do most complex issues), but a big factor has to be involution (內卷), where the Chinese try to outcompete each other at the exact same thing, driving supply way up and prices down. This all reminds me of a story shared by @spandrell4 that goes: "When a Jewish businessman opens a gas station at a new town, other Jews come to the city to open various businesses - bakery, accounting firm, law firm, restaurant. When a Chinese businessman opens a gas station at a new town, other Chinese come to the city to open gas stations."

English

286

31.3K

Erik@try__working·21h

@kakashiii111 Lmao. Nvidia knew everything. Random companies buying hundreds of millions worth of GPUs, as if Nvidia didn't know what was going on.

English

Kakashii@kakashiii111·1d

It is interesting that all these smugglers' companies have billing locations in Singapore and headquarters in the US. Not sure if Nvidia knew, but they made accounting changes twice in less than a year that may not be a coincidence, and could be related to these sales. Maybe?

Kakashii@kakashiii111

New Day, New Smuggling Charge. This Time: Thailand. The DOJ just unsealed charges against three individuals, a Chinese national and two US citizens, for conspiring to smuggle export-controlled AI chips to China through Thailand. The alleged scheme involved ordering 750 servers worth approximately $170 million from a California-based hardware company, with 600 of those servers containing chips on the Commerce Control List. Thailand-based shell companies were used as the stated end buyers. The actual destination was China. If you follow my work, none of this should surprise you. Thailand is not new to this story. Southeast Asia as a transshipment corridor is not new to this story. Front companies signing false end-user certifications is not new to this story. I documented these routes, these mechanics, and the structural incentives behind them for years. This is yet another validation, in the form of a criminal indictment, of the research I have been publishing since late 2023. Smugglers were aware of the danger and the risks and tried to hide any reference to China, similar to how SMCI used hair dryers. In their own words: "We will draw attention from US government for embargo violation." More to follow.

English

4.6K

Erik@try__working·1d

@AsiaStock The reason is you try to launch in the Chinese market while you sit in the US and make strategic decisons based on zero knowledge of the market. Chinese firms fail at going international for the same reasons.

English

332

亚洲股市 AsiaStock@AsiaStock·2d

为什么西方公司难以打入中国市场？Uber 创始人 Travis Kalanick 将与中国企业竞争分为两个阶段。第一阶段：中国竞争对手会以极高效率复制 Uber。一个新功能上线一周，就已经被完整克隆。当你把“复制”做到极致时，最终会发现没有更多东西可以复制。第二阶段：中国公司开始转向本土创新，推出更符合本地市场和文化的功能。Kalanick 举了美团的例子，比如外卖取餐柜，这种模式并非简单复制，而是在中国市场中非常高效。在AI竞赛中，这又会如何演变？

中文

161

72.9K

Erik@try__working·1d

@lauriewired What do you mean, seems likely to be aimed at the server market? Their official release and documentation says it's a server product. You just have to look at the specs to understand it's a server product.

English

LaurieWired@lauriewired·2d

open questions (feel free to reply if you know): 1. I don’t understand how this can possibly have the RVA23.1 spec already. It’s only been out for ~6 months, yet alibaba seems to be claiming tapeout (“5nm verified for production"). It’s not a paper launch; *someone* has some real silicon otherwise they wouldn’t be able to claim SPECint2017 numbers…(unless it’s only simmed?) 2. TDP is up in the air atm. My guess is the core is likely to compete in the server-market 3. Undisclosed fab…but who else could it possibly be besides TSMC? We know it’s 5nm… 4. It seems to use their own additional “Matrix Multiply Extension” for AI inference. Alibaba is claiming the C950 has “naive support for Qwen3 and DeepSeek V3”…but no figures or toks/sec. I’m not sure how much they’ve improved past this, but you can find an open source version of their MME on github. But, it hasn’t been updated since december of 2024… github.com/XUANTIE-RV/ris… (also attaching the giant chart from David Huang that shows a ton of SPECint2017/GHz scores)

English

161

11.9K

LaurieWired@lauriewired·2d

Wow, RISC-V is really gaining traction again. Alibaba just announced the Xuantie C950…which is basically claiming Apple M1 (ish) levels of performance. I don’t see a lot of people talking about it! (2.6/Ghz SPECint2017, Apple M1 P-core also around ~2.6)

English

176

2.1K

96.9K

Erik@try__working·2d

@TheStalwart China makes the best titanium bikes today

English

Joe Weisenthal@TheStalwart·2d

When I see quotes like this, the main thought I have is that there is a deep continuity in Chinese thinking about industry, this obsession with building out capacity and establishing self-reliance, and also that the gap between Mao- and post-Mao-era is overstated.

Joe Weisenthal@TheStalwart

chinatalk.media/p/the-story-of…

English

266

27.9K

Erik@try__working·2d

@adcock_brett We don't need new interfaces. Voice is a nice to have. We need new backends.

English

Brett Adcock@adcock_brett·2d

Today I'm excited to introduce Hark, a new artificial intelligence lab building the most advanced, personal intelligence in the world We've been in stealth for 8 months, assembling one of the greatest AI and hardware teams on the planet I want to explain why I started Hark and what we're focused on I've spent the last 3 years working on the hardest AI challenge imaginable: giving AI a humanoid body. On the digital side, I've been using all the existing LLM chatbots - and I have to say, they feel incredibly dumb to me AGI, in the limit, should feel like a sci-fi movie. It should be able to listen and talk. It should have persistent memory and be highly personalized. It should see and touch the world. But we're far from this today We are crafting a new interface to AGI. Intelligence that lets you offload your mental workload into a system that begins to think like you and sometimes ahead of you We started Hark with one goal: build the world's most advanced personal intelligence - paired with next-generation hardware designed to serve as a universal interface between humans and machines hark.com

English

433

478

4.6K

1.3M

Erik@try__working·2d

@DeeperThrill Your post-planning workflow can be improved significantly if you use a skill like rlm-workflow. You shouldn't need to ask for reviews manually. github.com/doubleuuser/rl…

English

Deep Thrill@DeeperThrill·3d

My coding workflow is currently: 1. Spend about ten minutes writing a plan for a new feature in a markdown file as bullet points. 2. Spend about an hour with a bunch of back forths between Opus 4.6 and Codex 5.4 xhigh, asking each one to improve the markdown plan file. 3. Read the final markdown plan, which ends up as a bunch of bullet points, includes implementation details, and is usually between 200-500 lines long. 4. Ask either Claude Code or Codex to implement it. 5. Ask the other one to review the implementation. 6. Run my /deslop command on the implementation (see my pinned tweet). 7. Deploy, test, and ask an agent to fix any bugs.

English

774

84.1K

Erik@try__working·2d

@chatsidhartha @CloudflareDev Cloudflare is new AWS and AWS is dead?

English

Sid@chatsidhartha·2d

Durable Objects are the greatest invention of our time and I guarantee you, @CloudflareDev will be the default compute provider to run agents everywhere very soon Nothing beats the speed, scale, isolation and intuitiveness of this platform

English

8.1K

Erik@try__working·2d

@fayazara wicked

English

195

Fayaz Ahmed@fayazara·2d

Here's me spinning up multiple dynamic workers on the fly in milliseconds. they're all isolated btw

Kenton Varda@KentonVarda

Dynamic Workers are now in Open Beta, all paid Workers users have access. Secure sandboxes that start ~100x faster than a container and use 1/10 the memory, so you can start one up on-demand to handle one AI chat message and then throw it away. Agents should interact with the world by writing code, not tool calls. This makes that possible at "consumer scale", where millions of end users each have their own agent writing code. blog.cloudflare.com/dynamic-worker…

English

452

84.3K

Erik@try__working·3d

@teortaxesTex GLM 5 performs similarly to GPT 5.2 (No thinking) so it's actually not such a bad showing. Kimi climbing too.

English

311

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex·3d

what's worse, the best Chinese model is quite far below Gork 4.20 (best Western models are at 0.79 now btw) WeirdML and CritPt are some of the best proxies of progress towards actual recursive self-improvement, building automated research scientists.

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) tweet media

Josh@JoshPurtell

@teortaxesTex Best Chinese model is neck and neck with gpt-5.4-nano lmoa

English

176

22.8K

Erik@try__working·3d

@whyyoutouzhele China's technological advantage and speed of development does not come from 996, but from the last 4 decades work on building various types of infrastructure. If China stopped 996 culture, it would be more innovative, more efficient and could launch better products, faster.

English

Erik@try__working·3d

@whyyoutouzhele I have more than a decade's experience working in China tech. 95% of 996 is performative; fake work. And nobody starts at 9, they start at 11. 6-7:30 people have dinner and slack off. It creates a very inefficient work environment, where there is no planning and lots of rework.

English

533

李老师不是你老师@whyyoutouzhele·4d

“中国没有生活，只有工作” 3月22日，网友转发一位外国企业家评价中国“996”：早9晚9、一周6天，“只有工作没有生活”。他称无意批评，但强调应平衡工作与生活，希望企业让员工真正发展。视频发出后迅速引发网友们的共鸣，评论区甚至有网友表示“996”算好的了，我们“777”早7到晚7，有事只能请假。还有的网友表示，“996”算好的了，之前在广州进厂都是“897"(早上八点到晚上九点，一周上七天)。

中文

107

113

1.1K

157.1K

Khám phá

@binarybits @deanwball @burkeholland @rabi_guha @skywind3000 @geoffreylitt @encoredotdev @techeconomyana