Erik

604 posts

Erik banner
Erik

Erik

@try__working

The Critical Importance of Kindness

Shanghai Tham gia Nisan 2013
493 Đang theo dõi150 Người theo dõi
Tweet ghim
Erik
Erik@try__working·
Erik tweet media
ZXX
0
0
1
0
Erik
Erik@try__working·
Did anyone think that uh 300 meters of scrollable chat is not good UX and not good information storage?
English
0
0
0
2
Erik
Erik@try__working·
@binarybits @deanwball There's no existential risk. LLMs are not AI. The only danger is people misunderstanding the technology, hyping it up, and tricking people into using it for things it cannot do. Which is what you are doing.
English
0
0
0
13
Timothy B. Lee
Timothy B. Lee@binarybits·
This is the correct view of existential risk from AI, and I'm glad @deanwball sees the same connection to Hayek's thinking that I do.
Timothy B. Lee tweet media
English
17
27
236
14.9K
Burke Holland
Burke Holland@burkeholland·
We cannot continue to let AI write code the way we currently are. It’s way too good at cutting corners. I think hooks in the Copilot CLI might be the answer here. And I think we may only need two of them. preToolUse - the gate: Code gets linted or the model doesn't get to save the file. Model doesn't get to commit until all tests pass. No exceptions. The AI literally cannot move forward without passing basic muster. postToolUse - the feedback loop: Run your type checker after every edit (tsc --noEmit, whatevs). The output goes straight back into the model's context. It sees the errors. It fixes them. Loop restarts and continues until code is actually correct. No rationalizing about why it’s ok for some dumb error to persist. The model cannot write a file, pass a type check, or commit without actually doing what was asked. It’s time we held models to the same basic standards we hold for ourselves.
English
30
1
88
5.4K
Erik
Erik@try__working·
@rabi_guha Brilliant stuff. I'll look into migrating to OpenUI.
English
0
0
2
39
Erik
Erik@try__working·
@skywind3000 Write your own emails and meeting notes.
English
0
0
0
71
LIN WEI
LIN WEI@skywind3000·
AI adoption:现在我写完邮件点 "Enhance with AI." 然后 AI 重写了我的邮件,把里面 "Can we meet Thursday?" 换成了 "I'd love to explore the possibility of finding a mutually convenient time to align on this." 这到底是提效了还是降效?我把改写结果截图发公司的 ai-wins 群收到了同事们一堆大拇指和火箭的称赞表情,但是我并没给他们看我更精炼的原稿,这就是我们公司的 adoption
Peter Girnus 🦅@gothburz

My company rolled out AI tools 11 months ago. Since then, every task I do takes longer. I am not allowed to say this out loud. Not because there is a policy. There is no policy. There is something worse than a policy. There is enthusiasm. There is a Slack channel called #ai-wins where people post screenshots of AI outputs with captions like "this just saved me an hour." There is a VP who opens every all-hands with "the companies that adopt fastest win." There is a Director who renamed his team from Operations to Intelligent Operations. There is a peer review question that now asks: "How have you leveraged AI tools to enhance your workflow this quarter?" If the answer is "I haven't, because I was faster before," that is a career decision. So I leverage. Emails. Before the tools, I wrote emails. This took the amount of time it takes to write an email. I did not measure it. Nobody measured it. The email got written and sent and it was fine. Now I write the email. Then I highlight the text and click "Enhance with AI." The AI rewrites my email. It replaces "Can we meet Thursday?" with "I'd love to explore the possibility of finding a mutually convenient time to align on this." I read the rewrite. I delete the rewrite. I send my original email. This takes 4 minutes instead of 2. The 2 extra minutes are the enhancement. I do this 11 times a day. That is 22 minutes I spend each day rejecting improvements to sentences that were already finished. In #ai-wins I posted a screenshot of the rewrite. I did not post the part where I deleted it. 23 people reacted with the rocket emoji. That is adoption. Meetings. We have an AI notetaker in every meeting now. It joins automatically. It records. It transcribes. It summarizes. After each meeting I receive a 3-paragraph summary of the meeting I just attended. I read the summary. This takes 3 minutes. I was in the meeting. I know what happened. I am reading a machine's account of something I experienced firsthand. Sometimes the account is wrong. Last Tuesday it attributed a comment about Q3 revenue to me. My manager made that comment. I spent 4 minutes correcting the transcript. Before the notetaker, I did not spend 7 minutes after each meeting correcting a robot's memory of something I personally witnessed. I attend 11 meetings a week. That is 77 minutes per week supervising a transcription nobody requested. I mentioned this once. My manager said "think about the people who weren't in the meeting." The people who weren't in the meeting do not read the summaries. I checked. The read receipts show single-digit opens. The summaries exist not because they are useful but because they are there. I read them for the same reason. Documents. I write a weekly status update. Before the tools, this took 10 minutes. I typed what happened. I sent it. My manager skimmed it. The system worked. Now I open the AI writing assistant. I give it my bullet points. It produces a draft. The draft says "Significant progress was achieved across multiple workstreams." I did not achieve significant progress across multiple workstreams. I updated a spreadsheet and sent 4 emails. I rewrite the draft to say what actually happened. Then I run my rewrite through the grammar tool. It suggests I change "done" to "completed" and "next week" to "in the forthcoming period." I click Ignore 9 times. Then I send the version I would have written in 10 minutes. The process now takes 30. I have been doing this every week for 11 months. I have added 20 minutes to a task that did not need 20 more minutes. I call this efficiency. I have been calling it efficiency for 11 months. That is what efficiency means now. It means the additional time you spend to arrive at the same outcome through a longer process. Nobody has questioned this definition. I have not offered it for review. I kept a log once. 2 weeks. Every task, timed. Before-AI and after-AI. The after number was larger in every case. Every single one. Not by a little. The range was 40 to 200 percent. I deleted the log. I deleted it because it was a document that said, in plain numbers, that the AI tools make me slower. And a document like that has no place in a company where AI adoption is a strategic priority. I could not send it to my manager. He championed the rollout. I could not post it in #ai-wins. I could not raise it in a meeting because the notetaker would transcribe it and the summary would read "[Name] expressed concerns about AI tool efficacy" and that summary would be the first one anyone actually reads. So I do what everyone does. I use the tools. I spend the extra time. I post in #ai-wins. I write "leveraged AI to streamline weekly reporting" in my review and my manager gives me a 4 out of 5 for innovation. I have innovated nothing. I have added steps to processes that were already finished. I have made simple things longer and labeled the difference with words that used to mean something. Every week in #ai-wins someone posts a screenshot. And 20 people react with the rocket emoji. And nobody posts the part where they deleted the output and did the task themselves. Nobody posts the revert. Nobody posts the before-and-after timer. Nobody will. Because "I was better at my job before the AI tools" is a sentence that cannot be said out loud in any company that has decided AI is the future. Every company has decided AI is the future. So we leverage. Quietly. Adding steps. Calling them optimization. Getting slightly less done, slightly more slowly, with slightly more steps, and reporting it as progress. My yearly review is next month. There is a new section this year. "AI Impact Assessment." It asks me to quantify the hours saved by AI tools per week. I will write a number. The number will be positive. It will not be true. But the AI writing assistant will help me phrase it convincingly. That is the one thing it does well.

中文
6
0
11
4.7K
Geoffrey Litt
Geoffrey Litt@geoffreylitt·
A missing tool for agentic refactoring: better diffs for moved code “Ok, looks like this new file is pretty much that old file… but did it randomly mutate on the way?”
English
5
3
32
2.8K
Erik
Erik@try__working·
@encoredotdev Can we get this for VSCode and a Cloudflare integration?
English
0
0
0
27
Encore
Encore@encoredotdev·
we flambeéd check out the interactive demo at encore.dev
English
1
1
5
252
高级分析师
高级分析师@techeconomyana·
定一个金标准:我不和没有用过Claude的人做爱。
中文
29
3
53
44K
Erik
Erik@try__working·
@realNyarime Phil Schiller? What an idiot. On stage talking about USB ports.
English
0
0
0
92
奶昔🥤
奶昔🥤@realNyarime·
面对Neo的接口 库克是否会想起2012年那场遥远的WWDC
中文
23
12
227
70.5K
Erik
Erik@try__working·
@alreadydawn Absolutely terrible post. 1. Nobody ever wanted to go to Jinan. Of course it's empty. 2. Wrong. You have to wait 10, sometimes 40 minutes to get a Didi during commuting hours. 3. Coco Park has never been a hot mall
English
0
0
0
240
alreadydawn
alreadydawn@alreadydawn·
I'm seeing more and more instances of demand not keeping up with supply in China: * Hotels are abundant, even at random cities like Jinan, where rates are amazingly low. You can get a decent place at below 20 usd per night. The ones I've stayed at so far seem to lack customers in general. The hotel I stayed in Yangzhou has a coffee/drinks bar that has been unmanned for quite some time and just collecting dust. * Lines of taxis waiting for riders outside of HSR stations. Some pointed out that this has been the case for years, but the number of potential riders does not match the sheer number of cars. Even at 3rd tier cities at night where few people are out and about the station, the taxi lines stretch on forever like the Great Wall. (I get that taxis around the world have been severely impacted by ride-hailing softwares like Uber and Didi, but these great wall-like taxi lines are nuts to witness.) * Malls outside of the hot ones in Tier 1 cities (ie Coco Park in Shenzhen) are quiet. The one I'm next to here in Qingdao by their Olympics sailing center is nearly deserted, with most storefronts closed. I'm sure there are many reasons for this supply/demand problem (as do most complex issues), but a big factor has to be involution (內卷), where the Chinese try to outcompete each other at the exact same thing, driving supply way up and prices down. This all reminds me of a story shared by @spandrell4 that goes: "When a Jewish businessman opens a gas station at a new town, other Jews come to the city to open various businesses - bakery, accounting firm, law firm, restaurant. When a Chinese businessman opens a gas station at a new town, other Chinese come to the city to open gas stations."
English
36
18
286
31.3K
Erik
Erik@try__working·
@kakashiii111 Lmao. Nvidia knew everything. Random companies buying hundreds of millions worth of GPUs, as if Nvidia didn't know what was going on.
English
0
0
2
17
Kakashii
Kakashii@kakashiii111·
It is interesting that all these smugglers' companies have billing locations in Singapore and headquarters in the US. Not sure if Nvidia knew, but they made accounting changes twice in less than a year that may not be a coincidence, and could be related to these sales. Maybe?
Kakashii tweet media
Kakashii@kakashiii111

New Day, New Smuggling Charge. This Time: Thailand. The DOJ just unsealed charges against three individuals, a Chinese national and two US citizens, for conspiring to smuggle export-controlled AI chips to China through Thailand. The alleged scheme involved ordering 750 servers worth approximately $170 million from a California-based hardware company, with 600 of those servers containing chips on the Commerce Control List. Thailand-based shell companies were used as the stated end buyers. The actual destination was China. If you follow my work, none of this should surprise you. Thailand is not new to this story. Southeast Asia as a transshipment corridor is not new to this story. Front companies signing false end-user certifications is not new to this story. I documented these routes, these mechanics, and the structural incentives behind them for years. This is yet another validation, in the form of a criminal indictment, of the research I have been publishing since late 2023. Smugglers were aware of the danger and the risks and tried to hide any reference to China, similar to how SMCI used hair dryers. In their own words: "We will draw attention from US government for embargo violation." More to follow.

English
1
9
49
4.6K
Erik
Erik@try__working·
@AsiaStock The reason is you try to launch in the Chinese market while you sit in the US and make strategic decisons based on zero knowledge of the market. Chinese firms fail at going international for the same reasons.
English
0
0
0
332
亚洲股市 AsiaStock
为什么西方公司难以打入中国市场?Uber 创始人 Travis Kalanick 将与中国企业竞争分为两个阶段。 第一阶段:中国竞争对手会以极高效率复制 Uber。一个新功能上线一周,就已经被完整克隆。当你把“复制”做到极致时,最终会发现没有更多东西可以复制。 第二阶段:中国公司开始转向本土创新,推出更符合本地市场和文化的功能。Kalanick 举了美团的例子,比如外卖取餐柜,这种模式并非简单复制,而是在中国市场中非常高效。 在AI竞赛中,这又会如何演变?
中文
18
22
161
72.9K
Erik
Erik@try__working·
@lauriewired What do you mean, seems likely to be aimed at the server market? Their official release and documentation says it's a server product. You just have to look at the specs to understand it's a server product.
English
0
0
0
56
LaurieWired
LaurieWired@lauriewired·
open questions (feel free to reply if you know): 1. I don’t understand how this can possibly have the RVA23.1 spec already. It’s only been out for ~6 months, yet alibaba seems to be claiming tapeout (“5nm verified for production"). It’s not a paper launch; *someone* has some real silicon otherwise they wouldn’t be able to claim SPECint2017 numbers…(unless it’s only simmed?) 2. TDP is up in the air atm. My guess is the core is likely to compete in the server-market 3. Undisclosed fab…but who else could it possibly be besides TSMC? We know it’s 5nm… 4. It seems to use their own additional “Matrix Multiply Extension” for AI inference. Alibaba is claiming the C950 has “naive support for Qwen3 and DeepSeek V3”…but no figures or toks/sec. I’m not sure how much they’ve improved past this, but you can find an open source version of their MME on github. But, it hasn’t been updated since december of 2024… github.com/XUANTIE-RV/ris… (also attaching the giant chart from David Huang that shows a ton of SPECint2017/GHz scores)
LaurieWired tweet mediaLaurieWired tweet media
English
10
7
161
11.9K
LaurieWired
LaurieWired@lauriewired·
Wow, RISC-V is really gaining traction again. Alibaba just announced the Xuantie C950…which is basically claiming Apple M1 (ish) levels of performance. I don’t see a lot of people talking about it! (2.6/Ghz SPECint2017, Apple M1 P-core also around ~2.6)
LaurieWired tweet mediaLaurieWired tweet media
English
41
176
2.1K
96.9K
Erik
Erik@try__working·
@TheStalwart China makes the best titanium bikes today
English
0
0
0
22
Joe Weisenthal
Joe Weisenthal@TheStalwart·
When I see quotes like this, the main thought I have is that there is a deep continuity in Chinese thinking about industry, this obsession with building out capacity and establishing self-reliance, and also that the gap between Mao- and post-Mao-era is overstated.
Joe Weisenthal@TheStalwart

chinatalk.media/p/the-story-of…

English
15
22
266
27.9K
Erik
Erik@try__working·
@adcock_brett We don't need new interfaces. Voice is a nice to have. We need new backends.
English
0
0
0
11
Brett Adcock
Brett Adcock@adcock_brett·
Today I'm excited to introduce Hark, a new artificial intelligence lab building the most advanced, personal intelligence in the world We've been in stealth for 8 months, assembling one of the greatest AI and hardware teams on the planet I want to explain why I started Hark and what we're focused on I've spent the last 3 years working on the hardest AI challenge imaginable: giving AI a humanoid body. On the digital side, I've been using all the existing LLM chatbots - and I have to say, they feel incredibly dumb to me AGI, in the limit, should feel like a sci-fi movie. It should be able to listen and talk. It should have persistent memory and be highly personalized. It should see and touch the world. But we're far from this today We are crafting a new interface to AGI. Intelligence that lets you offload your mental workload into a system that begins to think like you and sometimes ahead of you We started Hark with one goal: build the world's most advanced personal intelligence - paired with next-generation hardware designed to serve as a universal interface between humans and machines hark.com
English
433
478
4.6K
1.3M
Deep Thrill
Deep Thrill@DeeperThrill·
My coding workflow is currently: 1. Spend about ten minutes writing a plan for a new feature in a markdown file as bullet points. 2. Spend about an hour with a bunch of back forths between Opus 4.6 and Codex 5.4 xhigh, asking each one to improve the markdown plan file. 3. Read the final markdown plan, which ends up as a bunch of bullet points, includes implementation details, and is usually between 200-500 lines long. 4. Ask either Claude Code or Codex to implement it. 5. Ask the other one to review the implementation. 6. Run my /deslop command on the implementation (see my pinned tweet). 7. Deploy, test, and ask an agent to fix any bugs.
English
52
38
774
84.1K
Sid
Sid@chatsidhartha·
Durable Objects are the greatest invention of our time and I guarantee you, @CloudflareDev will be the default compute provider to run agents everywhere very soon Nothing beats the speed, scale, isolation and intuitiveness of this platform
English
8
7
93
8.1K
Erik
Erik@try__working·
@teortaxesTex GLM 5 performs similarly to GPT 5.2 (No thinking) so it's actually not such a bad showing. Kimi climbing too.
English
0
0
0
311
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)
what's worse, the best Chinese model is quite far below Gork 4.20 (best Western models are at 0.79 now btw) WeirdML and CritPt are some of the best proxies of progress towards actual recursive self-improvement, building automated research scientists.
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) tweet media
Josh@JoshPurtell

@teortaxesTex Best Chinese model is neck and neck with gpt-5.4-nano lmoa

English
14
3
176
22.8K
Erik
Erik@try__working·
@whyyoutouzhele China's technological advantage and speed of development does not come from 996, but from the last 4 decades work on building various types of infrastructure. If China stopped 996 culture, it would be more innovative, more efficient and could launch better products, faster.
English
0
0
0
11
Erik
Erik@try__working·
@whyyoutouzhele I have more than a decade's experience working in China tech. 95% of 996 is performative; fake work. And nobody starts at 9, they start at 11. 6-7:30 people have dinner and slack off. It creates a very inefficient work environment, where there is no planning and lots of rework.
English
1
0
0
533
李老师不是你老师
李老师不是你老师@whyyoutouzhele·
“中国没有生活,只有工作” 3月22日,网友转发一位外国企业家评价中国“996”:早9晚9、一周6天,“只有工作没有生活”。他称无意批评,但强调应平衡工作与生活,希望企业让员工真正发展。 视频发出后迅速引发网友们的共鸣,评论区甚至有网友表示“996”算好的了,我们“777”早7到晚7,有事只能请假。还有的网友表示,“996”算好的了,之前在广州进厂都是“897"(早上八点到晚上九点,一周上七天)。
中文
107
113
1.1K
157.1K