Ruirui Wan

167 posts

Ruirui Wan

Ruirui Wan

@wan_ruirui

Katılım Ekim 2021
172 Takip Edilen8 Takipçiler
Ruirui Wan
Ruirui Wan@wan_ruirui·
what is a cybersecurity risk? I am just trying to download a reel and this pops up. Is downloading currently against Chatgpt TOS?
Ruirui Wan tweet media
English
0
0
0
12
Ruirui Wan
Ruirui Wan@wan_ruirui·
@dingyi @cursor_ai 好像是不是活跃度高的都能收到 credit 虽然我感觉我靠 codex 200 刀就够活了
中文
0
0
0
578
Ding
Ding@dingyi·
啊啊啊竟然也收到了 @cursor_ai 赠送的 10000 美金!下个月只用 Cursor 其他什么都不用了!
Ding tweet media
中文
60
2
136
39.4K
Ruirui Wan
Ruirui Wan@wan_ruirui·
@dearemon 我觉得微信读书真的是最好用的读书软件 但不知道为什么我朋友更喜欢复杂到不行的起点说书多
中文
0
0
0
25
禿道道🐟
禿道道🐟@dearemon·
马化腾发明微信读书是为了死了以后上天堂吗?这软件绝对是腾讯系最好用的软件了,完全是给他积德来的
中文
524
173
5.9K
1.3M
Sarah Chieng
Sarah Chieng@MilksandMatcha·
Giving away 5 Windsurf Max ($200/month) plans Each person will get 3 months of free Windsurf Max (highest tier). Try out SWE 1.6, Cognition's latest, fastest, and most intelligent model, powered by @cerebras. Winners will be selected from comments in 48 hours, comment below why you want it.
Cognition@cognition

We’re releasing SWE-1.6, our best model in both intelligence & model UX. SWE-1.6 matches our Preview model on SWE-Bench Pro while dramatically improving on various behavioral axes. It’s available today in Windsurf in two modes: free tier (200 tok/s) and fast tier (950 tok/s).

English
1K
51
853
162.3K
Ruirui Wan
Ruirui Wan@wan_ruirui·
@sqs @bytebot Yes if I can get the same results, or a flat monthly subscription fee like adobe. I am feeling the same to @bytebot Amp is so great but crazy expensive. I can easily use up 100 dollar token per day (API price compare to cheap codex subscription price)
English
1
0
2
77
Quinn Slack
Quinn Slack@sqs·
Would you be willing to pay a per-token fee (like X cents per 1M tokens) for using Amp in this case? I totally understand the appeal. It is a big distraction and comes with a lot of risks (customer support, compat, competitive, etc.) that could make it hard for us to stay on the frontier.
English
1
0
2
121
Colin Charles
Colin Charles@bytebot·
i love amp, but what changed? a lot more codex usage, is what seems to have happened. yet, every time i use amp code, it truly is awesome. it does stuff no one else does, i.e solve the problem!
Colin Charles tweet media
English
1
0
7
1.9K
Ruirui Wan
Ruirui Wan@wan_ruirui·
@sqs @bytebot I understand that tokens are expensive. What I mean is, if OpenAI supports ACP and oauth, could we potentially use the OpenAI subscription directly? I would be very happy if I could use my existing membership directly in amp rather than relying on Claude Code or Codex.
English
1
0
2
148
Quinn Slack
Quinn Slack@sqs·
@bytebot Cool, makes sense and thank you. We are working on some new product stuff, so stay tuned.
English
2
0
5
200
Ruirui Wan
Ruirui Wan@wan_ruirui·
Today is my first, and also my last two days, using Cursor. Why? I bought a one-year Cursor membership in April 2025, but I barely used it. At the time, I had also subscribed to Cloud and Codex, and I never imagined they would become so popular. As a result, I didn't use my Cursor membership much for a whole year. With only three days left until it expires, I remembered I had it. I wanted to try out the new Composer 2, especially since the $20 usage limit for Cursor Pro membership is so small. I figured $20 would only cover one or two tasks before running out. So, I tried Cursor 2. I found the model to be quite fast, but it's completely incapable of handling tasks it's not specialized in. For example, today it spent about 20 minutes getting context and reviewing the surrounding text. However, its context window quickly filled up, leading to an endless loop of re-examining the context. Its cache was also enormous, reaching millions, even though it only made minor code changes. I now feel it's just an average model. It's acceptable as a basic model for Cursor, but it truly can only perform specific tasks. It can't fully replace humans like Cloud or OpenAI can, especially when it comes to producing a 90-100% perfect result. It probably only achieves 60-70%. That's why I spent most of today finding, catching, and fixing bugs, which was quite frustrating. @cursor_ai
Ruirui Wan tweet media
English
0
0
0
50
Ruirui Wan
Ruirui Wan@wan_ruirui·
@CtrlAltDwayne @Zai_org and when you see 8token per second speed currently, I think they already maxed out all the usage right now for their gpu
Ruirui Wan tweet media
English
0
0
6
708
Dwayne
Dwayne@CtrlAltDwayne·
What an absolute disgrace. @Zai_org has doubled its subscription prices COMPLETELY ANNOUNCED. Nothing in its Discord, nothing on its website. No announcement on its account here. Just jacked them up without notice.
Dwayne tweet mediaDwayne tweet media
English
91
12
298
98.1K
Lex
Lex@xw33bttv·
@nzinfo @wan_ruirui @Zai_org Yeah that checks, the few people I know who did manage to snag subs there said it was on par with openrouter speed.
English
1
0
0
28
Lex
Lex@xw33bttv·
lmfao westerner tax from @Zai_org is crazy The same "Max" plan served in China that costs $68 USD (469 RMB) is $160 USD for westerners. Irony is, if you have WeChat or Alipay, you can just buy the Chinese plan and still have API access. So it's not even GEOLOCK related costs. It's just capitalising on the gold rush right now for subscription-based OAuth integrations. Diabolical is an understatement tbh
Lex tweet media
English
109
32
753
93.9K
Ruirui Wan
Ruirui Wan@wan_ruirui·
@LexnLin can i use deepthink with antigravity or only inside gemini?
English
1
0
1
2K
Leon Lin
Leon Lin@LexnLin·
left: DeepThink right: GPT5.4 Pro Same prompt btw
Leon Lin tweet mediaLeon Lin tweet media
English
62
9
702
131.9K
芒果刀🥭
芒果刀🥭@ThirteenYizuka·
感谢我们的125个用户 一周前,我还是个货比三家,到处找便宜中转的学生党用户,面对高昂的价格望而却步。一个偶然的机会,让我了解到了大多数中转站的成本价,其中巨大的价差不经让我思考,有没有可能自己搭建一个半公益性质的中转站,以接近成本价的价格对外销售? 抱着这样的想法,我在我的一台闲置的服务器上架设好了中转,绑定到了我的个人域名,并邀请了几个朋友前来使用。然而,一传十,十传百,越来越多的人想要加入我们:试开放注册的第一天,我们获得了40个用户。巨大的调用量一度使得来不及扩容的号池被烧干,我也因此不得不熬了好几个晚上来完善自动化机器人。 7天,125个用户,67563个请求,100亿调用量,我没想到我们会增长地这么快。一切的初衷,只是一个学生,为了解决自己的token需求,顺便方便一下朋友。不过无论如何,我都会尽量以当前能压到的最低价格给大家提供服务,并保持公开透明。口说无凭,一些措施正在路上,例如公开号池信息和成本,以保证中转站的公开透明。 维护站点真的很累,不过看到大家用AI实现了自己的想法,我觉得一切都值得了。欢迎进群吹水(QQ: 1093545322,也许可以捡到野生兑换码?!),或者购买我们的服务来支持一下站点❤感谢你看到这里!
芒果刀🥭 tweet media
芒果刀🥭@ThirteenYizuka

开放注册了:router.daoge.me 新注册送10刀额度🥰,不过为了防滥用目前暂时只允许qq和gmail邮箱注册🤔

中文
36
22
317
98K
Ruirui Wan
Ruirui Wan@wan_ruirui·
@0xSero What’s different running in droid and Claude code on glm5.1 what changed
English
0
0
0
114
0xSero
0xSero@0xSero·
lifehack 1. copy ~/.claude into ~/.claude-zai 2. make an alias for claude-zai when you use that command it passes off your api key and url to Claude Code 3. use GLM-5.1 in Claude Code The model is very good with Claude Code, I prefer Droid but loop is really useful
0xSero tweet media
English
27
10
342
15.5K
Ruirui Wan
Ruirui Wan@wan_ruirui·
@pvncher @RepoPrompt Based on actual testing, Claude encounters many issues when calling the officially supported OpenAI Codex plugin, especially during large tasks haven’t tried with repo prompt yet
English
1
0
0
56
eric provencher
eric provencher@pvncher·
@wan_ruirui @RepoPrompt You can have Claude start a codex agent with the that command (plan and build workflow)! Then built in agent has some benefits in terms of reviewing its own work and being more token efficient, so it’s nice to be able to spin one up. Up to you on how you want to use it
English
1
0
0
120
eric provencher
eric provencher@pvncher·
Just released @RepoPrompt 2.1! Really big update that makes it possible to use the RP Agent via MCP/CLI, and for the RP Agent to invoke sub agents! Never been easier to have claude steer codex agents using RP tools to handle ambitious tasks efficiently. Great for openclaw too!
English
7
6
54
5.4K
Ruirui Wan
Ruirui Wan@wan_ruirui·
Yes — Mercury felt extremely fast in practice, with very little noticeable latency. Part of that may be that it is not spending extra time on reasoning like some of the other models I tested. But while it is clearly much faster — close to 2x in my tests — I would not say its translation quality is the best. I have not done a controlled length-scaling benchmark yet, and I am still reading more about this architecture since it is so new. I want to test it more before making a stronger claim, but so far it does seem to stay very responsive as outputs get longer.
English
0
0
0
11
Bnaf.OG | 🟧
Bnaf.OG | 🟧@bnafOg·
@wan_ruirui The Mercury 2 speed advantage isn't just optimization — it's architectural. It's a diffusion LLM: tokens generated in parallel, not sequentially. That gap should widen for longer outputs. Did you notice latency scaling differently vs the others as text length increased?
English
1
0
0
16
Ruirui Wan
Ruirui Wan@wan_ruirui·
I have recently tested several newly released small language models. When balancing price and performance specifically for translation, I found that the top four options are: 1. Gemini 3.1 Flash Lite 2. Mercury 2 3. GPT-5.4 Nano 4. Mistral 4 Small The first three are essentially the best-performing products in this category. There is also Mistral 4 small, but it is primarily chosen for its low cost, as its performance is relatively poor compared to the top three. This ranking is based on a mix of: latency / real-world responsiveness translation naturalness idiom and tone handling HTML / formatting preservation output stability / empty-response rate token efficiency API pricing From my tests: Mercury 2 was the fastest overall at about 0.66–0.79s average latency, delivered 5/5 successful outputs, and had the best balance of speed, stability, and cost at $0.25/M input, $0.75/M output. GPT-5.4 Nano was slightly slower at around 0.79s, but produced some of the most natural phrasing, especially on sarcasm, idioms, and social-style text. Its downside is a much higher $1.25/M output price. Gemini 3.1 Flash Lite Preview had the strongest translation quality in some cases, especially for idioms, tone, and technical clarity, but it was slower at around 1.24s and also the most expensive on output at $1.50/M. Mistral Small is still worth mentioning as a low-cost option, but it fell behind the top three in translation quality and formatting reliability, including weaker HTML preservation in my tests. One surprising result: Qwen 3.5 was not competitive for real-time translation at all. Across the lineup, it often spent 97%–111% of usable output budget on reasoning, sometimes burned 500+ thinking tokens just to translate “Hello world,” and frequently returned empty outputs. In practice, that made it far too slow and unreliable for this use case.
Ruirui Wan tweet media
English
2
0
0
90
Ruirui Wan
Ruirui Wan@wan_ruirui·
I'm currently trying to see what else these small models can be used for besides translation. Does anyone have any suggestions?
English
0
0
0
15