cyberyoung
350 posts


The boundary question between China and India is a complex sensitive question left over from history. It concerns the sentiment of our peoples, and requires dialogue and consultation to seek a fair, reasonable and mutually acceptable solution.
In recent years, China and India have maintained regular communication on boundary questions through mechanisms such as the Special Representatives (SR) and the Working Mechanism for Consultation and Coordination on India-China Border Affairs (WMCC). Currently, the border situation is generally stable and peaceful.
The two countries should put the boundary question at an appropriate position in bilateral relations, not allow the boundary question to define the overall China-India relations, still less let specific differences affect bilateral cooperation.
English

@VanberghenEU You are in 1st class compartment. Feel the real state of China by seating in an ordinary compartment
English

China, in Xi’an, at a railway station so ordinary that nobody would ever think to write about it, a ticket clerk paused for a moment and told me: “Be careful, the water is hot.”
I showed my passport many times. And honestly, I preferred that inconvenience to the alternative of never showing a passport but constantly looking over my shoulder in the street. Grand speeches about freedom mean little if people do not feel safe in their daily lives.
I love Europe far too much to pretend that everything is fine. We spend endless hours debating the future, drafting strategies, and discussing values, while too often neglecting the simple things that make a society work: attention, responsibility, order, and basic civic trust.
Europe needs to regain its sense of direction.
It really does.



English

@ReportBeijing @xinwendiaocha 你这个言论,就像跪台办一样,各种惠台,利益究竟去了哪里,台湾的普通人普通农民有受惠?那些受惠了的台南农民又认为应该感谢谁?说回菲律宾,我们有能力,将管理插入菲律宾,甚至菲律宾的基层吗?如果真的如你所说,马那一派不会跳着将这些公布,对他有利啊。你说的都是大家都知道的大路货
中文

@showme_asap @kynkyn73670170 你们不认,我们当然可以。何况,你们先查查自己的屁股,去google一下“闽平渔事件”
中文

@BruceWa69879898 @jxzdmzw 还是不敢正面回答,一切都是老共的锅。你们是独立国家,也支棱点啊,日菲这么欺负你们,你们都不出声吗?整天就知道对着老共出声?原来真正怂的是你们啊,你们都一点不为台湾渔民考虑考虑?
中文

@cyberyoung @jxzdmzw 一切的源頭,不就是共匪認為台灣是中國的一省?
你盡往別的方向去說,啥論點都一樣啦!
我的回覆很簡單:
台灣就是台灣,不屬於中國,歡迎來觀光,也歡迎打過來。
掰扯一堆彼此互不認同的想法,浪費時間啦!
中文

@cyberyoung @jxzdmzw .......... 這隻粉蛆哪來的新話題?
我點破這個 “共匪孬種不敢打”,簡潔有力的事實,哪裡虛了?
中文

@BruceWa69879898 @jxzdmzw 那就独嘛,反正共匪孬种。怎么又中华民国就是台独了,和以前说得不一样嘛,和党纲说法也不一样
中文

@Real_Baomi @JinRyangKR 好。我来问你,为何台湾的防空识别区,西边划到了江西,东边连日本的边都不敢碰?不是有重叠吗?怎么东北不去重叠了?
中文

@haitun6459861 @AndyBxxx Interestingly, Chinese renminbi notes feature not only Chinese characters but also four minority scripts, including Uyghur. In Japan, how many people still use the Ainu script?
English

@haitun6459861 @AndyBxxx First, everyone is like this, not just Uyghurs; second, are these actions all legal in your country?
English

@RonaldSimmonsUS @m0d8ye 你可能忘了,这一次,是openAI和Anthropic一开始就封禁了中国。btw,你一再说什么“中共国”,只能体现你的执念和执拗
中文

@m0d8ye 你好像没弄清楚中共国的大模型公司出现的原因。
中共国之所以有独立的大模型公司,根本原因是CCP有自己的“维稳”需求(必须严格控制国民的意识形态),因此必须在美国的公司之外“另起炉灶”。
x.com/m0d8ye/status/…
Max Lv@m0d8ye
一直好奇这一轮 AI 泡沫中为什么没有印度大模型公司出现。过往我接触下来,印度工程师在数理上的能力并不比大陆人弱的。
中文

@PearceChen42 @kong_ls @whiteTony99 6. DeepSeek从开始到现在,发表了很多的论文和文章,甚至有美国的YouTube主用它的论文引导,自己以Qwen为底座,训练出了自己的模型。具体事例你可以自行google
7. 相对于DeepSeek晦涩难懂的论文,openAI和Anthropic的这类指控更为普通人喜闻乐见,因为不用动脑子,看得懂,情绪化投入就可以
中文

@PearceChen42 @kong_ls @whiteTony99 1. 人间说的是靠蒸馏能蒸馏出万亿参数的大模型?
2. 你这里给出的是openAI认为DeepSeek违反了它的用户协议,记着,是违反了用户协议
3. 这是openAI的单方面宣称
4. 蒸馏是一个没有褒贬的行为,学术圈还有专门的研究
5. DeepSeek是一个开源项目,不仅权重而且代码
中文
cyberyoung retweetet

how did we make deepseek outperform opus 4.7?
i've been thinking about why "open model bad at tool calling" is almost always a harness problem, not a model problem.
context: spent the two days looking at billions of tokens in @CommandCodeAI (tb open source ai cli) using deepseek. I ended up writing a tool-input repair layer. the trigger was watching deepseek-flash fail on the simplest /review run, every shellCommand and readFile call bouncing back with a raw zod issues blob, the model unable to recover because the error wasn't in a form it could read. by the end deepseek v4 pro was beating opus 4.7 6/10 times on our internal evals.
a few things i learned that feel general:
1/ the failure modes aren't random they're a small finite compositional set.
across deepseek-flash, deepseek v4 pro, glm, qwen, the same four mistakes repeat almost exactly:
- sending `null` for an optional field instead of omitting it
- emitting `["a","b"]` as a json *string* instead of an actual array
- wrapping a single arg in `{}` where the schema expected an array (an "empty placeholder")
- passing a bare string where an array was expected (`"foo"` instead of `["foo"]`)
four repairs, ~30-100 lines each, ordered carefully (json-array-parse must run before bare-string-wrap or `'["a","b"]'` becomes `['["a","b"]']`). that is the whole catalogue. when i hear "this open source model can't do tool calls" i now assume one of those four, and so far that's been right ~90% of the time.
2/ the funniest failure mode is also the most revealing.
deepseek-flash, when asked to edit or write a file, sometimes emits the path as a *markdown auto-link*:
filePath: "/Users/x/proj/[notes.md](http://notes. md)"
our writeFile tool obediently trued creating files literally named `[notes.md](http://notes .md)` until we caught it. this is not a hallucination. it's the post-training chat distribution leaking through the tool boundary the model has been rewarded for auto-linking in conversational output, and is applying that prior in a context where it makes no sense. the fix is two regex lines that unwrap only the degenerate case where link text equals url-without-protocol real markdown like `[click](https://x .com)` passes through untouched.
this is also conditioning of their own tools during RL which were different from all other tools we write and ofc can't predict.
"tool confusion" is a more useful frame than "capability gap." the model knows how to format a path. it just hasn't been told clearly enough that this path is going to fopen, not into a chat bubble. so we encode that hint at the schema level `pathString()` instead of `z.string()` and the leak is plugged for every path field at once.
3/ the design choice that mattered was inverting preprocess-then-validate to validate-then-repair.
my first attempt was the obvious one: a preprocessing pass that normalized inputs (strip nulls, parse stringified arrays, etc.) before zod ever saw them. it broke immediately, writeFile content that *happened* to be json-shaped got rewritten before it hit disk. silent corruption, easy to miss in a smoke test.
then i made it less greedy
- parse the input as-is. if it succeeds, ship it. valid inputs are never touched.
- on failure, walk the validator's own issue list. for each issue path, try the four repairs in order until one applies.
- parse again. on success, log `tool_input_repaired:${toolName}`. on failure, log `tool_input_invalid:${toolName}` and return a model-readable retry message.
the structural insight here is: when you preprocess, you encode a prior about what's broken. when you let the validator complain first, the schema is the prior, and you only spend repair budget at the exact paths the schema actually disagreed at. the validator is doing the work of localizing the bug for you. it's the same shape as cheap-then-careful everywhere else try the fast path, fall back on evidence.
(this also gives you per-tool telemetry for free. you can watch repair rates per (model, tool) and notice when a model regresses on a specific contract before users do.)
4/ shape invariants and relational invariants need different fixes.
the four repairs above all handle shape problems wrong type, missing key, wrong container. but read_file had a *relational* invariant: "if you provide offset, you must also provide limit, and vice versa." deepseek kept calling `readFile({ absolutePath, limit: 30 })` and getting an `ERROR:` back. you can't fix this with input repair, because each field is independently valid the bug is in the relationship between them.
so i taught the function the model's intent instead. `limit` alone → `offset = 0`. `offset` alone → `limit = 2000` (matches common read tool ops default). then surfaced the decision back to the model in the result:
"Note: limit was not provided; defaulted to 2000 lines. To read more or fewer lines, retry with both offset and limit."
no `Error:` prefix, so the tui doesn't paint it red. the model sees what we picked and can self-correct on the next turn if our guess was wrong. transparency over silent magic wins big.
repair where you can. extend semantics where you can't. surface the choice either way.
zoom out:
a lot of what looks like model capability is actually contract design. a strict schema is a choice with a cost it filters out noise, but it also filters out recoverable noise from any model that hasn't memorized the exact json contract you happened to pick. the largest commercial models eat that cost invisibly and are linient on tool calling because they've seen enough of every contract during pretraining; open models pay it loudly and get dismissed for it.
the harness is where you mediate between distributions. four small repairs (i'm sure more to follow as we have three more merging today), two regex lines for auto-links, one relational default, one prefix change. the model didn't change. the contract got more forgiving in exactly the places it needed to be.
deepseek v4 pro now beats opus 4.7 6/10 times on our internal evals.
imo "skill issue" applies to the harness more often than the model.
Ahmad Awais@MrAhmadAwais
Wow I just made DeepSeek V4 Pro beat Opus 4.7 6/10 times in our internal evals by auto repairing many of its quirks in tool calling. It’s performing super solid for such a cheap model.
English




















