Fred D. | 一铭

3.1K posts

@freddmts

- Agent coding addict: https://t.co/ZwbyJ8ZbKQ
- Co-organizer, Vibe Coding Community Paris: https://t.co/y20yrcs1nn

Paris · Joined September 2009
855 Following · 335 Followers
Fred D. | 一铭@freddmts·
If you're in Paris on April 14, come by Meetup Vibe Coding Paris #2: Controlled Autonomy at @YesWeScale.
Vincent Le Gallic@vincentLg

Meetup Vibe Coding Paris #2: Controlled Autonomy on April 14 at @YesWeScale 🚀
🔹 Talk #1: @titouan_benoit (@DotfileApp): How do you adapt your DX for autonomous agents? Architecting systems where the bottleneck is no longer writing code, but the execution cycle.
🔹 Talk #2: @freddmts (@Cometh): Moving beyond manual testing: setting up automated benchmarks and evaluating prompt resilience with @promptfoo. meetup.com/vibe-coding-co…
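The second talk mentions automated prompt benchmarks with @promptfoo. As a loose illustration only (this is a generic minimal promptfoo config, not the speaker's actual setup — the provider, prompt, and assertion below are placeholders), a `promptfooconfig.yaml` might look like:

```yaml
# Hypothetical minimal promptfoo benchmark — illustrative, not the talk's config
description: Prompt resilience check (example)
prompts:
  - "Summarize in one sentence: {{article}}"
providers:
  - openai:gpt-4o-mini   # placeholder provider
tests:
  - vars:
      article: "Paris hosts its second Vibe Coding meetup on April 14."
    assert:
      - type: contains
        value: "Paris"
```

Such a file is evaluated with `npx promptfoo eval`, which runs every prompt/test combination and reports pass/fail per assertion.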

Andrej Karpathy@karpathy·
One common issue with personalization in all LLMs is how distracting memory seems to be for the models. A single question from 2 months ago about some topic can keep coming up as some kind of a deep interest of mine with undue mentions in perpetuity. Some kind of trying too hard.
Kimi.ai@Kimi_Moonshot·
Zhilin at GTC: Introducing Attention Residuals

Learning selective memory, rather than mechanically accumulating everything, is the beauty of attention. Many of you have probably read Attention Is All You Need, the 2017 Transformer paper that brought "human-like" attention into the model's field of view. From that point on, models no longer simply read everything in a mechanical way. Instead, they began to develop a sense of what matters more and what matters less across the text, choosing to retain the more important information.

Recently, Kimi applied this idea of attention to the temporal dimension, then rotated it 90 degrees into the model's depth dimension. This allows the model to have attention not only over time, but also throughout the process of information transmission across layers, giving it a more intelligent way to understand and process information.
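The thread doesn't give the mechanism's equations, so the following is only a rough numpy sketch of the general idea as described: attention weights computed over per-layer hidden states (the depth dimension) instead of a plain additive residual `x + f(x)`. All function names, shapes, and the query construction here are illustrative assumptions, not Kimi's actual architecture.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def depth_attention_residual(layer_outputs, query_vec):
    """Hypothetical sketch: combine the outputs of all earlier layers
    with attention weights over the *depth* dimension, instead of the
    plain additive residual x + f(x).

    layer_outputs: list of (d,) hidden vectors from layers 0..L-1
    query_vec:     (d,) query derived from the current layer
    """
    H = np.stack(layer_outputs)                 # (L, d) stack along depth
    scores = H @ query_vec / np.sqrt(H.shape[1])  # scaled dot-product scores
    weights = softmax(scores)                   # which layers matter more
    return weights @ H                          # (d,) selective residual

# Toy usage: three layer outputs; the query aligns most with layer 2,
# so that layer's output dominates the mix.
outs = [np.array([1.0, 0.0]), np.array([0.0, 1.0]), np.array([1.0, 1.0])]
q = np.array([1.0, 1.0])
mixed = depth_attention_residual(outs, q)
```

The point of the sketch is the contrast with a standard residual stream, which sums every layer's contribution with equal weight; here the model learns to retain what matters across depth, mirroring what temporal attention does across tokens.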
Fred D. | 一铭@freddmts·
This matches my experience with OpenClaw. Since I share more personal details with it than with other assistants, even an offhand mention can turn into something it keeps bringing up.
Andrej Karpathy@karpathy

One common issue with personalization in all LLMs is how distracting memory seems to be for the models. A single question from 2 months ago about some topic can keep coming up as some kind of a deep interest of mine with undue mentions in perpetuity. Some kind of trying too hard.

Wei Ping@_weiping·
🚀 Introducing Nemotron-Cascade 2 🚀

Just 3 months after Nemotron-Cascade 1, we're releasing Nemotron-Cascade 2: an open 30B MoE with 3B active parameters, delivering best-in-class reasoning and strong agentic capabilities.

🥇 Gold Medal-level performance on IMO 2025, IOI 2025, and ICPC World Finals 2025:
• Capabilities once thought achievable only by frontier proprietary models (e.g. Gemini Deep Think) or frontier-scale open models (e.g. DeepSeek-V3.2-Speciale-671B-A37B).
• Remarkably high intelligence density with 20× fewer parameters.

🏆 Best-in-class across math, code reasoning, alignment, and instruction following:
• Outperforms the latest Qwen3.5-35B-A3B (2026-02-24) and even the larger Qwen3.5-122B-A10B (2026-03-11).

🧠 Powered by Cascade RL + multi-domain on-policy distillation:
• Significantly expands Cascade RL across a much broader range of reasoning and agentic domains than Nemotron-Cascade 1, while distilling from the strongest intermediate teacher models throughout training to recover regressions and sustain gains.

🤗 Model + SFT + RL data: 👉 huggingface.co/collections/nv…
📄 Technical report: 👉 research.nvidia.com/labs/nemotron/…
Fred D. | 一铭@freddmts·
@crystalsssup 😂 The Chinese colleagues who've worked with him gave him that name; it's actually a way of warning us that his skill level is questionable
Fred D. | 一铭 reposted
Enes Akar@enesakar·
Announcing Context7 CLI! MCP isn't the only way anymore. Now any AI agent can pull docs with Context7 — just the CLI and the find-docs skill. One command: npx ctx7 setup