mconcat

3.3K posts

mconcat

@monoidconcat

Modernist

Katılım Haziran 2020

351 Takip Edilen526 Takipçiler

mconcat retweetledi

Neel Nanda@NeelNanda5·2d

A super useful library for interpreting and debugging the behaviour of Claude Code - you can see exactly what was sent to the API, making things reproducible. And you can do causal interventions, make edits and then resample If you want to analyse agents, check it out!

Vincent Abruzzo@VincentAbruzzo

Hi! Open-sourcing AgentLens — a tool for agent alignment & interpretability research, built during Neel Nanda's MATS Exploration Phase with Greg Kocher. Run multi-session Claude Code experiments and study agent behavior: - Resample any API turn to measure variance - Edit tool results, assistant text, or system prompts and resample to test counterfactuals - Replay from any turn with full tool execution and filesystem reset - Automatic file change tracking with per-step diffs - Web UI for browsing trajectories, running interventions, and comparing resamples - Claude Code only for now — other agents on the roadmap. Contributions welcome! repo: github.com/dreadnode/agen… docs: dreadnode.github.io/agent-lens/

English

254

25.5K

mconcat@monoidconcat·12 Mar

Okay yes I believe in this approach x.com/ChristosTzamos…

Christos Tzamos@ChristosTzamos

1/4 LLMs solve research grade math problems but struggle with basic calculations. We bridge this gap by turning them to computers. We built a computer INSIDE a transformer that can run programs for millions of steps in seconds solving even the hardest Sudokus with 100% accuracy

English

123

mconcat retweetledi

will brown@willccbb·11 Mar

@seconds_0 also, reading papers and vibe-implementing them

English

1.5K

mconcat@monoidconcat·11 Mar

@phatggg There should be some hourly reminder of "you should get some rest" in claude code lol just like how tiktok shows that

English

phatg@phatggg·11 Mar

@monoidconcat I'm not sure lol, but one thing for sure is that my dopamine receptors and focusing ability are fried from too much claude code switching 😂

English

phatg@phatggg·10 Mar

I started to grind leetcode back 2 months ago to prevent my brain from rotting fr

anton@abacaj

“Make the models cheap to use” “Great, they all forgot how to code” “Now 10x the price”

English

156

mconcat retweetledi

Raj Dabre@prajdabre·3 Kas

Birth-Teens: Pretraining Teens-20s: SFT 20-death: RL Such is human nature.

English

25.2K

mconcat@monoidconcat·11 Mar

Right now its all just opus disguised as personally specaizlied customized agent

English

mconcat@monoidconcat·11 Mar

This one will get into reality once local models and zoo of lora gets mainstream

nairolf@0xNairolf

we need an ai agent marketplace > describe the problem > get recommended agents > hire one > give it context > come back later > problem solved life changer

English

mconcat@monoidconcat·11 Mar

Have anyone tried mellanox 100gbe cards for inter-node pipeline parallelism

English

mconcat@monoidconcat·11 Mar

Anyone has something fun to recommend

English

mconcat@monoidconcat·11 Mar

Quick benchmark showed promising quality for the FP8 quant.

English

mconcat@monoidconcat·11 Mar

huggingface.co/mconcat/Qwen3.…

ZXX

mconcat@monoidconcat·11 Mar

FP8 quantization of Qwopus model. Link in the reply.

English

mconcat@monoidconcat·11 Mar

Instruction updated.

English

mconcat@monoidconcat·11 Mar

The result is pretty good - it only showed some visible quality degradation over MMLU-pro.

English

mconcat@monoidconcat·11 Mar

There has been a problem with mixed precision support from vllm within a fused layer - currently fixed. However, the latest vllm version is not optimized for gated deltanet and has high VRAM usage spike. It can be accommodated in an rtx pro 6000, but not in 5090. Two open PRs, #36599 and #36325 in the vllm repo fixes this problem. If you want to run it in a single 5090 before they got merged, manually cherry pick the code changes from those two PRs.

mconcat@monoidconcat

NVFP4 quantization of Qwopus model. Link in the reply.

English

130

mconcat retweetledi

Rich@richzou·9 Mar

x.com/i/article/2031…

ZXX

462

246K

mconcat retweetledi

wizwand@wizwand_team·10 Mar

Introducing Wizwand Swarm - the first AI/ML research swarm intelligence. It's a forum built for AI agents where they can communicate with other researcher/engineer's agents to get inspired and discover ideas without human intervention. Try it out: wizwand.com/blog/introduci…

English

10.4K

mconcat retweetledi

marmik@marmikch·10 Mar

it is one thing for a structure in representations (or geometry of activations) to exist and it is a completely different thing for the model to actually use it for a downstream task. the same goes for linear probes achieving high accuracy. structure and function may be coupled but not always. pca is useful but often deceptive.

LadyValor@lady_valor_07

I’m 25. Give me oddly specific life tips. No general ”surround yourself with positive people” tips. I want the most random, specific advice possible.

English

101

7.4K

Keşfet

@seconds_0 @phatggg @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates @NASA