Tony Davis

839 posts

Tony Davis

@TonyD993

Drop in arxiv markdown formatter: https://t.co/J57PiQVOeC

Katılım Nisan 2021

422 Takip Edilen95 Takipçiler

Tony Davis@TonyD993·21 Mar

@lauriewired i hear this music in my nightmares

English

286

LaurieWired@lauriewired·20 Mar

Telephone Hold music sounds really bad. Mostly because it’s mapping complex instruments to a human throat. Many phone lines these days use CELP, Code-Excited Linear Prediction algorithms. Music breaks down in weird ways when you turn a piano into…speech.

English

1.7K

129.4K

Tony Davis@TonyD993·20 Mar

@Sauers_ Sparsity regularization?

English

Sauers (in Berkeley / SF)@Sauers_·20 Mar

Any computation graph can be realized as a neural network. Good compilers optimize computation graphs to eliminate redundancy. We currently have no such compilers for neural networks. Why?

English

4.5K

Tony Davis@TonyD993·20 Mar

@kalomaze The point is that the models memorize solutions instead of generalizing the process of programming

English

174

kalomaze@kalomaze·19 Mar

the kind of person who asks "but does this transfer generalize to Brainfuck?" is simply not being a serious person tbqh

English

2.1K

kalomaze@kalomaze·19 Mar

not shocking at all; the models don't want to write in byzantine esoteric languages instead of python or rust or whatever

Lossfunk@lossfunk

🚨 Shocking: Frontier LLMs score 85-95% on standard coding benchmarks. We gave them equivalent problems in languages they couldn't have memorized. They collapsed to 0-11%. Presenting EsoLang-Bench. Accepted to the Logical Reasoning and ICBINB workshops at ICLR 2026 🧵

English

259

17.3K

Tony Davis@TonyD993·19 Mar

@tenobrus Exactly! If a small open source model matches frontier performance, the frontier will move. Obviously

English

Tenobrus@tenobrus·19 Mar

i think this will almost certainly happen, but when it happens no serious knowledge worker will use it, because frontier models at the time will be so much more capable using last year's model on a local box would be the equivalent of purposefully giving yourself brain damage

Brenner@BrennerSpear

I'll call my shot more specifically: Q1 2027 an open source model running on 32gb of memory will do 100 tok/s and match Opus 4.6 on most benchmarks

English

493

26.8K

Tony Davis@TonyD993·18 Mar

@askalphaxiv check out markxiv.org if you want full text and images

English

alphaXiv@askalphaxiv·17 Mar

Introducing MCP for arXiv Let your research agents stand on the shoulders of giants Fast multi-turn retrieval, keyword search, and embedding search tools across millions of arXiv papers 🚀

English

403

3.1K

259.7K

Tony Davis@TonyD993·18 Mar

@InferXai Yes highly interested. Do you support custom models?

English

InferX@InferXai·17 Mar

@TonyD993 We are in Private beta. Happy to give you access if you want to try it out.

English

InferX@InferXai·17 Mar

A couple of weeks ago we demonstrated 1.5s cold starts for a 32B model. Today we’ve pushed it even further. We’re now seeing sub-second cold starts for models of this size. Why does this matter? Cold start latency is one of the biggest barriers to true serverless inference. If models take 40 seconds or minutes to start, developers are forced to keep GPUs running 24/7. Fast cold starts change the developer experience completely. Models can finally run on demand instead of sitting idle. We’ll be talking about how this works during our live technical webinar on Wednesday, March 18 at 8:30 AM PST. Link in the comments.

English

200

Tony Davis@TonyD993·17 Mar

@InferXai Not general access yet?

English

InferX@InferXai·17 Mar

@TonyD993 Yes, we expose an API. It’s compatible with the OpenAI-style interface, so you can plug it into existing workflows pretty easily.

English

Tony Davis@TonyD993·16 Mar

@JustDeezGuy Somethings gotta give somewhere. There's a minimum amount of complexity in a given problem. That can be wrangled any number of ways. Most pure FP or OOP I've read are terribly hard to read and maintain.

English

Paul Snively@JustDeezGuy·16 Mar

@TonyD993 Not necessarily.

English

754

Paul Snively@JustDeezGuy·16 Mar

Pure FP is about “reducing the state a developer has to hold in their head” to 0.

trish@_trish_xD

wise words from the best systems engineer I've worked with: "two things that make code actually maintainable: 1. reduce the layers a reader has to trace 2. reduce the state a reader has to hold in their head" applies to every codebase. always.

English

456

43.4K

Tony Davis@TonyD993·13 Mar

@LammingLab Isn't leucine implicated in tons of aging effects because of mTOR activation?

English

2.2K

Lamming Lab@LammingLab·13 Mar

Please enjoy the latest publication from our lab "Restriction of Individual Branched-Chain Amino Acids has Distinct Effects on the Development and Progression of Alzheimer's Disease in 3xTg Mice"

English

210

327.2K

Tony Davis@TonyD993·13 Mar

@ChShersh Yes 911? this post right here

English

105

Dmitrii Kovanikov@ChShersh·13 Mar

Here me out. React but in C++

English

103

490

41.8K

Tony Davis@TonyD993·13 Mar

@contextkingceo How's it different from any other graph db?

English

Nishkarsh@contextkingceo·12 Mar

We've raised $6.5M to kill vector databases. Every system today retrieves context the same way: vector search that stores everything as flat embeddings and returns whatever "feels" closest. Similar, sure. Relevant? Almost never. Embeddings can’t tell a Q3 renewal clause from a Q1 termination notice if the language is close enough. A friend of mine asked his AI about a contract last week, and it returned a detailed, perfectly crafted answer pulled from a completely different client’s file. Once you’re dealing with 10M+ documents, these mix-ups happen all the time. VectorDB accuracy goes to shit. We built @hydra_db for exactly this. HydraDB builds an ontology-first context graph over your data, maps relationships between entities, understands the 'why' behind documents, and tracks how information evolves over time. So when you ask about 'Apple,' it knows you mean the company you're serving as a customer. Not the fruit. Even when a vector DB's similarity score says 0.94. More below ⬇️

English

619

658

5.9K

3.8M

Tony Davis@TonyD993·12 Mar

@shakoistsLog It is smart for about 3.4 seconds before it literally starts outputting gibberish or chinese. Claude still wins.

English

shako@shakoistsLog·12 Mar

i need you be really serious with you guys for a second, no trolling. gpt 5.4 xhigh codex is like 50% smarter than opus 4.6 in claude code. maybe 70%. if you need intelligence make the switch.

English

113

1.6K

103.9K

Tony Davis@TonyD993·11 Mar

@DistStateAndMe @erfan_mhi Lfg

Distributed State@DistStateAndMe·11 Mar

@TonyD993 @erfan_mhi We are working on it

English

Erfan Miahi@erfan_mhi·10 Mar

We just released the model + technical report for Covenant-72B. The largest LLM ever pre-trained on a fully decentralized infrastructure. 72B parameters trained over the open internet with permissionless GPUs. This is a big step toward making decentralized pre-training actually practical. Amazing work by @covenant_ai team.

templar@tplr_ai

We just completed the largest decentralised LLM pre-training run in history: Covenant-72B. Permissionless, on Bittensor subnet 3. 72B parameters. ~1.1T tokens. Commodity internet. No centralized cluster. No whitelist. Anyone with GPUs could join or leave freely. 1/n

English

204

17.7K

Tony Davis@TonyD993·11 Mar

@tplr_ai Any incentives to train?

English

318

templar@tplr_ai·10 Mar

English

213

955

6.3K

1.8M

Tony Davis@TonyD993·10 Mar

@CtrlAltDwayne Context rot will kill you eventually

English

Dwayne@CtrlAltDwayne·10 Mar

AI generated code will add three null checks, two fallbacks and a try/catch around something that has never once failed in production. And you know what? It still runs. It still passes tests. It shipped in 4 minutes instead of 4 hours. I genuinely do not care about the extra null check.

English

146

22.8K

Tony Davis@TonyD993·9 Mar

@mov_axbx This is what I've been doing

English

Tony Davis@TonyD993·6 Mar

@itsolelehmann This will NEVER happen. Banks don't have apis because they rely on browser fingerprints to prevent fraud.

English

Ole Lehmann@itsolelehmann·6 Mar

I need a personal bank account with api access to i can do simple banking tasks using my agent

English

160

244

61.1K

Keşfet

@lauriewired @Sauers_ @kalomaze @tenobrus @askalphaxiv @InferXai @JustDeezGuy @LammingLab