Valerio Cestrone

5.1K posts

Valerio Cestrone

@valeriocestrone

England, United Kingdom Katılım Aralık 2009

4.9K Takip Edilen1.5K Takipçiler

Valerio Cestrone retweetledi

John Fletcher (𝔦, 𝔦)@Dr_JohnFletcher·3 Nis

Andrej, This sounds extremely useful, and I think it might be even more significant than it first appears. What you describe is not just a knowledge base for information. The structure of the wiki, the queries you file back, etc, encode *how* you do research: which questions to ask, which connections matter, what's worth pursuing. That's “know-how” (in the sense of Michael Polanyi). This sort of knowledge is, currently, overwhelmingly absent from training data, because it was never written down (since there was no point). Now there is, because it significantly improves the AIs performance. But notice what's happening. You propose to build the most efficient mechanism ever devised for making tacit expert know-how / methodology explicit and machine-readable, and then transmitting it, via API, to a third-party model provider. Every query against the wiki is a reasoning trace: see attached video clip. The compiled wiki itself is a structured map of your research process. This is the mechanism described here: x.com/dr_johnfletche… Expert know-how is being externalised and captured through ordinary productive use of AI tools. The user gets a better tool. The platform gets a transferable problem-solving strategy. The fact that this works so well could, in a sense, be the problem: the better it works, the more indispensable it becomes, the more know-how flows out, and, realistically, the less choice people have *not* to use it. Your instinct that "there is room here for an incredible new product" is right. But whoever builds it will be sitting on the highest-fidelity capture mechanism for expert know-how ever constructed. The question is: is the data subject to a “data network effect”, by which I mean, the kind of “data flywheel” which gave Google a 25 year monopoly over search? If so, you might be building not only more most powerful tool humanity has ever possessed, but this power might end up in the hands of a single entity. It would be great to hear your thoughts around this.

Andrej Karpathy@karpathy

LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating knowledge (stored as markdown and images). The latest LLMs are quite good at it. So: Data ingest: I index source documents (articles, papers, repos, datasets, images, etc.) into a raw/ directory, then I use an LLM to incrementally "compile" a wiki, which is just a collection of .md files in a directory structure. The wiki includes summaries of all the data in raw/, backlinks, and then it categorizes data into concepts, writes articles for them, and links them all. To convert web articles into .md files I like to use the Obsidian Web Clipper extension, and then I also use a hotkey to download all the related images to local so that my LLM can easily reference them. IDE: I use Obsidian as the IDE "frontend" where I can view the raw data, the the compiled wiki, and the derived visualizations. Important to note that the LLM writes and maintains all of the data of the wiki, I rarely touch it directly. I've played with a few Obsidian plugins to render and view data in other ways (e.g. Marp for slides). Q&A: Where things get interesting is that once your wiki is big enough (e.g. mine on some recent research is ~100 articles and ~400K words), you can ask your LLM agent all kinds of complex questions against the wiki, and it will go off, research the answers, etc. I thought I had to reach for fancy RAG, but the LLM has been pretty good about auto-maintaining index files and brief summaries of all the documents and it reads all the important related data fairly easily at this ~small scale. Output: Instead of getting answers in text/terminal, I like to have it render markdown files for me, or slide shows (Marp format), or matplotlib images, all of which I then view again in Obsidian. You can imagine many other visual output formats depending on the query. Often, I end up "filing" the outputs back into the wiki to enhance it for further queries. So my own explorations and queries always "add up" in the knowledge base. Linting: I've run some LLM "health checks" over the wiki to e.g. find inconsistent data, impute missing data (with web searchers), find interesting connections for new article candidates, etc., to incrementally clean up the wiki and enhance its overall data integrity. The LLMs are quite good at suggesting further questions to ask and look into. Extra tools: I find myself developing additional tools to process the data, e.g. I vibe coded a small and naive search engine over the wiki, which I both use directly (in a web ui), but more often I want to hand it off to an LLM via CLI as a tool for larger queries. Further explorations: As the repo grows, the natural desire is to also think about synthetic data generation + finetuning to have your LLM "know" the data in its weights instead of just context windows. TLDR: raw data from a given number of sources is collected, then compiled by an LLM into a .md wiki, then operated on by various CLIs by the LLM to do Q&A and to incrementally enhance the wiki, and all of it viewable in Obsidian. You rarely ever write or edit the wiki manually, it's the domain of the LLM. I think there is room here for an incredible new product instead of a hacky collection of scripts.

English

203

40.7K

Chris@everestchris6·4 Nis

this AI agent builds and sells info products on full autopilot. here's how: - scan subreddits like r/anxiety, r/solotravel, r/socialskills, r/overthinking every few hours - find the fears people keep posting about over and over - generate a short PDF guide that actually helps them through it - spin up a landing page with payments built in - scan Reddit 24/7 for people posting about that exact problem and drop helpful comments pointing them to the guide - run completely hands off it finds the pain, builds the product and finds the customers. fully automated reply "AGENT" + RT and I'll send you a free guide so you can set it up too (must be following so I can DM)

English

249

143

239

23.7K

Valerio Cestrone@valeriocestrone·13 Nis

@everestchris6 Agent

English

Valerio Cestrone retweetledi

Chris@everestchris6·11 Nis

this OpenClaw bot finds $500k–$1.2M homes without pools, renders a pool into their backyard, and mails the owner a postcard showing the before/after, on autopilot... here's how pool builders can close $50k+ deals with this system: - scans satellite imagery for mid-market homes with empty backyards - filters by lot size, sun exposure & recent ownership change - pulls the homeowner direct from public records (not shared leads) - renders a luxury pool dropped into their actual yard - calculates build cost + home value lift for their specific zip - generates a cinematic video of their backyard with the new pool - prints a personalised postcard with the before/after + QR code - drops it in the mail + hits them with retargeting every step from sourcing to outreach is automated. reply "POOL" + RT and i'll send you the full breakdown so you can build this too (must be following so i can DM)

English

1.4K

945

7.5K

1.3M

Valerio Cestrone@valeriocestrone·4 Nis

Instead of the fastest chip, mem, GPU, NPU, I find it more useful to focus on compounded outcome Components serve different layers, all together can achieve a consistently higher output over time if orchestrated "efficiently" Like a company vs an individual contributor #ai

English

Valerio Cestrone retweetledi

Vali Neagu@AmbsdOP·2 Mar

YES! Someone reverse-engineered Apple's Neural Engine and trained a neural network on it. Apple never allowed this. ANE is inference-only. No public API, no docs. They cracked it open anyway. Why it matters: • M4 ANE = 6.6 TFLOPS/W vs 0.08 for an A100 (80× more efficient) • "38 TOPS" is a lie - real throughput is 19 TFLOPS FP16 • Your Mac mini has this chip sitting mostly idle Translation: local AI inference that's faster AND uses almost no power. Still early research but the door is now open. → github.com/maderix/ANE #AI #MachineLearning #AppleSilicon #LocalAI #OpenSource #ANE #CoreML #AppleSilicon #NPU #KCORES

English

157

732

7.1K

550.7K

Valerio Cestrone retweetledi

vx-underground@vxunderground·20 Ara

Behold my magnum opus. Here is my malware analysis paper collection Papers: 2006 - 2 papers 2007 - 6 papers 2008 - 4 papers 2009 - 15 papers 2010 - 46 papers 2011 - 60 papers 2012 - 127 papers 2013 - 140 papers 2014 - 170 papers 2015 - 355 papers 2016 - 480 papers 2017 - 793 papers 2018 - 801 papers 2019 - 1056 papers 2020 - 1989 papers 2021 - 2634 papers 2022 - 2607 papers 2023 - 1450 papers 2024 - 1153 papers 2025 - 800 papers Don't you EVER ask "whAts A GooD pLaCe To LeaRn MaLwaRe aNalYsiS?". I've got 14,869 malware analysis papers curated and organized. Most the papers have the samples with them too. It's the muthafuckin' library of Alexandria for malware. This shit took half a decade.

English

1.1K

70K

Valerio Cestrone retweetledi

Denzil Buchner@DenzilZA·21 Eki

@netcapgirl Gives a new meaning to "I'm dating a model"

English

362

28.6K

Valerio Cestrone retweetledi

Aqsa@Aqsahere_·20 Eki

There is a massive AWS outage right now. Half of the internet is down at the moment including Perplexity, Snapchat, Amazon, Apple, PUBG etc 𝕏 is up.

English

128

21.5K

Valerio Cestrone retweetledi

Bernd Strehl@strehldev·20 Eki

Life hack: Make sure your support ticket system is also running on the same aws region as your own SaaS. In case of an outage nobody can complain.

English

849

142.6K

Valerio Cestrone retweetledi

Nick D'Agostino@realNickDags·20 Eki

While @amazon is a monopoly, it’s clear that this is a MAJOR concern when so many companies use @awscloud to function… #AWS #Amazon #outage

English

185

624

165.2K

Valerio Cestrone retweetledi

Daniel Sharon@Daniel_Sharon_·20 Eki

One of the largest AWS outages in years is unfolding right now. Core services across multiple regions are impacted, affecting thousands of businesses globally like @McDonalds @Snap @Venmo @lyft @hulu @DisneyPlus

English

169

61.4K

Valerio Cestrone retweetledi

Derrick Shields@DerrickShieldsX·20 Eki

Too many dependencies → Coinbase and Robinhood are down because an AWS region is down. Thank goodness these websites are just for games and memes, and not dealing people's actual money or anything. It's going on hour now... 100% unacceptable @CoinbaseSupport @RobinhoodApp

Rosemount, MN 🇺🇸 English

442

87.5K

Valerio Cestrone retweetledi

Lefteris Karapetsas@LefterisJP·20 Eki

AWS is down and then the internet stops working. But the blockchain, it never goe ... wait a minute. Scratch that. This sector is a joke. Everyone preaching decentralization and censorship resistance but in reality ... it's all 100% reliant on the cloud.

English

1.2K

1.3K

10.9K

1.1M

Valerio Cestrone retweetledi

Dith@0xDith·20 Eki

when u realize the entire crypto industry runs on AWS

GIF

English

656

1.3K

21.7K

Valerio Cestrone retweetledi

World Bollard Association™@WorldBollard·26 Ağu

In a world full of so much misery, you can ALWAYS rely on bollards to make things better. #WorldBollardAssociation

English

1.2K

82.2K

Valerio Cestrone retweetledi

vx-underground@vxunderground·21 Eyl

tl;dr of today > @rastalandTV gets crypto drained > he has stage 4 cancer > hes targeted specifically for his cancer treatment money > loses $32,000 > nerds band together > @ZssBecker donates $30,000 to him > malware nerds come together > drainer infra found > pull all victim data from infra > victims will be notified > all malware flagged > osint nerds come together > find drainers info from their telegram ids > find info from their steam ids tl;dr tl;dr stage 4 cancer bro gets fucked over, 50+ nerds band together to undo the damage fuck cancer

English

167

782

8.3K

538.9K

Valerio Cestrone retweetledi

🤍𝕁𝕆🤍@jomickane·17 Eyl

Copied from fb.... My daughter came home from school and said, “Mom, you’re not going to believe what happened in history class today.” Her teacher told the class they were going to play a game. He walked around the room and whispered to each kid whether they were a witch or just a regular person. Then he gave the instructions: “Form the biggest group you can without a witch. If your group has even one, you all fail.” She said the whole room instantly lit up with suspicion. Everyone started interrogating each other. Are you a witch? How do we know you’re not lying? Some kids clung to one big group, but most broke off into smaller, exclusive cliques. They turned away anyone who seemed uncertain, nervous, or gave off even the slightest hint of being guilty. The energy shifted fast. Suddenly everyone was suspicious of everyone. Whispers. Finger-pointing. Side-eyes. Trust dissolved in minutes. Finally, when all the groups were formed, the teacher said, “Alright, time to find out who fails. Witches, raise your hands.” And not one hand went up. The whole class exploded. “Wait! You messed up the game!” And then the teacher dropped the bomb: “Did I? Were there any actual witches in Salem, or did everyone just believe what they were told?” My daughter said the room went dead silent. That’s when it hit them. No witch was ever needed for the damage to happen. Fear had already done its work. Suspicion alone divided the entire class, turning community into chaos. And isn’t that exactly what we’re seeing today? Different words, same playbook. Instead of “witch,” it’s liberal, conservative, vaxxed, unvaxxed, pro-this, anti-that. The labels shift, but the tactic is the same. Get people scared. Get them suspicious. Get them divided. Then sit back while trust crumbles. The danger was never the witch. The danger is the rumor. The suspicion. The fear. The planted lies. Refuse the whisper. Don’t play the game. Because the second we start hunting “witches,” we’ve already lost.

English

289

1.9K

6.1K

218.9K

Valerio Cestrone retweetledi

Barack Obama@BarackObama·17 Eyl

Our democracy is not self-executing. It depends on us all as citizens, regardless of our political affiliations, to stand up and fight for the core values that have made this country the envy of the world.

English

23.1K

30.6K

194.5K

13.5M

Valerio Cestrone@valeriocestrone·8 Ağu

theoceancleanup.com 🌊 The largest cleanup in history

English

Keşfet

@everestchris6 @netcapgirl @amazon @awscloud @McDonalds @Snap @Venmo @lyft