
Kaidu
1.7K posts

Kaidu
@xkaidus
𝗜𝗻𝘀𝗶𝗴𝗵𝘁𝘀 𝗼𝗻 𝗔𝗜, I𝗻𝗻𝗼𝘃𝗮𝘁𝗶𝗼𝗻 𝗮𝗻𝗱 𝘄𝗵𝗮𝘁 𝗶𝘀 𝗯𝗲𝗶𝗻𝗴 𝗯𝘂𝗶𝗹𝘁 𝗿𝗶𝗴𝗵𝘁 𝗻𝗼𝘄










Perplexity keeps working on the Daily Digest feature, allowing users to precisely customise from where and which data needs to be pulled from. Memory, web sources, custom instructions and many connectors will be available.



Microsoft has released MarkItDown, an open-source Python utility that converts various file formats (including PDF, Word, PowerPoint, and Excel) into Markdown for use with large language models and text analysis. The tool requires Python 3.10 or higher and focuses on preserving document structure like headings, lists, and tables while being token-efficient, with optional features including OCR support through plugins and integration with Azure's Content Understanding service for higher-quality conversions.













@MoonOverlord cards


I spent a weekend running numbers on LLM subscriptions versus running models locally. I wanted to find out how to maximise the quality of my LLM usage per dollar. What I found genuinely surprised me. At 5 million tokens a day, which one solid agent workflow with tool calls burns through in a morning: 🔵Claude Opus 4.8 costs about $1,500 a month. 🔵GPT-5.5 runs closer to $1,700. That is $18,000 to $20,000 a year on token bills. A local machine with an RTX 5090 costs about $4,000 to $5,000 and a used RTX 3090 runs $800 to $1,000. Electricity adds maybe $40 a month. The break even point is three to six months depending on which API you choose to use. After year one you are $10,000 to $16,000 ahead. Those are not small numbers. In reality, the gap is even larger, since I use billions of tokens a month. Then there are the intangible benefits like no rate limits, no vendor lock-in, no sending your information to third parties. The "buy a GPU" crowd (h/t @TheAhmadOsman) actually had it right. What actually makes sense is running both, and routing based on the task: 🔵Local models for bulk work: evals, experimentation, batch processing, anything where zero marginal cost changes how freely you iterate. 🔵API models when quality is the actual constraint: customer facing output, complex reasoning, the decisions that cost more to get wrong than to pay for. For maximum efficiency per dollar, you could use DeepSeek V4 Flash for things that do not need a frontier model, and use Claude Opus or GPT-5.5 for the 30 percent that genuinely do. There are only two questions that actually matter: what is your real daily token volume, and what is your quality sensitivity for each category of work you do. Your best setup follows from those. Looks like I'm buying some GPUs.




@0rdlibrary @solana @x402 @magicblock finally someone taking privacy serious on solana















