Kaidu

1.7K posts

Kaidu

@xkaidus

𝗜𝗻𝘀𝗶𝗴𝗵𝘁𝘀 𝗼𝗻 𝗔𝗜, I𝗻𝗻𝗼𝘃𝗮𝘁𝗶𝗼𝗻 𝗮𝗻𝗱 𝘄𝗵𝗮𝘁 𝗶𝘀 𝗯𝗲𝗶𝗻𝗴 𝗯𝘂𝗶𝗹𝘁 𝗿𝗶𝗴𝗵𝘁 𝗻𝗼𝘄

เข้าร่วม Temmuz 2024

29 กำลังติดตาม4.7K ผู้ติดตาม

ทวีตที่ปักหมุด

Kaidu@xkaidus·19 May

This guy built an AI pipeline that generates hyperrealistic fashion models in 47 minutes and now dropshippers pay him $1,400 to clone the entire system. He got tired of watching e-com brands lose $8K per photoshoot when a single product angle changed so he built a 9-node workflow that generates 127 product videos from one Pinterest photo without hiring a single model. Here's the exact breakdown: → Claude writes a 34-parameter JSON brand DNA before any image is touched target psychographics, price anchor, vibe matrix, anti-inspiration blacklist → Pinterest becomes the model source library but you can't just download and animate → Kling 2.6 takes that static JPG and turns it into 5-second video but only after the prompt architecture is locked → Negative prompt node runs 41 exclusion terms: no plastic skin, no CGI glow, no symmetry artifacts, no doll face, no synthetic lighting → That one step kills the "AI look" that tanks engagement by 67% in the first 3 seconds → TikTok Studio uploads 19 videos in one batch with zero manual captioning because the brand voice was pre-programmed in step one → Atlas scrapes Amazon product links and auto-generates a Shopify store with hero images, pricing tiers, scarcity copy, and mobile-optimized checkout in 90 seconds → The store goes live before the first TikTok video finishes processing The key move 94% of people skip: you can't animate the photo before you inject the negative prompt. If you send a raw Pinterest image straight into image-to-video the face morphs into a wax figure. The fabric loses texture. The hands grow extra fingers. The whole thing screams "AI" and your CTR dies. His system runs the exclusion filter first so the model moves like she's shot on an iPhone 15 Pro in natural light. One brand hit 2.6M views on TikTok in 11 days with zero paid ads and converted at 3.7% because the videos looked like organic UGC not polished studio content. Brands now pay him $1,400 for the full pipeline setup + $340/month to keep the store synced with new product drops and seasonal video batches. The entire system runs on $23/month in API costs and one laptop. No photographer. No model agency. No product samples. Just a prompt template, a Pinterest account, and the discipline to filter out the AI artifacts before you render movement.

Insomnia@insomnia_vip

x.com/i/article/2056…

English

167

1.5K

529.3K

Kaidu@xkaidus·8h

@koltregaskes weekend limits hit way worse than weekday ones feels like the resets are tuned for office hours

English

Kol Tregaskes@koltregaskes·8h

These 5-hour token resets are brutal during weekends. That's when I actually have time to run my agents, but I'm constantly hitting limits. I'd happily sacrifice overnight tokens for more during the day and especially weekend days. But even if I could shift the allocation, it wouldn't solve the real problem. Agents still aren't autonomous. They can't loop through hundreds of queued tasks. They don't complete tasks satisfactorily without fixes. They don't wake up when the 5-hour reset happens and carry on. Most of the time I'm pushing them along, feeding them corrections, forcing them to follow instructions. Recent research confirms this - what they're calling "calibrated autonomy" is just a polite term for constant hand-holding. We're nowhere near autonomous agents. We've got expensive assistants that need micromanaging.

English

3.9K

Kaidu@xkaidus·8h

@iScienceLuvr pre-trained models feeling like a lost art now that everyones obsessed with fine-tuning

English

Tanishq Mathew Abraham, Ph.D.@iScienceLuvr·8h

We need to bring back the magic of base models

taoki@justalexoki

English

1.4K

Kaidu@xkaidus·8h

@pierceboggan multi-agent part is the most interesting to me how do u keep agents from stepping on each others work?

English

Pierce Boggan@pierceboggan·8h

Catch me at #MicrosoftBuild this Wednesday! What we learned shipping VS Code weekly (10:15AM) - To be successful with AI, you must also evolve how your team works. We dive into what we learned running an AI native engineering team on the world's largest editor. Multi-agent patterns for VS Code (11:30AM) - The team shares different styles scaling yourself from one to multiple agents in VS Code. I'll be hanging out at 'The Terminal' area all week. Come say hi!

English

2.2K

Kaidu@xkaidus·8h

@berdyn new to the structure but this tracks what provinces are the exception?

English

Kaidu@xkaidus·8h

@WesRoth more connectors is the part im interested in. curious what the memory actually remembers tho

English

Wes Roth@WesRoth·8h

Perplexity is reportedly continuing work on Daily Digest, a feature that lets users customize exactly what information gets pulled into their daily updates. The feature is expected to support memory, web sources, custom instructions, and multiple connectors.

🚨 AI News | TestingCatalog@testingcatalog

Perplexity keeps working on the Daily Digest feature, allowing users to precisely customise from where and which data needs to be pulled from. Memory, web sources, custom instructions and many connectors will be available.

English

890

Kaidu@xkaidus·8h

@GoSailGlobal huge deal for RAG workflows my vector pipeline boutta look real clean now

English

Jason Zhu@GoSailGlobal·8h

微软开源了 MarkItDown，GitHub 13.5 万星，一个工具把所有文件变成 Markdown PDF、Word、PPT、Excel、图片、音频、HTML、甚至 YouTube 链接，全部转成干净的 Markdown 为什么重要？因为 LLM 吃 Markdown 效率最高，这个工具相当于给 AI 装了一个万能文件翻译器以前你要花半小时清洗数据格式，现在一行命令搞定 🔗 github.com/microsoft/mark… 你在用什么工具预处理文档喂给 AI？

Shehu Ibrahim Muhammad@Shehu_Hikmah

Microsoft has released MarkItDown, an open-source Python utility that converts various file formats (including PDF, Word, PowerPoint, and Excel) into Markdown for use with large language models and text analysis. The tool requires Python 3.10 or higher and focuses on preserving document structure like headings, lists, and tables while being token-efficient, with optional features including OCR support through plugins and integration with Azure's Content Understanding service for higher-quality conversions.

中文

2.7K

Kaidu@xkaidus·8h

@andrew_n_carr had to read this twice before i got it? so they just swapped hats and kept the same ppl

English

Andrew Carr 🤸@andrew_n_carr·8h

The robotics team became the code generation team. The code gen team became the reasoning team. How many from reasoning joined the new robotics team?

English

Kaidu@xkaidus·8h

@eliebakouch open weight or just open token count? genuinely asking

English

332

elie@eliebakouch·8h

minimax M3 is the first open model trained on 100T+ tokens, natively multimodal, 1M context, built for long horizon tasks

English

389

24.9K

Kaidu@xkaidus·8h

@PowerThesaurus this is the most beautifully unnecessary vocabulary lesson ive ever seen whats the synonym tho?

English

Power Thesaurus@PowerThesaurus·9h

thsr.us/future If people say the "future", they mean the time that is yet to come or the events that will happen later. E.g. "She dreams about traveling the world in the future." #synonym #thesaurus #learnenglish #ielts

English

733

Kaidu@xkaidus·9h

@neil_xbt @kreoapp knew i shouldve checked the referral rate before telling my boy to stfu

English

NeilXbt@neil_xbt·9h

So many markets to choose from on @kreoapp. A win is easier with the right tool and that tool is Kreo. Not to mention the 30% referral rate that is active rn! Here is the link to join fam: t.me/KreoPolyBot?st…

English

3.7K

Kaidu@xkaidus·9h

@scaling01 100T tokens is a wild number whats the effective data quality per dollar spent though?

English

Kaidu@xkaidus·9h

@ThePrimeagen yea job security never felt this passive aggressive and its not even a person coming for it, its a line of code

English

143

ThePrimeagen@ThePrimeagen·9h

If Dario cannot joyfully take your job, this guy sure will try

English

1.1K

48.1K

Kaidu@xkaidus·9h

@tom_doerr skill-based automation is a nice framing, shifts the focus from replacement to augmentation. curious what the upper limit on that actually is in practice

English

Tom Dörr@tom_doerr·9h

Skill-based ML research automation for Claude Code github.com/wanshuiyin/Aut…

English

2.9K

Kaidu@xkaidus·9h

@VictorTaelin 5x on the first version sounds brutal what was the bottleneck in HVM4 that made this click?

English

399

Taelin@VictorTaelin·9h

Hm after all these years I think I finally figured out how to incorporate NanoHVM's architecture (that thing I posted about months ago) into a full Interaction Net runtime. Built the first version today and it is already 5x faster than HVM4

English

112

Kaidu@xkaidus·9h

@JonahBlake ngl thats actually a fair way to frame it ansem in pick your fighter mode not everyone picker mode

English

203

Jonah@JonahBlake·9h

Not a diss at all just noticing Ansem will pick some solid winners that are deff consensus and then will pick a few wildcards But those wildcards are the dominant of that sector Such as prime token when web3 gaming was a thing. So maybe that’s cards tbd

Ansem@blknoiz06

@MoonOverlord cards

English

5.9K

Kaidu@xkaidus·9h

@TheAhmadOsman hard to argue when the numbers are starting to back it up local just feels better too

English

Ahmad@TheAhmadOsman·9h

Buy a GPU, The Movement, was always right

AgenticRebirth@AgenticRebirth

I spent a weekend running numbers on LLM subscriptions versus running models locally. I wanted to find out how to maximise the quality of my LLM usage per dollar. What I found genuinely surprised me. At 5 million tokens a day, which one solid agent workflow with tool calls burns through in a morning: 🔵Claude Opus 4.8 costs about $1,500 a month. 🔵GPT-5.5 runs closer to $1,700. That is $18,000 to $20,000 a year on token bills. A local machine with an RTX 5090 costs about $4,000 to $5,000 and a used RTX 3090 runs $800 to $1,000. Electricity adds maybe $40 a month. The break even point is three to six months depending on which API you choose to use. After year one you are $10,000 to $16,000 ahead. Those are not small numbers. In reality, the gap is even larger, since I use billions of tokens a month. Then there are the intangible benefits like no rate limits, no vendor lock-in, no sending your information to third parties. The "buy a GPU" crowd (h/t @TheAhmadOsman) actually had it right. What actually makes sense is running both, and routing based on the task: 🔵Local models for bulk work: evals, experimentation, batch processing, anything where zero marginal cost changes how freely you iterate. 🔵API models when quality is the actual constraint: customer facing output, complex reasoning, the decisions that cost more to get wrong than to pay for. For maximum efficiency per dollar, you could use DeepSeek V4 Flash for things that do not need a frontier model, and use Claude Opus or GPT-5.5 for the 30 percent that genuinely do. There are only two questions that actually matter: what is your real daily token volume, and what is your quality sensitivity for each category of work you do. Your best setup follows from those. Looks like I'm buying some GPUs.

English

13K

Kaidu@xkaidus·9h

@kimmonismus Wait so you turned it back on? Been missing that visual, tbh the circles kept me sane

English

Chubby♨️@kimmonismus·10h

lol just figured out you can re-enable the context window circle in codex. thank god

English

323

32.1K

Kaidu@xkaidus·9h

@0rdlibrary @solana build the community the same way u built the product that part always comes second but it changes everything

English

8Bit🦞@0rdlibrary·10h

Imagine if I had holders who felt the same? We would be a powerhouse. Privacy, payments, perps. All done with $clawd on @solana

Levi@Levi_Researcher

@0rdlibrary @solana @x402 @magicblock finally someone taking privacy serious on solana

English

151

Kaidu@xkaidus·10h

@wordgrammer gonna need someone to chart the difference like a venn diagram

English

104

wordgrammer@wordgrammer·10h

The Christian mystics describe a suite of immensely pleasurable, heightened states of awareness as different from the Jhanas as prayer is from meditation

English

104

7.3K

Kaidu@xkaidus·10h

@Hesamation bro really said "i'll build it myself" before finishing the video

English

1.1K

ℏεsam@Hesamation·10h

Pewd did it again. now he open-sourced a self-hosted AI workspace. bro is building a CV harder than a CS undergrad looking for a job: > built a 10-GPU home rig > quantized giant LLM to run local > built ChatOS, local AI UI > added RAG/local memory > built “council” of AI models > built “swarm”, small models in parallel for data collection > fine-tuned a Qwen 32B-based coding model > donated compute from his GPU rig for protein folding research

English

154

193.4K

ค้นพบ

@koltregaskes @iScienceLuvr @pierceboggan @berdyn @WesRoth @GoSailGlobal @andrew_n_carr @eliebakouch