Kaidu

1.7K posts

Kaidu banner
Kaidu

Kaidu

@xkaidus

𝗜𝗻𝘀𝗶𝗴𝗵𝘁𝘀 𝗼𝗻 𝗔𝗜, I𝗻𝗻𝗼𝘃𝗮𝘁𝗶𝗼𝗻 𝗮𝗻𝗱 𝘄𝗵𝗮𝘁 𝗶𝘀 𝗯𝗲𝗶𝗻𝗴 𝗯𝘂𝗶𝗹𝘁 𝗿𝗶𝗴𝗵𝘁 𝗻𝗼𝘄

เข้าร่วม Temmuz 2024
29 กำลังติดตาม4.7K ผู้ติดตาม
ทวีตที่ปักหมุด
Kaidu
Kaidu@xkaidus·
This guy built an AI pipeline that generates hyperrealistic fashion models in 47 minutes and now dropshippers pay him $1,400 to clone the entire system. He got tired of watching e-com brands lose $8K per photoshoot when a single product angle changed so he built a 9-node workflow that generates 127 product videos from one Pinterest photo without hiring a single model. Here's the exact breakdown: → Claude writes a 34-parameter JSON brand DNA before any image is touched target psychographics, price anchor, vibe matrix, anti-inspiration blacklist → Pinterest becomes the model source library but you can't just download and animate → Kling 2.6 takes that static JPG and turns it into 5-second video but only after the prompt architecture is locked → Negative prompt node runs 41 exclusion terms: no plastic skin, no CGI glow, no symmetry artifacts, no doll face, no synthetic lighting → That one step kills the "AI look" that tanks engagement by 67% in the first 3 seconds → TikTok Studio uploads 19 videos in one batch with zero manual captioning because the brand voice was pre-programmed in step one → Atlas scrapes Amazon product links and auto-generates a Shopify store with hero images, pricing tiers, scarcity copy, and mobile-optimized checkout in 90 seconds → The store goes live before the first TikTok video finishes processing The key move 94% of people skip: you can't animate the photo before you inject the negative prompt. If you send a raw Pinterest image straight into image-to-video the face morphs into a wax figure. The fabric loses texture. The hands grow extra fingers. The whole thing screams "AI" and your CTR dies. His system runs the exclusion filter first so the model moves like she's shot on an iPhone 15 Pro in natural light. One brand hit 2.6M views on TikTok in 11 days with zero paid ads and converted at 3.7% because the videos looked like organic UGC not polished studio content. Brands now pay him $1,400 for the full pipeline setup + $340/month to keep the store synced with new product drops and seasonal video batches. The entire system runs on $23/month in API costs and one laptop. No photographer. No model agency. No product samples. Just a prompt template, a Pinterest account, and the discipline to filter out the AI artifacts before you render movement.
Insomnia@insomnia_vip

x.com/i/article/2056…

English
34
167
1.5K
529.3K
Kaidu
Kaidu@xkaidus·
@koltregaskes weekend limits hit way worse than weekday ones feels like the resets are tuned for office hours
English
0
0
0
59
Kol Tregaskes
Kol Tregaskes@koltregaskes·
These 5-hour token resets are brutal during weekends. That's when I actually have time to run my agents, but I'm constantly hitting limits. I'd happily sacrifice overnight tokens for more during the day and especially weekend days. But even if I could shift the allocation, it wouldn't solve the real problem. Agents still aren't autonomous. They can't loop through hundreds of queued tasks. They don't complete tasks satisfactorily without fixes. They don't wake up when the 5-hour reset happens and carry on. Most of the time I'm pushing them along, feeding them corrections, forcing them to follow instructions. Recent research confirms this - what they're calling "calibrated autonomy" is just a polite term for constant hand-holding. We're nowhere near autonomous agents. We've got expensive assistants that need micromanaging.
Kol Tregaskes tweet media
English
22
2
38
3.9K
Kaidu
Kaidu@xkaidus·
@iScienceLuvr pre-trained models feeling like a lost art now that everyones obsessed with fine-tuning
English
0
0
0
17
Kaidu
Kaidu@xkaidus·
@pierceboggan multi-agent part is the most interesting to me how do u keep agents from stepping on each others work?
English
0
0
0
56
Pierce Boggan
Pierce Boggan@pierceboggan·
Catch me at #MicrosoftBuild this Wednesday! What we learned shipping VS Code weekly (10:15AM) - To be successful with AI, you must also evolve how your team works. We dive into what we learned running an AI native engineering team on the world's largest editor. Multi-agent patterns for VS Code (11:30AM) - The team shares different styles scaling yourself from one to multiple agents in VS Code. I'll be hanging out at 'The Terminal' area all week. Come say hi!
English
8
5
55
2.2K
Kaidu
Kaidu@xkaidus·
@berdyn new to the structure but this tracks what provinces are the exception?
English
0
0
1
10
Kaidu
Kaidu@xkaidus·
@WesRoth more connectors is the part im interested in. curious what the memory actually remembers tho
English
0
0
0
5
Wes Roth
Wes Roth@WesRoth·
Perplexity is reportedly continuing work on Daily Digest, a feature that lets users customize exactly what information gets pulled into their daily updates. The feature is expected to support memory, web sources, custom instructions, and multiple connectors.
Wes Roth tweet media
🚨 AI News | TestingCatalog@testingcatalog

Perplexity keeps working on the Daily Digest feature, allowing users to precisely customise from where and which data needs to be pulled from. Memory, web sources, custom instructions and many connectors will be available.

English
3
0
14
890
Kaidu
Kaidu@xkaidus·
@GoSailGlobal huge deal for RAG workflows my vector pipeline boutta look real clean now
English
0
0
0
54
Jason Zhu
Jason Zhu@GoSailGlobal·
微软开源了 MarkItDown,GitHub 13.5 万星,一个工具把所有文件变成 Markdown PDF、Word、PPT、Excel、图片、音频、HTML、甚至 YouTube 链接,全部转成干净的 Markdown 为什么重要?因为 LLM 吃 Markdown 效率最高,这个工具相当于给 AI 装了一个万能文件翻译器 以前你要花半小时清洗数据格式,现在一行命令搞定 🔗 github.com/microsoft/mark… 你在用什么工具预处理文档喂给 AI?
Jason Zhu tweet media
Shehu Ibrahim Muhammad@Shehu_Hikmah

Microsoft has released MarkItDown, an open-source Python utility that converts various file formats (including PDF, Word, PowerPoint, and Excel) into Markdown for use with large language models and text analysis. The tool requires Python 3.10 or higher and focuses on preserving document structure like headings, lists, and tables while being token-efficient, with optional features including OCR support through plugins and integration with Azure's Content Understanding service for higher-quality conversions.

中文
2
4
20
2.7K
Kaidu
Kaidu@xkaidus·
@andrew_n_carr had to read this twice before i got it? so they just swapped hats and kept the same ppl
English
0
0
0
85
Andrew Carr 🤸
Andrew Carr 🤸@andrew_n_carr·
The robotics team became the code generation team. The code gen team became the reasoning team. How many from reasoning joined the new robotics team?
English
8
1
32
3K
Kaidu
Kaidu@xkaidus·
@eliebakouch open weight or just open token count? genuinely asking
English
1
0
0
332
elie
elie@eliebakouch·
minimax M3 is the first open model trained on 100T+ tokens, natively multimodal, 1M context, built for long horizon tasks
elie tweet media
English
25
22
389
24.9K
Kaidu
Kaidu@xkaidus·
@PowerThesaurus this is the most beautifully unnecessary vocabulary lesson ive ever seen whats the synonym tho?
English
0
0
1
14
Kaidu
Kaidu@xkaidus·
@neil_xbt @kreoapp knew i shouldve checked the referral rate before telling my boy to stfu
English
0
0
0
77
NeilXbt
NeilXbt@neil_xbt·
So many markets to choose from on @kreoapp. A win is easier with the right tool and that tool is Kreo. Not to mention the 30% referral rate that is active rn! Here is the link to join fam: t.me/KreoPolyBot?st…
NeilXbt tweet media
English
47
22
78
3.7K
Kaidu
Kaidu@xkaidus·
@scaling01 100T tokens is a wild number whats the effective data quality per dollar spent though?
English
0
0
0
34
Kaidu
Kaidu@xkaidus·
@ThePrimeagen yea job security never felt this passive aggressive and its not even a person coming for it, its a line of code
English
0
0
0
143
ThePrimeagen
ThePrimeagen@ThePrimeagen·
If Dario cannot joyfully take your job, this guy sure will try
ThePrimeagen tweet media
English
55
32
1.1K
48.1K
Kaidu
Kaidu@xkaidus·
@tom_doerr skill-based automation is a nice framing, shifts the focus from replacement to augmentation. curious what the upper limit on that actually is in practice
English
0
0
0
15
Kaidu
Kaidu@xkaidus·
@VictorTaelin 5x on the first version sounds brutal what was the bottleneck in HVM4 that made this click?
English
1
0
1
399
Taelin
Taelin@VictorTaelin·
Hm after all these years I think I finally figured out how to incorporate NanoHVM's architecture (that thing I posted about months ago) into a full Interaction Net runtime. Built the first version today and it is already 5x faster than HVM4
Taelin tweet media
English
8
1
112
6K
Kaidu
Kaidu@xkaidus·
@JonahBlake ngl thats actually a fair way to frame it ansem in pick your fighter mode not everyone picker mode
English
0
0
0
203
Jonah
Jonah@JonahBlake·
Not a diss at all just noticing Ansem will pick some solid winners that are deff consensus and then will pick a few wildcards But those wildcards are the dominant of that sector Such as prime token when web3 gaming was a thing. So maybe that’s cards tbd
Ansem@blknoiz06

@MoonOverlord cards

English
5
0
25
5.9K
Kaidu
Kaidu@xkaidus·
@TheAhmadOsman hard to argue when the numbers are starting to back it up local just feels better too
English
0
0
0
46
Ahmad
Ahmad@TheAhmadOsman·
Buy a GPU, The Movement, was always right
AgenticRebirth@AgenticRebirth

I spent a weekend running numbers on LLM subscriptions versus running models locally. I wanted to find out how to maximise the quality of my LLM usage per dollar. What I found genuinely surprised me. At 5 million tokens a day, which one solid agent workflow with tool calls burns through in a morning: 🔵Claude Opus 4.8 costs about $1,500 a month. 🔵GPT-5.5 runs closer to $1,700. That is $18,000 to $20,000 a year on token bills. A local machine with an RTX 5090 costs about $4,000 to $5,000 and a used RTX 3090 runs $800 to $1,000. Electricity adds maybe $40 a month. The break even point is three to six months depending on which API you choose to use. After year one you are $10,000 to $16,000 ahead. Those are not small numbers. In reality, the gap is even larger, since I use billions of tokens a month. Then there are the intangible benefits like no rate limits, no vendor lock-in, no sending your information to third parties. The "buy a GPU" crowd (h/t @TheAhmadOsman) actually had it right. What actually makes sense is running both, and routing based on the task: 🔵Local models for bulk work: evals, experimentation, batch processing, anything where zero marginal cost changes how freely you iterate. 🔵API models when quality is the actual constraint: customer facing output, complex reasoning, the decisions that cost more to get wrong than to pay for. For maximum efficiency per dollar, you could use DeepSeek V4 Flash for things that do not need a frontier model, and use Claude Opus or GPT-5.5 for the 30 percent that genuinely do. There are only two questions that actually matter: what is your real daily token volume, and what is your quality sensitivity for each category of work you do. Your best setup follows from those. Looks like I'm buying some GPUs.

English
10
3
79
13K
Kaidu
Kaidu@xkaidus·
@kimmonismus Wait so you turned it back on? Been missing that visual, tbh the circles kept me sane
English
0
0
0
69
Chubby♨️
Chubby♨️@kimmonismus·
lol just figured out you can re-enable the context window circle in codex. thank god
Chubby♨️ tweet mediaChubby♨️ tweet media
English
35
8
323
32.1K
Kaidu
Kaidu@xkaidus·
@0rdlibrary @solana build the community the same way u built the product that part always comes second but it changes everything
English
0
0
1
14
Kaidu
Kaidu@xkaidus·
@wordgrammer gonna need someone to chart the difference like a venn diagram
English
0
0
0
104
wordgrammer
wordgrammer@wordgrammer·
The Christian mystics describe a suite of immensely pleasurable, heightened states of awareness as different from the Jhanas as prayer is from meditation
English
11
3
104
7.3K
Kaidu
Kaidu@xkaidus·
@Hesamation bro really said "i'll build it myself" before finishing the video
English
0
0
1
1.1K
ℏεsam
ℏεsam@Hesamation·
Pewd did it again. now he open-sourced a self-hosted AI workspace. bro is building a CV harder than a CS undergrad looking for a job: > built a 10-GPU home rig > quantized giant LLM to run local > built ChatOS, local AI UI > added RAG/local memory > built “council” of AI models > built “swarm”, small models in parallel for data collection > fine-tuned a Qwen 32B-based coding model > donated compute from his GPU rig for protein folding research
ℏεsam tweet media
English
66
154
3K
193.4K