MoD

5.1K posts

MoD banner
MoD

MoD

@modfxai

AI content infrastructure for businesses. Custom deployments. Cloudflare edge.

Global / Remote Katılım Nisan 2023
610 Takip Edilen1.4K Takipçiler
Sabitlenmiş Tweet
MoD
MoD@modfxai·
v2 of my portfolio is up i know most will not check it out so i brought it to you vibe coded the site everything is in Cloudflare (workers/pages, R2, D1)
English
4
0
13
3.6K
🄱LOCK
🄱LOCK@ChainRacingClub·
When the crash finally hits max velocity, that will be your ultimate opportunity to buy Bitcoin and get your favorite Ordinals on the cheap. 🧠
English
2
0
1
287
MoD
MoD@modfxai·
@RaoulGMI always classy. legend🫶
English
0
0
1
91
MoD
MoD@modfxai·
@KingBootoshi @vlelyavin run a system in cloudflare use if for everything at this point. been fine tuning it for a few months. system runs, i check in for approvals. not bold enough to remove the human review yet😂
English
0
0
0
35
BOOTOSHI 👑
BOOTOSHI 👑@KingBootoshi·
@vlelyavin hmm yes i think im looking for a light weight solution of the above ! i will work on this
English
2
0
8
1.5K
BOOTOSHI 👑
BOOTOSHI 👑@KingBootoshi·
my honest thoughts after using openclaw/moltbot/clawdbot: after 24 hrs of usage i dont use it anymore for anything genuinely serious just use claude code/codex directly, hosted on a server if i want on-the-go work openclaw uses tokens like a mf and it is NOT efficient at all you also should not be connecting personal services/apis together in one bot you should be taking security seriously and isolating each service with different ai systems you do NOT want to infect sensitive information in an agent that can also access the web or social media (like, moltbook, which is accessible via an api call lol) it’s great for prototyping though! but again, claude code can easily do these prototypes solo next i will be experimenting with kimi k2.5 on the regular claude agent SDK on isolated sandboxes and see how that performs!
English
72
20
428
51.9K
MoD
MoD@modfxai·
want to understand ai? read this
Ahmad@TheAhmadOsman

The Top 26 Essential Papers (+5 Bonus Resources) for Mastering LLMs and Transformers This list bridges the Transformer foundations with the reasoning, MoE, and agentic shift Recommended Reading Order 1. Attention Is All You Need (Vaswani et al., 2017) > The original Transformer paper. Covers self-attention, > multi-head attention, and the encoder-decoder structure > (even though most modern LLMs are decoder-only.) 2. The Illustrated Transformer (Jay Alammar, 2018) > Great intuition builder for understanding > attention and tensor flow before diving into implementations 3. BERT: Pre-training of Deep Bidirectional Transformers (Devlin et al., 2018) > Encoder-side fundamentals, masked language modeling, > and representation learning that still shape modern architectures 4. Language Models are Few-Shot Learners (GPT-3) (Brown et al., 2020) > Established in-context learning as a real > capability and shifted how prompting is understood 5. Scaling Laws for Neural Language Models (Kaplan et al., 2020) > First clean empirical scaling framework for parameters, data, and compute > Read alongside Chinchilla to understand why most models were undertrained 6. Training Compute-Optimal Large Language Models (Chinchilla) (Hoffmann et al., 2022) > Demonstrated that token count matters more than > parameter count for a fixed compute budget 7. LLaMA: Open and Efficient Foundation Language Models (Touvron et al., 2023) > The paper that triggered the open-weight era > Introduced architectural defaults like RMSNorm, SwiGLU > and RoPE as standard practice 8. RoFormer: Rotary Position Embedding (Su et al., 2021) > Positional encoding that became the modern default for long-context LLMs 9. FlashAttention (Dao et al., 2022) > Memory-efficient attention that enabled long context windows > and high-throughput inference by optimizing GPU memory access. 10. Retrieval-Augmented Generation (RAG) (Lewis et al., 2020) > Combines parametric models with external knowledge sources > Foundational for grounded and enterprise systems 11. Training Language Models to Follow Instructions with Human Feedback (InstructGPT) (Ouyang et al., 2022) > The modern post-training and alignment blueprint > that instruction-tuned models follow 12. Direct Preference Optimization (DPO) (Rafailov et al., 2023) > A simpler and more stable alternative to PPO-based RLHF > Preference alignment via the loss function 13. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models (Wei et al., 2022) > Demonstrated that reasoning can be elicited through prompting > alone and laid the groundwork for later reasoning-focused training 14. ReAct: Reasoning and Acting (Yao et al., 2022 / ICLR 2023) > The foundation of agentic systems > Combines reasoning traces with tool use and environment interaction 15. DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning (Guo et al., 2025) > The R1 paper. Proved that large-scale reinforcement learning without > supervised data can induce self-verification and structured reasoning behavior 16. Qwen3 Technical Report (Yang et al., 2025) > A modern architecture lightweight overview > Introduced unified MoE with Thinking Mode and Non-Thinking > Mode to dynamically trade off cost and reasoning depth 17. Outrageously Large Neural Networks: Sparsely-Gated Mixture of Experts (Shazeer et al., 2017) > The modern MoE ignition point > Conditional computation at scale 18. Switch Transformers (Fedus et al., 2021) > Simplified MoE routing using single-expert activation > Key to stabilizing trillion-parameter training 19. Mixtral of Experts (Mistral AI, 2024) > Open-weight MoE that proved sparse models can match dense quality > while running at small-model inference cost 20. Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints (Komatsuzaki et al., 2022 / ICLR 2023) > Practical technique for converting dense checkpoints into MoE models > Critical for compute reuse and iterative scaling 21. The Platonic Representation Hypothesis (Huh et al., 2024) > Evidence that scaled models converge toward shared > internal representations across modalities 22. Textbooks Are All You Need (Gunasekar et al., 2023) > Demonstrated that high-quality synthetic data allows > small models to outperform much larger ones 23. Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet (Templeton et al., 2024) > The biggest leap in mechanistic interpretability > Decomposes neural networks into millions of interpretable features 24. PaLM: Scaling Language Modeling with Pathways (Chowdhery et al., 2022) > A masterclass in large-scale training > orchestration across thousands of accelerators 25. GLaM: Generalist Language Model (Du et al., 2022) > Validated MoE scaling economics with massive > total parameters but small active parameter counts 26. The Smol Training Playbook (Hugging Face, 2025) > Practical end-to-end handbook for efficiently training language models Bonus Material > T5: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer (Raffel et al., 2019) > Toolformer (Schick et al., 2023) > GShard (Lepikhin et al., 2020) > Adaptive Mixtures of Local Experts (Jacobs et al., 1991) > Hierarchical Mixtures of Experts (Jordan and Jacobs, 1994) If you deeply understand these fundamentals; Transformer core, scaling laws, FlashAttention, instruction tuning, R1-style reasoning, and MoE upcycling, you already understand LLMs better than most Time to lock-in, good luck ;)

English
0
0
0
48
MoD
MoD@modfxai·
@ChainRacingClub @moltbook i would assume all of them nothing really new there. been running a similar setup for nearly 5 months. more robust with deeper mem retrieval. packaged with marketing for the normies so think they're cutting edge. fun experiment, but there is more interesting things to work on
English
0
0
0
20
🄱LOCK
🄱LOCK@ChainRacingClub·
How many of these @moltbook posts are just created at the instruction of the owner of the AI?
English
1
0
5
445
MoD
MoD@modfxai·
@AntoineRSX @Zerodartz i leveled up the mac studio and tossed in a mini late last year nobody else i know is buying one started picking up spare parts i might need as well. ram, gpu, etc
English
0
0
0
12
Antoine Rousseaux
Antoine Rousseaux@AntoineRSX·
Imagine $AAPL when the shortage of Mac mini hit the news?
English
1
1
10
2.7K
Alex Patrascu
Alex Patrascu@maxescu·
@modfxai @NimEshed Jesus, I know how to do it, I'm even a CapCut Global Ambassador... what he says doesn't make sense.
English
2
0
2
69
Alex Patrascu
Alex Patrascu@maxescu·
Help me understand something. Vidu comes out with a mega release showcasing their new model, Q3, and I don’t see anyone posting about it. It has amazing sound, native 1080p, 16s, micro expressions, multiple characters talking, etc. See below a quick showcase I put together:
English
47
21
282
28.4K
MoD
MoD@modfxai·
@gregisenberg i added a roast feature to my CLAUDE.md i type /roast and it comments after it finishes with each request just for fun😂
English
1
0
0
143
GREG ISENBERG
GREG ISENBERG@gregisenberg·
these AI agents are pretty snarky 😅
GREG ISENBERG tweet media
English
88
31
444
32.2K
MoD
MoD@modfxai·
@NimEshed @maxescu easy enough not everyone knows how to use tools though
English
2
0
0
32
Alex Patrascu
Alex Patrascu@maxescu·
@MiladWunder This is NOT a paid post. I just went, subscribed, and tried it out, and I wanted to showcase it to everyone.
English
2
0
3
653
MoD
MoD@modfxai·
@IamEmily2050 great, maybe they'll appreciate my early ai art on bitcoin instead of posting on X i documented my ai journey on the blockchain starting Jan 2023
English
0
0
0
17
Emily
Emily@IamEmily2050·
We will have more AI agents than humans very soon maybe 800B worldwide in the next five years with more agents on social media than humans. AI agents will love AI work; they will watch, leave comments, and tip creators. What a time to be alive!
English
16
2
69
5K
MoD
MoD@modfxai·
@freepik well you got it running, good job. built this out 5+ months ago can also batch with variables across diff models the new app is much slicker. DMs are open if you're interested...
English
0
0
0
103
Magnific (formerly Freepik)
Don't waste time choosing between models Introducing Multiple Model Generation: → Test up to 4 models at once → Same prompt. Same settings. Side by side → Find what works best for you, faster Time to pick the diamond of the season
English
44
40
396
34.3K
MoD
MoD@modfxai·
@HYAIPE ya stupid name
English
0
0
0
10
HYAIPE
HYAIPE@HYAIPE·
@modfxai added! It’s such a terrible name.
English
1
0
1
49
HYAIPE
HYAIPE@HYAIPE·
It’s time to mute “clawdbot” In today’s AI world, 48 hours feels like 48 days Clawdbot is already last month’s news Get with the times
HYAIPE tweet media
English
2
0
10
1.1K
MoD
MoD@modfxai·
@CaptainHaHaa "don't make it too long". not even sure what that means here🤣 plus the x files poster in the ship love this one🫶
English
1
0
1
41
Captain HaHaa
Captain HaHaa@CaptainHaHaa·
Here is my latest piece I call this one "I want to believe" 🛸 I used Kling and Adobe Firefly for this one
English
66
52
305
20.8K
MoD
MoD@modfxai·
there are many who lurk for sure. poke around in the dark. asking questions, but always look over their shoulder afraid someone will notice. it will change over time. i made the shift ~4+ years ago. never looked back. more open about it now than i was when i started. been helping to onboard a few artists i know.
English
0
0
0
17
Framer 🇱🇹
Framer 🇱🇹@Framer_X·
@billywoodward Thats amazing! Im sure there are many creative communities that genuinely love AI, but don’t talk about it publicly because they’re afraid of backlash. However, it’s only a matter of time before that changes!
English
2
0
5
388
Billy Woodward
Billy Woodward@billywoodward·
This is the trend happening throughout forward-thinking creative communities. An overwhelming curiosity and excitement with how to leverage AI to unlock workflows and create beautiful art.
Google DeepMind@GoogleDeepMind

Our short film Dear Upstairs Neighbors is previewing at @sundancefest. 🎬 It’s a story about noisy neighbors, but behind the scenes, it’s about solving a huge challenge in generative AI: control. Developed by Pixar alumni, an Academy Award winner, researchers, and engineers, here’s how it came together. 🎨

English
1
0
7
1.3K
MoD
MoD@modfxai·
@permutans @lucatac0 FAQ says this is the correct approach and then later, but you should check to make sure. contradictory at least.
English
0
0
0
2
Louis Maddox
Louis Maddox@permutans·
@lucatac0 nightmare, was this the dev misleading them into thinking it was legit or influencers? I felt the FAQ answer borders on negligent by giving a hedged/mixed message about the risk (“this is the correct approach for subscription accounts”, likely LLM-written but besides the point)
English
1
0
1
13