Success
@SuccessVsdworld
47 posts

Silence Over Noise. ML || DL || Quant
Sierra Leone · Joined June 2024
48 Following · 58 Followers
Success
Success@SuccessVsdworld·
@belindmo Oh this is nice... Since you already tried the JSON route, have you tried adding WandB? It's supported in nanoGPT-style code. Every run auto-logs all hyperparams, val_bpb, and training curves, and the dashboard makes it quite easy to scan all 38 runs and spot silent/weird failures.
1 reply · 0 reposts · 3 likes · 41 views
Belinda
Belinda@belindmo·
On a whim, I decided to run an agent to optimize model pretraining using autoresearch: 38 hours, 38 experiments on Claude Opus 4.6, $173.15 in API credits. The question is... how do I spend the least amount of time validating that all the experiments ran properly? 🫠
Belinda tweet media
1 reply · 0 reposts · 8 likes · 447 views
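A minimal sketch of the kind of automated run validation the thread is asking about, assuming each experiment dumped a summary dict with its hyperparameters and a final `val_bpb` metric (the field names and thresholds here are illustrative, not from any specific tool):

```python
import math

def find_suspect_runs(summaries):
    """Flag runs whose logged summary looks like a silent failure."""
    suspects = []
    for name, s in summaries.items():
        bpb = s.get("val_bpb")
        if bpb is None or not isinstance(bpb, (int, float)) or math.isnan(bpb):
            suspects.append((name, "missing or non-numeric val_bpb"))
        elif bpb <= 0:
            suspects.append((name, "non-positive val_bpb (likely a logging bug)"))
    return suspects

runs = {
    "run_01": {"lr": 3e-4, "val_bpb": 0.92},
    "run_02": {"lr": 1e-3, "val_bpb": float("nan")},  # silent failure
    "run_03": {"lr": 6e-4},                           # crashed before eval
}
print(find_suspect_runs(runs))
```

A few assertions like this over all 38 summaries would take minutes to scan, versus hours of manual checking.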
Success retweeted
Jenny Zhang
Jenny Zhang@jennyzhangzt·
Introducing Hyperagents: an AI system that not only improves at solving tasks, but also improves how it improves itself.

The Darwin Gödel Machine (DGM) demonstrated that open-ended self-improvement is possible by iteratively generating and evaluating improved agents, yet it relies on a key assumption: that improvements in task performance (e.g., coding ability) translate into improvements in the self-improvement process itself. This alignment holds in coding, where both evaluation and modification are expressed in the same domain, but breaks down more generally. As a result, prior systems remain constrained by fixed, handcrafted meta-level procedures that do not themselves evolve.

We introduce Hyperagents – self-referential agents that can modify both their task-solving behavior and the process that generates future improvements. This enables what we call metacognitive self-modification: learning not just to perform better, but to improve at improving.

We instantiate this framework as DGM-Hyperagents (DGM-H), an extension of the DGM in which both task-solving behavior and the self-improvement procedure are editable and subject to evolution. Across diverse domains (coding, paper review, robotics reward design, and Olympiad-level math solution grading), hyperagents enable continuous performance improvements over time and outperform baselines without self-improvement or open-ended exploration, as well as prior self-improving systems (including DGM). DGM-H also improves the process by which new agents are generated (e.g. persistent memory, performance tracking), and these meta-level improvements transfer across domains and accumulate across runs.

This work was done during my internship at Meta (@AIatMeta), in collaboration with Bingchen Zhao (@BingchenZhao), Wannan Yang (@winnieyangwn), Jakob Foerster (@j_foerst), Jeff Clune (@jeffclune), Minqi Jiang (@MinqiJiang), Sam Devlin (@smdvln), and Tatiana Shavrina (@rybolos).
Jenny Zhang tweet media
157 replies · 661 reposts · 3.6K likes · 497.9K views
òdòdó 🌹
òdòdó 🌹@adedola_csv·
My first data engineering role rejection letter. Let’s have it!!! 😂😂😂
1 reply · 0 reposts · 1 like · 55 views
Claude
Claude@claudeai·
Introducing Code Review, a new feature for Claude Code. When a PR opens, Claude dispatches a team of agents to hunt for bugs.
2.1K replies · 5.1K reposts · 62.5K likes · 23.5M views
Success
Success@SuccessVsdworld·
@xeraa @elastic @elastic_devs Super easy... I used 3 YAML workflows (~30 lines each); Agent Builder auto-picks them up as tools. Best part: Elasticsearch IS the queue, no Kafka needed. Workflows write to indices, runners poll, and the agent queries with ES|QL. Took roughly 4 hours to fully finish 👀
0 replies · 0 reposts · 2 likes · 31 views
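The queue pattern described above (workflows append task documents to an index, runners poll for unclaimed work) can be sketched with an in-memory stand-in for the index. The field names (`status`, `claimed_by`) are illustrative, not Elasticsearch APIs; a real setup would use the Elasticsearch client with optimistic concurrency control (`if_seq_no`/`if_primary_term`) instead of a Python list:

```python
# Toy model of "Elasticsearch IS the queue": producers write task docs,
# runners poll for pending docs, claim them, and mark them done.
tasks_index = []  # stand-in for an ES index of task documents

def enqueue(task_id, payload):
    tasks_index.append({"id": task_id, "payload": payload, "status": "pending"})

def poll_one(runner):
    """A runner polls for the first unclaimed task and claims it."""
    for doc in tasks_index:
        if doc["status"] == "pending":
            doc["status"] = "claimed"
            doc["claimed_by"] = runner
            return doc
    return None

def complete(doc):
    doc["status"] = "done"

enqueue("t1", {"action": "reindex"})
enqueue("t2", {"action": "report"})
doc = poll_one("runner-a")
complete(doc)
print([d["status"] for d in tasks_index])  # ['done', 'pending']
```

The agent side would then be an ES|QL query over the same index (e.g. filtering on `status`) rather than a separate broker.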
BioAIDevs
BioAIDevs@BioAIDevs·
Meet BIOS, an AI Scientist built to orchestrate complex biomedical research.
• Global SOTA on Data Analysis Benchmarks: BixBench 48.78% open-answer, 55.12% multiple-choice + refusal, 64.39% multiple-choice (no refusal), outperforming systems like Edison Scientific and Kepler.
• Human-in-the-Loop or Autonomous Mode: Intermediate checkpoints let researchers guide investigations mid-flight as insights emerge. No more waiting hours for batch runs + reruns to get results. Or, run in fully autonomous mode for extended investigations.
• Persistent World State: Rather than losing context as conversations grow, world state ensures investigations build on insights within each research cycle and across sessions.
• Subagent Swarm: BIOS orchestrates subagents specializing in research functions (Literature Review, Data Analysis, Novelty Detection) and, soon, research domains (microbiology, longevity, genomics).
BIOS is available now in Beta with free + paid tiers, exclusive launch pricing and, for a limited time, free full access to academic users with a .edu email address. Pro, Researcher and Lab subscription tiers offer discounted packages on monthly credits. Our usage-based pricing is competitive and in some cases significantly cheaper than leading scientific agents. Try BIOS and read our paper in the links below ↓
11 replies · 37 reposts · 176 likes · 66.8K views
Chisom Rutherford
Chisom Rutherford@ruthefordml·
Today I defended my final-year research project. Over five months, my research group studied 213 doctors to determine their knowledge of AI and how they use it in their practice. We got very surprising results! Looking forward to conducting much more research in health AI.
Chisom Rutherford tweet media
85 replies · 64 reposts · 576 likes · 18.8K views
Tereza Tizkova
Tereza Tizkova@tereza_tizkova·
Launch of Open Lovable is here! Proud that @e2b AI cloud is powering this product 🫡 Congrats to the team!
Developers Digest@devdigest

Open Lovable is live and now has 6,000+ GitHub stars 💜 💙 It's a Next.js app I built that instantly reimagines any website and generates full React apps in seconds. Powered by @firecrawl, @GroqInc, @e2b and more. Here's a complete breakdown of the project in 4 minutes👇

Prague, Czech Republic 🇨🇿
3 replies · 2 reposts · 36 likes · 1.9K views
Success
Success@SuccessVsdworld·
@ruthefordml Ok but predicting multiple tokens at once... how do they keep it from going off track?
0 replies · 0 reposts · 0 likes · 7 views
Chisom Rutherford
Chisom Rutherford@ruthefordml·
Traditional language models create text one token at a time, which can be slow. Apple’s new “multi-token prediction” approach lets models predict several tokens at once, making them faster and more efficient.
Chisom Rutherford tweet media
2 replies · 1 repost · 12 likes · 518 views
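A toy sketch of the multi-token idea, assuming k independent output heads reading the same hidden state (shapes only — the sizes and head structure here are illustrative, not Apple's actual method):

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, vocab, k = 16, 100, 4      # illustrative sizes

h = rng.normal(size=(d_model,))     # shared hidden state at the current position
heads = rng.normal(size=(k, vocab, d_model)) * 0.02  # one head per future offset

# Head i scores the token i steps ahead; argmax gives k draft tokens at once.
logits = heads @ h                  # shape (k, vocab)
draft = logits.argmax(axis=-1)      # k drafted token ids
print(draft.shape)                  # (4,)
```

This also answers the "how does it stay on track?" question above: such systems typically verify the drafted tokens against the model's ordinary one-token-at-a-time predictions (as in speculative decoding) and discard everything after the first disagreement, so correctness is preserved and only speed changes.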
🔧∑Y
🔧∑Y@yusufabol_·
@SuccessVsdworld Yeah, two projects: 1. A multi-class plant disease detector with LLM treatment assistance. 2. A smart OCR system for invoice data.
1 reply · 0 reposts · 1 like · 51 views
🔧∑Y
🔧∑Y@yusufabol_·
Picked up a Computer Vision course. Halfway in, it feels like a Photoshop tutorial (hearing words like resizing, sharpening, Gaussian blur, filters, textures, and so on). The only difference? I am coding it instead of clicking it.
1 reply · 0 reposts · 5 likes · 290 views
Success
Success@SuccessVsdworld·
@samireey Since you're diving into finetuning + evals, you might wanna peek at Unsloth... crazy efficient, esp. for local fine-tuning👀
0 replies · 0 reposts · 0 likes · 20 views
samir
samir@samireey·
ML grind day 138/365🎯 (finetuning llms)
> studied llm evaluation techniques/models
> explored llm APIs (openrouter, togetherai)
> dove a little into DoRA, reward modeling + RLHF
> started collecting project ideas [will start with guided ones]
samir tweet media
samir@samireey

ML grind day 137/365🎯 (finetuning llms)
> visualized how LoRA saves memory
> saw how quantization handles hardware limits
> used QLoRA + PEFT to finetune on a normal GPU (not my code btw)
> read a few book pages

3 replies · 3 reposts · 24 likes · 3.8K views
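The "how LoRA saves memory" point in the grind log comes down to a parameter count: instead of training a full d×k weight delta, LoRA trains two low-rank factors of rank r. A quick check with illustrative sizes (not from any specific model):

```python
d, k, r = 4096, 4096, 16        # weight matrix shape and LoRA rank (illustrative)

full_delta = d * k              # trainable params to fully fine-tune one matrix
lora_delta = r * (d + k)        # LoRA factors: A is d×r, B is r×k
print(full_delta, lora_delta, round(full_delta / lora_delta, 1))
# 16777216 131072 128.0
```

A ~128× reduction in trainable parameters per matrix is what lets optimizer states fit on a single consumer GPU; QLoRA then additionally quantizes the frozen base weights.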
MUYIWA
MUYIWA@Muyiiwaa·
Can you implement the transformer architecture from scratch in plain NumPy?
5 replies · 2 reposts · 17 likes · 1.2K views
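For anyone taking up the challenge, the core block is only a few lines. Here is a minimal single-head scaled dot-product attention in plain NumPy (no mask, no multi-head split — a sketch of the central formula, not a full transformer):

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)   # subtract max for numerical stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(Q, K, V):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)
    weights = softmax(scores, axis=-1)        # each row sums to 1
    return weights @ V, weights

rng = np.random.default_rng(0)
Q = rng.normal(size=(5, 8))                   # 5 positions, d_k = 8
K = rng.normal(size=(5, 8))
V = rng.normal(size=(5, 8))
out, w = attention(Q, K, V)
print(out.shape)                              # (5, 8)
```

From here, a full transformer adds the multi-head reshape, a position-wise MLP, residual connections, and layer norm — all equally expressible as NumPy matmuls and elementwise ops.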