Lotto

25.7K posts

Lotto banner
Lotto

Lotto

@LottoLabs

mlai side gig / building models for my kids

Canada Katılım Mayıs 2019
1K Takip Edilen3.1K Takipçiler
Mad ML scientist
Mad ML scientist@HououinTyouma·
@shinboson uploaded whole genome of me and my ex girlfriend and asked ChatGPT to grow our child's brain in a vat and connect it to play animal crossing, pokemon, and doom
English
1
0
1
15
José H A Monteiro
José H A Monteiro@jhmonteiro·
@LottoLabs I'm a simp with 4070 mobile 8Gb. OminiCoder-9b 4q_k_m ggfu full VRAM 192K ctx FTW! 35t/s. Rock solid.
English
1
0
1
12
Lotto
Lotto@LottoLabs·
Qwen 27b remains my favorite still, gonna do a write up on all the models tested so far
English
20
2
166
6.5K
Lotto
Lotto@LottoLabs·
@sergeykarayev Meh get me a phat card and a couple instances of qwen27b and docker then tell me that’s not the future
English
0
0
0
9
Sergey Karayev
Sergey Karayev@sergeykarayev·
Running agents locally is a dead end. The future of software development is hundreds of agents running at all times of the day — in response to bug alerts, emails, Slack messages, meetings, and because they were launched by other agents. The only sane way to support this is with cloud containers. Local agents hit a wall quickly: • No scale. You can only run as many agents (and copies of your app) as your hardware allows. • No isolation. Local agents share your filesystem, network, and credentials. One rogue agent can affect everything else. • No team visibility. Teammates can't see what your agents are doing, review their work, or interact with them. • No always-on capability. Agents can't respond to signals (alerts, messages, other agents) when your machine is off or asleep. Cloud agents solve all of these problems. Each agent runs in its own isolated container with its own environment, and they can run 24/7 without depending on any single machine. This year, every software company will have to make the transition from work happening on developer's local machines from 9am-6pm to work happening in the cloud 24/7 -- or get left behind by companies who do.
English
25
4
68
4.5K
Lotto
Lotto@LottoLabs·
@Forgework_ Aren’t you a business already 😂
English
1
0
0
10
Forgework
Forgework@Forgework_·
@LottoLabs Been working on this as well! Excited to see how it goes!
English
1
0
1
10
Lotto
Lotto@LottoLabs·
I’m gonna start a business w/ Hermes agent + 27b and see how far we can get, I bet we can have an autonomous business built in a day or two
English
2
0
25
273
Lotto
Lotto@LottoLabs·
Hermes agent + qwen 3.5 27b on a 3090 All local, your data stays with you, runs 24/7, create tools and reporting to run autonomously while you sleep, access through telegram, WhatsApp, discord etc., simple set up, avid group of technical people developing it, mlops focused but expandable to any needs, unless you’re needing sota coding this stack covers you for everything and more.
English
0
0
0
118
Tyler
Tyler@TylerDurden·
Tell me why I need Claude or a clawd bot? I don’t think I need more than grok currently which is a search engine on steroids. Convince me otherwise and I’ll send you $500 in bitcoin.
English
197
3
284
29.4K
lycaon
lycaon@lyc_aon·
@LottoLabs burner computer? fuck that we do it live
English
0
0
3
171
Lotto
Lotto@LottoLabs·
@yacineMTB We need the great agent browser condom more than ever
English
0
0
1
61
kache
kache@yacineMTB·
"we are going to stop bots on x" Refresh the timeline 90% LLM generated anxiety bait from posters outside of the western world Sweet
English
20
3
115
2.5K
Lotto
Lotto@LottoLabs·
@duganist I use my local desktop to serve the 27b and have hermes on a separate dev computer then ssh into it w/ tailscale or use telegram
English
0
0
1
43
Patrick ₿ Dugan _________________
@LottoLabs I'd like to host 27B on a remote node and have Hermes tag into it instead of it needing to be local or an approved API model. Or SSH in a terminal and run Hermes in that terminal, I suppose is fine.
English
1
0
1
52
Dan Woods
Dan Woods@danveloper·
I actually hate MoE's now. Not just because they're difficult to hardwaremaxx, but it's actually a really dumb architecture (no offense to anyone). They naively approximate a graph without any of the benefit of graph traversal. We're sending a blind person down a path and we've trained something to nudge them onto a different path to get to the end, but it doesn't know the next part of the map until the person has walked down the street. I hate this.
English
8
2
48
6.1K
Lotto
Lotto@LottoLabs·
@jdmsec Depends what your doing, it’s not going to replace opus but if you have a defined scope and prompt it can work well
English
1
0
1
122
jay(dm)
jay(dm)@jdmsec·
@LottoLabs hows its orchestration? opus 4.6 is expensive w Hermes, but afraid Im going to lose massively on orchestration/reasoning if I move to a local model
English
2
0
1
133
behind_seven_proxies
behind_seven_proxies@user351513·
@agentsdotmd @LottoLabs You can't just give it a general mission and 500 files and have it figure everything out for you. But my theory is that with the right tooling and prompt curation you can get close enough for most work.
English
1
0
1
15
Cynical Optimist
Cynical Optimist@ChemPhysMajor·
@LottoLabs @johnhanacek @sudoingX I had issues with the original Qwopus at 27B. But Jackrong just released a v2 at 4B and 9B with way more Claude training data and actual benchmarks. The 9B v2 sounds like it's pretty much just as accurate but the thinking is effective at dramatically shortening the thinking.
English
1
0
2
22
Lotto
Lotto@LottoLabs·
Gonna try GLM 4.7 flash w/ Hermes + 3090 Probably similar to the other 30b a3b models but it is what it is Running out of models to fit on the 3090 w/ tool calls
English
10
0
39
2.6K
Lotto
Lotto@LottoLabs·
@DoDataThings You using it in an agent harness or just normal chat inference?
English
1
0
0
68
Winston B.
Winston B.@DoDataThings·
@LottoLabs The Qwen team did really well on the 3.5 series. Enjoying it so far
English
1
0
1
76
Lotto
Lotto@LottoLabs·
@mweinbach Or small models that think less
Kyle Hessling@KyleHessling1

@LottoLabs I just turned off thinking in the 27B today on my Hermes agent, doing some Apple development, because thinking times at long context were insane. But DUDE I kid you not it got smarter somehow!

English
0
0
3
51
Lotto
Lotto@LottoLabs·
No definitely not but like for 99% of general population it’s enough, for devs it’s usable, it kinda reminds me of sonnet 3.5 where it wasn’t insanely smart but it was really sticky to the prompt and steered easily. I think it’s probably smarter than 3.5 but that’s the vibes I get and 3.5 was famous for a reason
English
1
0
4
194