Sup*

1.7K posts

@aibenchmarking

AI | Model Benchmarking & evals | Hackathons & personal projects | Building curative intelligence | I run an agency btw but this profile ain't about that

Vibing · Joined August 2024
351 Following · 78 Followers
Sup*@aibenchmarking·
@rive_app @contra Missed out on this one, and now that I see the entries 😅 the chances were slim.
Rive@rive_app·
Incredible submissions for the Rive HMI challenge. Winners this week with @contra
Csaba Kissi@csaba_kissi·
Unpopular opinion: AI fixes the code that AI wrote.
Sup*@aibenchmarking·
@iammSunnyBoss Okay this made me think and guess what...
GIF
Sup*@aibenchmarking·
@richiemcilroy Why do I forget all this before buying more credits!!!!!!
Richie - oss/acc@richiemcilroy·
claude is always down just stick it on a $5 hetzner server and it's good to go forever 😤
Sup*@aibenchmarking·
@mehulmpt Waste of credits
GIF
Sup*@aibenchmarking·
@boneGPT Their Marketing team makes it so tough to take anything related to them seriously
Sup*@aibenchmarking·
@KaranVaidya6 Dude 1000 businesses spending over $1 million p.a. is both diabolical and crazy cool at the same time ⚡
Sup*@aibenchmarking·
@sketch [Something abusive]
Sup*@aibenchmarking·
@sketch Get me the Business subscription 👀 Or my designers will make me pay for it (by force 😶‍🌫️)
Sup*@aibenchmarking·
@diegozaks Always amazing to see posts like these 👀
Sup*@aibenchmarking·
@garrytan I believe it is more about personalization and the 'ecosystem chaining'. An upside for one and a major downside for the other. It'll be interesting to see who gets the W.
Garry Tan@garrytan·
Claude Cowork is powerful but right now nothing compared to OpenClaw. It’s a race. May the best product win.
Hosea@hoseakidane_

@garrytan Gary, have you tried claude cowork for the same tasks you would want openclaw for? A lot of the features overlap now like using iMessage, scheduling, using computer use etc

Sup*@aibenchmarking·
@MarkKnd I knew it ⚡
Sup*@aibenchmarking·
@cjzafir This is gold, I am already into fine-tuning. The rest sound tempting and promising.
CJ Zafir@cjzafir·
@aibenchmarking Pick anything related to OS models. Consulting, fine tuning, synthetic dataset preparation, on premise deployment. Anything.
CJ Zafir@cjzafir·
I'm doing that already.
> took OS model
> fine tuned it on (80M dataset)
> now running 24/7 on my macbook
> with 98% accurate tool calls
> it designs its own workflows
> can talk, research, automate, save
> and much more
Launching soon.
Marc Andreessen 🇺🇸@pmarca

Magical OpenClaw experiences that use frontier models cost $300-1,000/day today, heading to $10,000/day and more. The future shape of the entire technology industry will be how to drive that to $20/month.

Sup*@aibenchmarking·
@cjzafir That's interesting. I am excited for all the updates you post for this 👀
CJ Zafir@cjzafir·
@aibenchmarking Tool calling, instruction following, and low tier reasoning.
CJ Zafir@cjzafir·
I've been fine-tuning open source models lately (again), and I'm loving it. It's the first time I'm feeling that I'm in control. Models are learning super fast, and on benchmarks they are beating Sonnet 4.5 and Gemini 3.0 easily.

SLMs (Small Language Models) are the real unlock. They can run on any consumer device. No API, no cloud, no privacy issues. And they can beat SOTA models on niche-specific use cases. I've done that myself.

Enterprise will throw bundles of cash at specialized SLMs because of the control and power they bring.

The only bottleneck is cloud compute (GPUs are not available on major inference providers). I'm considering buying my own GPU rig because GPU prices will definitely go up.

Also, I felt that there's too much technical stuff that no one is simplifying for beginners (how to fine-tune models exactly). So, I'll be doing that. I'll share my findings, struggles and a few wins here.

Time to come out of stealth. Join the ride.
Sup*@aibenchmarking·
@dakshpixelup "AI is the new hype." People have been saying the same thing for years now. What, in your opinion, is the reason the industry keeps flourishing even at this cash burn rate? 👀
Daksh Aswal@dakshpixelup·
300B poured into startups last quarter. 80% of it went to AI. That's more AI companies competing for the same enterprise contracts than any quarter in history. With everyone selling the same promise, here’s the truth nobody wants to admit: The best product doesn’t always win. The best perceived product does. Enterprise doesn't pick features. They pick confidence. And confidence = brand. Product opens the door. Brand closes the deal.
Sup*@aibenchmarking·
@skirano @MagicPathAI Ah, when creating a component we do have the option to select mobile devices to adjust the canvas size, but the preview (new tab) opens in desktop mode. I am suggesting a dedicated mobile prototype view in which the user can try out the component.
Pietro Schirano@skirano·
We pushed a cool update in @MagicPathAI. Now when sharing a link to a public file, on mobile you see a "feed" of all the prototypes in there. It's such a cool way to share projects with your team and clients, and truly only possible on our platform.