Wing Chan

342 posts

Wing Chan banner
Wing Chan

Wing Chan

@sourceful_wing

I help early-stage CPG founders ship product faster. Research @sourceful. Product packaging helps you stand out and tell your brand story.

انضم Aralık 2024
141 يتبع50 المتابعون
Heba AI
Heba AI@SubarcticRec·
@VadimStrizheus Deploy the model to RunPod and you control the limits with your wallet.
English
1
0
1
122
Vadim
Vadim@VadimStrizheus·
What’s all this hype about GLM 5.2 being AGI?? I’m constantly getting rate limits and errors.
English
18
1
61
7.7K
Chetaslua
Chetaslua@chetaslua·
🚨 Google New Image Model > Instant-ramen (successor of nano-banana) Ramen is cooked time to serve soon , we will share results as soon as we get hands on it 😉
Chetaslua tweet media
English
40
74
952
132.9K
Wing Chan
Wing Chan@sourceful_wing·
@reach_vb So impressed. When codex mobile first came out I was pretty worried due to the lag /connection issues, it felt really unfinished. But now it is rock solid. Congrats on finding the path through, probably helped by a mythical beast of a model yet to be unleashed
English
0
0
0
109
Vaibhav (VB) Srivastav
Codex Mobile updates: - browse workspace files and link paths into prompts - pick a workspace folder when starting a new thread - expand or collapse all diffs while reviewing changes - approve MCP actions for one chat or across chats - LaTeX rendering in Codex messages and plans - clearer status for running threads, queued prompts, side chats, and subagents - better pairing, onboarding, reconnects, host refresh, and thread performance - improved Codex profile sharing, activity history, settings, transcript layout, and assistant actions - smoother /goal workflows from mobile - fixes for stuck swipes, duplicate messages, subagent rows, and misleading connection errors
Vaibhav (VB) Srivastav tweet media
English
35
4
207
18.5K
Wing Chan
Wing Chan@sourceful_wing·
@MicahBerkley @SubarcticRec I'm excited to move to a benchmark that is the cost to finish a project to MVP launch state. And I think for most cases the overall cost will be dominated by the human time spent and not the token cost, unless you are using fable etc. Nothing great can really be one shot.
English
1
0
1
43
Wing Chan
Wing Chan@sourceful_wing·
@SubarcticRec Simple text it does great. Struggles if more than a few words probably model size issue
English
1
0
2
11
Heba AI
Heba AI@SubarcticRec·
Have not even tested before, but Flux.2 Flash and ZiB both do correct text.
Heba AI tweet mediaHeba AI tweet media
English
1
0
0
29
Wing Chan
Wing Chan@sourceful_wing·
Day 1 of asking @zoink if he will consider adding RF2.5 to Figma Weavy and Figma. Especially now that I know his email inbox is not the right route.
Riverflow@riverflow_ai

Riverflow 2.5 Pro just topped the charts. Across all three categories on @Designarena's global benchmarking - Image, Graphic Design and Image Editing - Riverflow 2.5 Pro came in at #1 Design Arena is powered by real user voting, the people who we created it for. The results speak for themselves: #1 Image #1 Graphic Design #1 Image Editing #3 Logo Beating GPT Image 2, Gemini 3 Pro, Ideogram and every other model in the field. Riverflow 2.5 Pro is available now on OpenRouter, Runware and Replicate. Or get in touch with us directly.

English
0
1
1
447
Wing Chan
Wing Chan@sourceful_wing·
@ArtificialAnlys Great to see the care and attention in keeping these single number measures refreshed. Thanks for sharing
English
0
0
1
57
Artificial Analysis
Artificial Analysis@ArtificialAnlys·
Following up on our Intelligence Index v4.1 release yesterday, in the video below, Daniel from our team shares a short overview of what's changed: 1. Three upgraded evaluations: Terminal-Bench 2.1, τ³-Bench Banking and GDPval-AA v2 2. Cost, time, and tokens per task: Understand the cost, time, and tokens of tasks across our Index and for individual evals, and how these trade off against Intelligence 3. Cached input token reporting: We now report the amount of cached tokens a particular model uses and how this influences cost
English
4
5
78
12.2K
Wing Chan أُعيد تغريده
Design Arena
Design Arena@Designarena·
BREAKING: Riverflow Pro 2.5, a reasoning model by @riverflow_ai that calls a mix of proprietary and open diffusion models, has scored 1st on Image Arena (Models + Routers), 1st on Graphic Design Arena, and 1st in Image Edit (Models + Routers). Riverflow Pro 2.5 averages 10 Elo points above GPT Image 2 from @OpenAI in Image, Image Editing, and Graphic Design. It also establishes Pareto frontiers across Image, Image Editing, and Graphic Design in Preference vs. Speed. Congratulations to the @riverflow_ai team on the launch!
Design Arena tweet media
English
10
28
298
25.2K
Grace Li
Grace Li@grx_xce·
BREAKING: Le Chaton Fat has fully saturated our benchmark. We are at a loss for words. In response, we are retiring Design Arena. Congratulations to the @MistralAI team, and thanks for putting us on vacation.
Grace Li tweet media
English
46
55
1.2K
91.6K
Wing Chan
Wing Chan@sourceful_wing·
This is a high difficulty problem to solve. It's another form of alignment. To help me make my code secure and good, I need the model to know how to break it. To stop it from attacking others, I need the model to resist me. But it depends who is asking and that's non trivial. Think about the surface area, with agents, context, tool calling, harnesses etc. How do you verify? How do I trust the result of a tool call? We end up in the same place, training the model to make moral (i.e. non trivial non binary grey area) decisions based on some imperfect principles and imperfect data. data. The hardest part of all this is that since model training now exists across a broad array of actors, there always exists the incentive to offer a version slightly more permissive and greedy.
Colin | clerk.com@tweetsbycolin

The jailbreak we found convinced Fable that it wrote our code, so it was willing to look for issues Not too surprising if there were other vectors besides the one we found. Must be hard to have an LLM that can author secure code but not check if “other” code is secure

English
0
0
0
52
Salma
Salma@Salmaaboukarr·
this claude fable 5 drama made me realise how important it is to have a backup plan and not rely on these labs! time to go back to LOCAL MODELS + a v good harness the models i use -qwen -kimi -ds
English
5
0
27
5.7K
Grace Li
Grace Li@grx_xce·
He still hasn’t woken up yet
Grace Li tweet media
English
4
0
30
3.6K
nic
nic@nicdunz·
1. i did not get the option to save this reset and not use it 2. why is my usage already a few percents down when i havent even used it yet?
nic tweet media
English
5
0
44
6.8K
jack friks
jack friks@jackfriks·
can't believe this was only 18 months and 8 weeks ago...
jack friks tweet media
English
104
8
674
52.9K
Sauers
Sauers@Sauers_·
Big if true
Sauers tweet media
English
247
106
4.7K
2.2M
Ronan
Ronan@Ronanchamberss·
After speaking with UK AI Minister @KanishkaNarayan today on @etnshow, I have it on good authority that, we are indeed, so back. Best, R
English
2
3
78
4.4K
Wing Chan
Wing Chan@sourceful_wing·
@petergostev Should get 10x points for loc reduction if all tests pass. Code golfing is fun but the principle applies. Verbosity is a feature of bad compression.
English
0
0
1
87