tarun bana

549 posts

@iabom

Joined May 2010

230 Following · 31 Followers

tarun bana@iabom·5h

@MaziyarPanahi Insane levels of surveillance tech, i guess u still need the satellite, but government already got that

English

352

Maziyar PANAHI@MaziyarPanahi·6h

Gemma 4 analyzes the video. Generates key questions. Calls Falcon Perception. "Find all the people." 156 found. "Detect only white cars." 8 found. A 26B model is running agentic multi-QA vision orchestration. The models are running locally on a MacBook with MLX. No API.

English

670

55.8K

tarun bana@iabom·9h

@bnjmn_marie Can't wait to see this. I feel like gemma 4 should be much better, in my test it's the only model that has passed it that's not paid and from big 3 main labs. I hope there's a section where you have your own opinion too, beyond the benchmarks. Based on your testing.

English

380

Benjamin Marie@bnjmn_marie·10h

Gemma 4 31B vs Qwen3.5 27B, Thinking Enabled I ran multiple benchmarks multiple times. Gemma 4 31B looks better and more stable (smaller accuracy variations between runs, which makes sense since it generates shorter sequences). I'll publish my full results and analysis on my blog later this week (link in profile).

English

237

12.5K

tarun bana@iabom·15h

@rodrigosaure this is blender? wtf

English

322

Rodrigo Sauré@rodrigosaure·18h

Kogar and the Old Man. A few weeks ago, Facebook’s algorithm showed me an epic illustration by John McKenzie. His style is quite unique, so I recommend checking out John’s account to appreciate it.

English

539

5.6K

121K

tarun bana@iabom·1d

@Kazi5isAlive Insane either way.

English

Kazi@Kazi5isAlive·1d

@iabom I mean, not all text alone.

English

Kazi@Kazi5isAlive·1d

More crab

English

1.4K

tarun bana@iabom·1d

@Kazi5isAlive Yeah, but image is also generated using text, or are u making images in photoshop ?

English

tarun bana@iabom·1d

@0xCVYH I read through the Hugging Face page, I don't get it. I could ask an LLM, but I'm sure those dumb things will just hallucinate an explanation. What is this? I'm guessing something to do with low VRAM, or faster with the same VRAM, something like this.

English

376

CV.YH@0xCVYH·1d

I quantized Wan2.2 Animate 14B for video-to-video with hlwq. Image animation model, 8-bit, Apache 2.0. Almost a thousand downloads on HF without my having promoted it beforehand. The base is the official Wan-AI model. huggingface.co/caiovicentino1…

Portuguese

184

8.3K

tarun bana@iabom·1d

@janusch_patas LMFAO

English

332

MrNeRF@janusch_patas·1d

Left: MRNF. Right: Mykonos densification, which adds another ~+0.8 dB PSNR. For the sake of humanity, and to help delay its downfall, this one will not be released publicly.

English

10.7K

tarun bana@iabom·1d

@Sauers_ Yup, same. And my use case is not even that complex. But same experience, it needs harness, well I'm guessing. I'm gon know soon I'm building harness now.

English

Sauers@Sauers_·2d

A few weeks ago I had GPT 5.4 derive and prove dozens of genetics theorems over multiple days. It did manage to write correctly-compiling math, but there were subtle improper assumptions or specifications leading to the results being useless and disconnected from reality

Mira@_Mira___Mira_

2 days ago GPT 5.4 ran for 12 hours proving 107 different theorems while I was sleeping. Yesterday it ran for 14 hours formalizing Shor's Algorithm. I think that needs another day or two to finish.

English

3.1K

tarun bana@iabom·2d

@htihle Yes. Makes sense. Thanks for benchmark.

English

196

Håvard Ihle@htihle·2d

@iabom When running locally, most people use quants, so when testing a model that is mostly run locally, it makes sense to use a quant. That is my reasoning anyway.

English

954

Håvard Ihle@htihle·3d

Gemma 4 31b scores 52.3% and is the strongest open model on WeirdML, ahead of GLM 5 and gpt-oss-120b. This score is comparable to o3 and gemini 2.5 pro, and well ahead of qwen 3.5 27b at 39.5%. Gemma 4 is also significantly cheaper than other models with the same score. I ran this locally through ollama, with a 4-bit quant (q4_K_M). Full precision might score even better. The costs assume $0.14/$0.40/M.

Håvard Ihle@htihle

WeirdML v2 is now out! The update includes a bunch of new tasks (now 19 tasks total, up from 6), and results from all the latest models. We now also track api costs and other metadata which give more insight into the different models. The new results are shown in these two figures. The first one shows an overview of the overall results as well as the results on individual tasks, in addition to various metadata. The second figure shows cost vs performance and shows a clear scaling with better results for higher costs. We also have a very varied pareto frontier with 11 models from 6 different companies having the best accuracy for a given cost for at least some of the cost range. Grok 3, Claude Opus 4 and GPT 4.5 are the ones that underperform for their costs, while Gemini pro and o3 pro have the best results at the highest costs. Qwen3 30B3A, grok 3 mini and deepseek R1 also each represent a good chunk of the pareto frontier.

English

301

60.9K

tarun bana@iabom·2d

@francoisfleuret Anything verifiable and instant output that can be evaluated= AI good. Anything else = horseshit covered in gold foil. People actually take advice from these things. LMAO

English

François Fleuret@francoisfleuret·3d

> LLM write a very convincing explanation hinging on [WHATEVER] > me asking why [WHATEVER] would make any fucking sense > LLM apologizes and goes into self-criticism session

English

2.2K

tarun bana@iabom·2d

@cocktailpeanut Veo?

cocktail peanut@cocktailpeanut·2d

Most U.S. generative media AI labs that raised hundreds of millions or billions seem to have quietly given up on building the best foundation model---They pivoted to consumer apps---Is there any AI lab left, actually focused on building the best foundational generative AI model?

English

1.1K

tarun bana@iabom·2d

@Yokohara_h Yes, and not only that, it all looks the same. There is a sort of AI likeness; not that it's low quality, just not distinct enough in the art somehow. It's the same thing that happens to every Unreal Engine 5 game: they all just feel the same somehow.

English

192

Hirokazu Yokohara@Yokohara_h·3d

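The problem with seedance2.0 and the other video generators is that they are all expensive. At this rate, only moderately well-off middle-aged guys can afford to play around with video generation.

Japanese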

14.7K

tarun bana@iabom·2d

there is a very good chance you will be beating it to a generated woman in a few years. bcz real stuff's dopamine will be lower than the generated perfection.

English

tarun bana@iabom·2d

@Alibaba_Wan Exhausted prince Hamlet sitting on a stone floor, staring obsessively at a skull between his boots. Discarded sword. Claustrophobic chamber. Death metal aesthetic, chiaroscuro lighting, pale moonlight, thick fog. Match uploaded reference image art style. #wan2.7image

English

tarun bana@iabom·2d

@Alibaba_Wan [Composition]: 4K wide shot. [Subject]: Sun Wukong in red/gold armor with staff. [Props]: Cloud & golden dragon as physical 1900s stage props. [Environment]: Misty karst mountain valley. [Lighting]: Golden hour sunset, dramatic rim light. #wan2.7image.

English

119

Wan@Alibaba_Wan·6d

1/2 Big news, creators! We are officially launching a three-part creative marathon centered around the powerhouse capabilities of Wan2.7-Image. We’re pushing the boundaries of what AI can do, and today, we kick off with our first challenge!

Track 1 – Refined Persona Sculpting

The Mission: Choose one of these three world-famous figures to recreate based on your vision: Hamlet (Shakespeare), Sun Wukong (Journey to the West), Catherine Earnshaw (Wuthering Heights).

Prize: The top ten winners will receive a one-month Wan Premium membership.

What We’re Judging:
Authenticity: Does the character match the author's vision?
Creative Flare: Set the scene with dramatic lighting and mood.
Technical Detail: From skin texture to the fabric of their clothes.

Duration: Challenge ends in one week!

English

122

1.1M
