Sol4ra

159 posts

Sol4ra

@therealsol4ra

Hiker, Photographer, Gamer, "AI" Advocate I post things I think are cool

Katılım Temmuz 2015

41 Takip Edilen15 Takipçiler

Sol4ra@therealsol4ra·1d

@mattcreed23 @WongUpdates Literally the most dogshit ending, the fuck are you talking about. Literally anything would be better.

English

101

Matt R@mattcreed23·1d

@WongUpdates Would have been crazy but def not a better ending then him getting what he deserves

English

12.9K

Wong Updates@WongUpdates·1d

Antony Starr reveals that they shot an alternate ending for ‘THE BOYS’ where Homelander wins. “He fucking kills everyone and scorches the entire planet. Oh well, it will never see the light of day.” (via: variety.com/2026/tv/news/t…)

English

342

860

30.7K

1.3M

Sol4ra@therealsol4ra·1d

@NancyJo1961 @depressionlesss Why are you so retarded

English

379

Nancy Jones@NancyJo1961·1d

@depressionlesss Please, please stop anthropomorphizing animals!

English

17.9K

Antidepressant Content@depressionlesss·2d

Cat immediately apologises to dog after realising that she was just playing with her baby, not harming him

English

1.2K

38.2K

1.2M

Sol4ra@therealsol4ra·3d

@winterlilies333 @Awk20000

QME

R@winterlilies333·3d

@Awk20000 I always found him so insufferable and very quick to shame women, people of color, and the poor. Won’t be surprised when he comes out as republican or something else shitty.

English

1.2K

yeet@Awk20000·3d

Caleb Hammer on those paying $20 per lunch to get fast food “Pack a sandwich”

English

743

190

9.7K

11.2M

Sol4ra@therealsol4ra·5d

@nicksortor How many fucking fires will this shitty state go through before it actually puts in solid infrastructure to combat it lmao

English

2.1K

Nick Sortor@nicksortor·5d

🚨 BREAKING: EVACUATIONS ORDERS have been issued in Simi Valley, California, in the Greater Los Angeles area as a brush fire threaten entire neighborhoods Structures are reportedly engulfed — possibly homes. Pray for anyone in the path of this fast-moving blaze 🙏🏻 🎥 @DanielJMcGreevy

English

1.2K

3.8K

12.2K

Sol4ra@therealsol4ra·5d

@VoodooDE_Gaming @edison_goff You didnt answer the question.

English

361

VoodooDE VR | XR Expert & Reviewer@VoodooDE_Gaming·5d

@edison_goff Steam Frame is not for Mixed Reality

English

5.8K

Sol4ra@therealsol4ra·14 May

@Ch1bi_ @AB_type_aruaru1

QME

みんなの気になるバズ動画@AB_type_aruaru1·14 May

アメリカの男性カップルがクラファンで"代理出産"の資金調達 ↓ 資金が集まり無事男児の親権をゲットカップルで仲睦まじく男児にキスするほのぼの動画をup ↓ 動画右側の男性が過去に...

日本語

363

445

11.4M

Sol4ra@therealsol4ra·14 May

@yxngaltpolo @peter_lingua_ Africans =/ African Americans

English

1.3K

polo@yxngaltpolo·14 May

@peter_lingua_ Bold as hell letting three random niggas in her car

English

481

17.9K

Àyántáyò@peter_lingua_·13 May

After their village is burned down during the civil war, a group of siblings flee for survival. But tragedy strikes when one sibling is left behind while the rest begin a new life in the United States. 🎬 Title: The Good Lie

English

122

378

18.6K

83.3K

Sol4ra@therealsol4ra·13 May

@danielhanchen Not paroquant in 2026? 😴

Français

146

Daniel Han@danielhanchen·13 May

We released experimental MTP Qwen3.6 Unsloth GGUFs! Qwen3.6 27B MTP now runs at 140 tokens/s. Qwen3.6 35B-A3B MTP gets 220 tokens/s generation on a single GPU. Qwen3.6 27B and 35B-A3B have >1.4x speed-up over the original GGUFs without any change in accuracy. Guide + GGUFs + Benchmarks: #mtp-guide" target="_blank" rel="nofollow noopener">unsloth.ai/docs/models/qw… In terms of average speedup, we see a 1.4x for dense models at draft tokens = 2 and for the MoE around 1.15 to 1.2x. We do not recommend more than 2 draft tokens because the acceptance rate drops precipitously from 83% to 50% with 4 draft tokens, and the forward passes for MTP become less beneficial. Use `--spec-type mtp --spec-draft-n-max 2` Thanks to Aman for github.com/ggml-org/llama…!

English

117

789

122.2K

Sol4ra@therealsol4ra·13 May

@grovymango

QME

grovy@grovymango·12 May

Young men in my dms please stop flirting with me i am 38 years old

grovy@grovymango

its me and my buck teeth against the world

English

2.5K

3.4K

161K

4.1M

Sol4ra@therealsol4ra·12 May

@Montreal Never been prouder to be an American

GIF

English

Montréal@Montreal·12 May

You're welcome here. 🏳️‍🌈🏳️‍⚧️

Cult MTL@cultmtl

Canada has been named the second safest country in the world for LGBTQ+ travellers: 1. Netherlands 🇳🇱 2. Canada 🇨🇦 3. France 🇫🇷 4. Australia 🇦🇺 5. Austria 🇦🇹

English

348

2.5K

34.2K

937.6K

Sol4ra@therealsol4ra·12 May

@bnjmn_marie Please test paroquant! Its supposed to be better than FP8 while being the same size as Int4! huggingface.co/z-lab/Qwen3.6-…

English

Benjamin Marie@bnjmn_marie·12 May

Evaluations of Quantized Qwen3.6 27B: FP8, NVFP4, and INT4 The model is robust to quantization. But have to be careful with the linear attention, not all submodules can be safely quantized to 4-bit, and NVFP4 underperforms (evaluated autoround and llm compressor). Intel’s INT4 model is particularly strong, and I also found it’s the fastest, and the smallest I’ve evaluated, with vLLM. One issue: these 4-bit models tend to generate more tokens, +10~30% compared with the BF16 model Full analysis: Latency, accuracy, MTP compatibility, memory consumption, here: kaitchup.substack.com/p/qwen36-27b-q…

English

114

8.1K

Sol4ra@therealsol4ra·12 May

@fw_keitth @PoroRaal @veizau Black men are actually just as likely to shoot up a school as white men based off the demographics of prior shooters.

English

1.5K

VortX@fw_keitth·12 May

@PoroRaal @veizau Just like whites shoot up schools and kill multiple innocent children and ruin multiple innocent families ?

English

318

10.3K

veizau@veizau·11 May

Joe Bartolozzi GOES OFF on people who say Black people are more likely to commit crimes than white people “If you are going to say statistically they do, that is only because of impoverished areas and the lives they’ve been given due to the implications of systemic racism.”

English

3.2K

6.7K

103.6K

10.6M

Sol4ra@therealsol4ra·12 May

@ArtificialAnlys Need to see where Qwen 27b stands and which harness is best for it. You already know its a beast…

English

117

Artificial Analysis@ArtificialAnlys·11 May

Announcing the Artificial Analysis Coding Agent Index! Our new coding agent benchmarks measure how combinations of agent harnesses and models perform on 3 leading benchmarks, token usage, cost and more When developers use AI to code they’re choosing a model, but also pairing it with a specific harness. It makes sense to benchmark that combination to understand and compare performance. The Artificial Analysis Coding Agent Index includes 3 leading benchmarks that represent a broad spectrum of coding agent use: ➤ SWE-Bench-Pro-Hard-AA, 150 realistic coding tasks that frontier models struggle with, sampled from Scale AI’s SWE-Bench Pro ➤ Terminal-Bench v2, 84 agentic terminal tasks from the Laude Institute and that range from system administration and cryptography to machine learning. 5 tasks were filtered due to environment incompatibility ➤ SWE-Atlas-QnA, 124 technical questions developed by Scale AI about how code behaves, root causes of issues, and more, requiring agents to explore codebases and give text answers Analysis of results: ➤ Opus 4.7 and GPT-5.5 lead the Index: Opus 4.7 in Cursor CLI scores 61, followed closely by GPT-5.5 in Codex and Opus 4.7 in Claude Code at 60. GPT-5.5 in Cursor CLI follows at 58. ➤ Open weights models are competitive, but still trail the leaders: GLM-5.1 in Claude Code is the top open-weight result at 53, followed by Kimi K2.6 and DeepSeek V4 Pro in Claude Code at 50. These are strong results, but still meaningfully behind the top proprietary models. ➤ Gemini 3.1 Pro in Gemini CLI underperforms: Gemini 3.1 Pro in Gemini CLI scores 43, well below where Gemini 3.1 Pro sits on our Intelligence Index, highlighting that Gemini’s performance in Gemini CLI remains a relative weak spot for Google’s offering. ➤ Cost per task (API token pricing) varies >30x: Composer 2 in Cursor CLI is cheapest at $0.07/task, followed by DeepSeek V4 Pro in Claude Code at $0.35/task and Kimi K2.6 in Claude Code at $0.76/task. At the high end, GPT-5.5 in Codex costs $2.21/task, while GLM-5.1 in Claude Code costs $2.26/task. For both models this was contributed to by high token usage, and in GPT-5.5’s case by a relatively higher per token cost. ➤ Token usage varies >3x: GLM-5.1 in Claude Code uses the most tokens at 4.8M/task, followed by Kimi K2.6 at 3.7M/task and DeepSeek V4 Pro at 3.5M/task. GPT-5.5 in Codex uses 2.8M tokens/task, substantially more than Opus 4.7 in Claude Code at 1.7M/task. In GLM-5.1’s case, higher token usage, cost and execution time were partly driven by the model entering loops on some tasks. ➤ Cache hit rates remain high but vary materially: Cache hit rates range from 80% to 96% across combinations. Provider routing, harness prompt structure and cache behavior can materially change the economics of running the same model given cached inputs are typically <50% the API price of regular input tokens. ➤ Time per task varies >7x: Opus 4.7 in Claude Code is fastest at ~6 minutes/task, while Kimi K2.6 in Claude Code is slowest at ~40 minutes/task. This is contributed to by differences in average turns per task, token usage and API serving speed. Opus 4.7 had materially lower amount of turns to complete a task than all other models while Kimi K2.6 had the most. ➤ Cursor made real progress with Composer 2: Composer 2 in Cursor CLI scores 48, near the leading open-weight model results, while being the cheapest combination measured at $0.07/task. Cursor has stated Composer 2 is built from Kimi K2.5, showcasing they have made substantial post-training gains. This is just the start. We are planning to add additional agents (both harnesses and models). Let us know what you would like to see added next.

English

125

170

1.6K

Sol4ra@therealsol4ra·11 May

@rpcs3 Lol would love to see the little bitch behind these shitty posts. You do nothing but deface the real RPCS3 team.

English

301

RPCS3@rpcs3·11 May

Our guidelines for submitting AI-generated code are now up in our repository! As for all the AI bros seething on our socials, we're simply blocking you. Learn how to debug, code, and leave behind something useful to humanity when you're gone, instead of peddling slop.

RPCS3@rpcs3

Please stop submitting AI slop code pull requests to RPCS3. We will start banning those who do without disclosing. There are plenty of resources online to learn how to debug and code instead of generating slop that you don't understand and that doesn't work.

English

198

1.7K

15.6K

376.2K

Sol4ra@therealsol4ra·10 May

@taco2p @animeupdates Youre a retard.

English

391

taco@taco2p·10 May

@animeupdates There’s no way this many people think “data center has been burned” means it actually lit on fire.

English

211

31.1K

Anime Updates@animeupdates·10 May

Notorious Piracy Streaming site AnimeKai will be shutting down, with developer ending the project following the data center catching on fire "Sorry, our data center has been burned :( We're no longer able to provide the file hosting service."

English

734

824

14.2K

Sol4ra@therealsol4ra·10 May

@rpcs3 Lol, I have no issue calling out shitty half assed PRs but to require disclosing the use of AI and telling people to go learn how to debug and code the old fashioned way is dumb asf.

English

4.2K

RPCS3@rpcs3·10 May

English

266

2.6K

29.5K

1.2M

Sol4ra@therealsol4ra·9 May

@TheoryLemur @RealestMemes_ You know you can delete games off your steam library right?

English

437

Lemur Theory@TheoryLemur·9 May

@RealestMemes_ Sometimes the idea that a game I'll never play again after it gets old is gonna sit on my games list is the main thing that will stop me from buying it in the first place no matter how cheap or many people I know like it.

English

6.5K

RealestMemes@RealestMemes_·9 May

ZXX

864

14.4K

745.2K

Sol4ra@therealsol4ra·8 May

@bnjmn_marie Please compare this new 4 bit paroquant with the others, apparantly its supposed to be better than FP8, I wouldnt believe it but it comes from Z-Labs! huggingface.co/z-lab/Qwen3.6-…

English

346

Benjamin Marie@bnjmn_marie·8 May

4-bit Qwen3.6 models, even with some layers kept in 16-bit, are more prone to endless thinking loops. FP8 behaves well. The worst case is the NVFP4 version with quantized linear attention, although the behavior curve is interesting: when it does not loop, responses tend to be shorter.

English

144

11.6K

Sol4ra@therealsol4ra·8 May

@sudoingX @GrizzledTexan Sure bud, or its because you dont use this for literally anything worth a damn. All just smoke and mirrors. What useless software.

English

Sudo su@sudoingX·8 May

@GrizzledTexan wouldn't you like to know. some things stay between me and the machine.

English

1.1K

Sudo su@sudoingX·8 May

i named my dgx spark "spark." it runs hermes agent /goal overnight. brain is qwen 3.6 27B Q8, 262K context, i set a goal before bed and wake up to results. no rate limits. no token costs. just local inference grinding while i sleep. this thing never stops.

English

227

62.3K

Sol4ra@therealsol4ra·8 May

@theodormarcu Working on a Claude/ chatgpt ecosystem equivalent but that is model/ API agnostic. This compute would go far as my claude pro and gpt pro subscriptions are both at their limits 😭

English

995

Theodor Marcu@theodormarcu·8 May

Have a few more Max subscriptions to give out to folks if you missed this! Just reply and DM me :)

Cognition@cognition

Intelligence at 1000 tokens per second, right in your terminal. Now available with SWE-1.6 Fast, powered by @cerebras. We're giving the first 100 people who respond a free month of Max to try it out.

English

607

638

108.8K

Keşfet

@mattcreed23 @WongUpdates @NancyJo1961 @depressionlesss @winterlilies333 @Awk20000 @nicksortor @DanielJMcGreevy