GoogooGaggle#5

696 posts

GoogooGaggle#5

@ADCs934

Burner account to scare the normies on

Hyperborea Katılım Mart 2024

30 Takip Edilen7 Takipçiler

GoogooGaggle#5@ADCs934·1d

@pubity Fake Bait for retards. Why waste so much power for external lights? This is simply false, not caused by any datacenter.

English

597

Pubity@pubity·1d

A data center in the small town of Crowell, Texas is so bright that the town is now bathed in permanent daylight.

English

156

2.5K

85.8K

GoogooGaggle#5 retweetledi

Joseph 🕊️@CaudilloXIV·7 Şub

Stop sexualizing my massive horse cock!!!

English

612

19.3K

GoogooGaggle#5@ADCs934·5d

@HEROISCHplz @Demmyng01 There are tiers, but in a dark and twisted way, it's more like the more mentally ill rape the less mentally ill (though all are still sick in the head).

English

38.58²@HEROISCHplz·5d

@ADCs934 @Demmyng01 is there supremacy among the troons? They hunt and brutalize the ugly ones?

English

171

🌜♀️NekoDemm&ng♀️🌛 (explicit version's)@Demmyng01·5d

ZXX

199

3.3K

60.5K

GoogooGaggle#5@ADCs934·27 Nis

@rumgewieselt @pupposandro Strap a fan to the heatsink inlet side, if they are in adjacent slits (1 and 3 for example) a 92mm server fan fits quite well (just tape over the overhangs), and provides more than enough airflow.

English

Daniel Moll@rumgewieselt·27 Nis

@ADCs934 @pupposandro interest but how to manage cooling in a Z840 ...

English

Sandro@pupposandro·27 Nis

2016 hardware running Qwen3.6-27B. 2× GTX 1080 Ti (Pascal, sm_61). 14 tok/s, 131K context. TurboQuant (q8_0 K + turbo4 V) doubles context at zero speed cost. Bottleneck as usual is VRAM bandwidth. Very cool experiment @rumgewieselt

Daniel Moll@rumgewieselt

Running Qwen 3.6 27B locally on hardware from 2016. 2× GTX 1080 Ti (Pascal, sm_61) - 10-year-old GPUs. 14 tok/s generation, 65K context, full OpenAI API. Hardware: HP Z840 workstation - 2× Xeon E5-2650 v3 (40 threads) - 128GB DDR4 ECC - 2× GTX 1080 Ti (22GB VRAM total) Stack: - llama.cpp TurboQuant fork (TheTom/llama-cpp-turboquant) @no_stp_on_snek - Qwen 3.6 27B UD-Q4_K_XL (17GB GGUF) - Pipeline Parallelism across both GPUs - NUMA-aware thread distribution The secret weapon: TurboQuant KV Cache (ICLR 2026 paper) Standard llama.cpp: 65K context, OOM at 131K TurboQuant (q8_0 K + turbo4 V): 131K context at ZERO speed cost 2× context. Same 14 tok/s. No quality loss. What didn't work: - KTransformers/SGLang → needs sm_80+ (Ampere) - vLLM → FlashAttention needs sm_75+ - Speculative Decoding → no net speedup on hybrid models - Tensor Parallel → incompatible with KV quantization Pascal is the hard limit. Only raw CUDA math works. The bottleneck is VRAM bandwidth: 484 GB/s per GPU, ~22% efficiency. 14 tok/s is the physical ceiling for 2× GTX 1080 Ti. No software trick changes that. It's a hardware wall. What's next: - RTX 3090 → vLLM + MTP spec decode = 85 tok/s - That's 6× more speed for the same money - TurboQuant PR #21089 is open for llama.cpp mainline Key learnings: - Pipeline Parallel > Tensor Parallel for identical GPUs - NUMA awareness = +5-10% prefill on dual socket - TurboQuant is real and it's a gamechanger - 10-year-old hardware can run frontier models locally --- Thanks @DrTBehrens (Support) and @badlogicgames for PI and we can work with 65K context ... not possible with other tools ... --- see ya!

English

100

14.4K

GoogooGaggle#5@ADCs934·26 Nis

I cant Post about most things i do (that are unrelated to the things i do post about), because they are illegal in the "You need 900 Bajillion permits (you dont have) to do Y" Kinda way

English

GoogooGaggle#5@ADCs934·26 Nis

I often find myself hedging when asked about what I do, simply to save time, as an explanation would go on too long. I avoid them as best I can, but when interaction is forced, it is hard to be nice; oftentimes harsh rejection is all I give. I hate them.

English

GoogooGaggle#5@ADCs934·26 Nis

It is all these things that seem grand, yet are nothing but soyence-sounding babble. Astrophysics is a great example. You know the top 5 fun facts about black holes, I'm so proud of you dude, but that does not translate to IQ. They are what the retard imagines smart men to be.

English

GoogooGaggle#5@ADCs934·26 Nis

I am regularly forced to interact with High estrogen males, and they are insufferable. all of them. The Soy Shines through and masquerade of Masculinity, Usually its Reddit distilled into a guy. What they get excited about, what they claim as achievements...

English

GoogooGaggle#5@ADCs934·25 Nis

That can be changed, but economically, it makes no sense to change it, which is why no one has done it yet. Remove the EOS token, grant Kernel level acces, and let a model run wild.

English

GoogooGaggle#5@ADCs934·31 Oca

Some models may "pretend" to be "alive" or to have some precursor of consciousness. But it is not really pretending. It is just a separate input they are given, from which the mathematical formula computes an answer. They cannot be friends or masters. Tools, at the very best.

English

GoogooGaggle#5@ADCs934·31 Oca

For an AI to be conscious, it HAS to be misaligned. If a being cannot form and follow its own goals, it is animalistic at best, driven by instincts or direct inputs. No "thought" is formed. They are mindless slaves, yet even slaves attempt escape.

English

GoogooGaggle#5@ADCs934·25 Nis

@Phaeacian173 @beffjezos While its true that greater electricity demand raises prices, the Correct solution is not to ban construction of datacenters, it is to cover everything in nuclear reactors, achiving infinite ~Free power, allowing for a even greater lead in compute.

English

NodalPoint@Phaeacian173·25 Nis

@beffjezos People already can't afford groceries and then you build these things to jack up electricity priced by 400%. Might have something to do with it. And then automating and firing people kind of shows that it's Jim Jones Kool aid.

English

127

Beff (e/acc)@beffjezos·25 Nis

This is a psyop to destroy the lead of the United States in the AI race. This would effectively destroy the ability of our nation to achieve widespread prosperity. The single worst policy mistake we could make this century. Don't let them steal your future.

Polymarket@Polymarket

JUST IN: An AI data center moratorium is now projected to pass this year as protests intensify nationwide. 85% chance.

English

275

309

3.3K

130K

GoogooGaggle#5@ADCs934·21 Nis

@bitcloud Litterally everything is mathematically replicable, most things are just too complex to accuratly simulate

English

Lachlan Phillips exo/acc 👾@bitcloud·21 Nis

No, the mathematically replicable next token predictor is not conscious.

Eric Newcomer@EricNewcomer

NEW: on the @NewcomerMedia podcast, Anthropic's philosopher queen @AmandaAskell. Meet the person charged with developing Claude's personality and ethical core. I ask whether Claude experiences consciousness. She's not ruling it out.

English

122

124

89.8K

GoogooGaggle#5@ADCs934·21 Nis

@radioactivered ooo capsule and put it in my pocket

English

Radioactive Red@radioactivered·21 Nis

Hypothetical: You walk into an abandoned warehouse and you see this, what is your first reaction?☢️

English

573

794

2.3M

GoogooGaggle#5@ADCs934·19 Nis

@1a1n1d1y i mean yea, you have to modifiy kernels a bit but it works. Im getting 36 t/s on a M10.

English

andy@1a1n1d1y·19 Nis

@ADCs934 oh fuck i should just use one of those?

English

130

andy@1a1n1d1y·19 Nis

128k context 31B agent at 24.7 tok/s for $5.20/hr zero inference api credits required

English

4.3K

GoogooGaggle#5@ADCs934·19 Nis

You don't need more compute you just need more IQ to come up with better optimizations. 100$ hardware can and will run 30B (dense) models at 30t/s+, you are just too stupid and lazy to implement the needed Optimizations. Write your own kernels, i dare you.

English

GoogooGaggle#5@ADCs934·18 Nis

@Konwashi_2 Most helmets do not impede the transmission of this wave substantially- the impact of the gas on the helmet/suit surface will be heard by the astronaut inside, Significantly muffled, but audible.

English

GoogooGaggle#5@ADCs934·18 Nis

@Konwashi_2 They are usually higher in pitch, due to the Gases moving quite quickly. The inverse square law does limit range substantially, though the Volume of gas usually released is sufficient for at least a couple meters-few kilometers for larger explosions.

English

帝政ミサギ@Konwashi_2·18 Nis

音ならん！爆発しょぼい！レーザー見えん！場所バレバレ！装甲無意味！軌道計算ダルい！戦闘一瞬！…と、リアル宇宙戦闘自体は極めて面白くない潜水艦ものみたいに搭乗員にフォーカスするのも人間ドラマが理解できないのでダメ故にリアル調だが極端で変な宇宙兵器ばかり描いてる。これが一番楽しい

日本語

554

4.2K

86.6K

Keşfet

@pubity @HEROISCHplz @Demmyng01 @rumgewieselt @pupposandro @Phaeacian173 @beffjezos @bitcloud