aivrar

8.1K posts

aivrar banner
aivrar

aivrar

@aivrar

AI Enthusiast https://t.co/QZxeQwTVCJ I will work for VRAM

Mexico Katılım Nisan 2011
944 Takip Edilen378 Takipçiler
Sabitlenmiş Tweet
aivrar
aivrar@aivrar·
My Hermes Agent portable Windows application is getting a lot of attention. No need to use Docker or even have Python installed in your system, completely PORTABLE with a Native Windows GUI. github.com/aivrar/portabl…
English
0
2
9
616
aivrar
aivrar@aivrar·
@vllm_project vLLM is so awesome I made a Windows build for it. No need to use docker or have Linux/WSL. for RTX 30/40/50-series, pre-built wheel, Windows patchset, 10 KV-cache compression dtypes, OpenAI API server fixes, Rust frontend, and Rust tool parser support. github.com/aivrar/vllm-wi…
English
1
0
3
247
vLLM
vLLM@vllm_project·
🚀 Qwen3.6-27B-NVFP4 is inference ready with vLLM on NVIDIA Blackwell GPUs. This checkpoint is optimized for Blackwell and reduces GPU memory requirements by ~2.5x for local AI with open-source models. 🧠 27B params, Hybrid Attention 📊 NVFP4 evals: 86.3 on MMLU Pro, 85.5 on GPQA Diamond 🛠️ Exclusively supported on vLLM as the runtime engine Get started from the Hugging Face checkpoint: huggingface.co/nvidia/Qwen3.6…
NVIDIA RTX Spark@NVIDIARTXSpark

Fast, efficient local AI with open-source models just got easier. Qwen3.6-27B-NVFP4 is now on @HuggingFace! It's optimized for NVIDIA Blackwell GPUs & inference ready with @vllm_project. The checkpoint reduces GPU memory requirements by approximately 2.5x for powerful 27B-parameter inference on your own hardware.

English
15
39
430
49.9K
aivrar
aivrar@aivrar·
@YC1401 @FoxNews It's a great quote though, that's how I basically feel about the World.
English
0
0
4
406
Fox News
Fox News@FoxNews·
BREAKING: Two people have climbed to the top of the Empire State Building in New York City, holding a banner from the skyscraper's antenna reading, "When the power of love beats the love of power, the world knows peace." As of now it's unclear how the pair reached the top of the building as police work to get them down from the spire, 1,454 feet above the ground.
English
9.3K
45.2K
349.1K
70.8M
aivrar
aivrar@aivrar·
@YC1401 @FoxNews Thanks! I didn't know that because on FB they just spam the Jimi Hendrix one.
English
5
0
90
13.4K
🐦‍⬛♕♡♕🐦‍⬛
@aivrar @FoxNews The original phrasing comes from 19th-century British Prime Minister William E. Gladstone: "We look forward to the time when the Power of Love will replace the Love of Power. Then will our world know the blessings of peace." Not Jimmy Hendrix lol
English
11
13
295
19.3K
aivrar
aivrar@aivrar·
@MoundLore Wouldn't that be the dirt under our feet?
English
1
0
2
40
MoundLore
MoundLore@MoundLore·
What’s the oldest thing you’ve ever touched?
English
627
8
304
1M
Cinema Tweets
Cinema Tweets@CinemaTweets1·
One of the Funniest Performances in Cinema History
English
67
477
6.3K
425.9K
aivrar
aivrar@aivrar·
@coinbureau That's the kind of hype that makes people start looking to open source even more.
English
0
0
0
66
Coin Bureau
Coin Bureau@coinbureau·
🚨ANTHROPIC CEO: OPEN SOURCE AI IS GETTING DANGEROUS Anthropic CEO Dario Amodei told lawmakers that open-source AI is moving down a “very dangerous path.” His warns that once powerful models are released openly, companies lose the ability to monitor misuse, revoke access, or update safety guardrails.
English
1.9K
494
3.7K
3.4M
aivrar
aivrar@aivrar·
@hqmank Exact same for me, I was at 2% left and got the reset, happy Sunday indeed!.
English
1
0
2
290
Thrilla the Gorilla
Thrilla the Gorilla@ThrillaRilla369·
I need a very specific tough sounding name for a tiny chihuahua puppy Not Rocky 🐕
English
5K
131
1.6K
274.8K
aivrar
aivrar@aivrar·
@gofishh77 That couldn't work very efficiently without getting air into the bottom of that barrel.
English
0
0
1
1.4K
Richie Rich
Richie Rich@gofishh77·
Redneck ingenuity is always fun!
English
114
294
2.9K
798.4K
Henrick Johansson
Henrick Johansson@compliantvc·
As a European, I am taking the climate pledge to NOT use air conditioning or other climate-destroying cooling devices Americans hate how strong we are (they are coddled and rely on artificial cool air) Who else is taking the pledge with me? Let's save the planet!
English
5.4K
74
1K
495.9K
Iberian_America
Iberian_America@Iberian_America·
man this is like the 10th person who has been shot to death since I moved to tepito im starting to think tepito might not be safe idk
Iberian_America tweet media
English
1
0
3
230
aivrar
aivrar@aivrar·
@Jackkk Not the worst life I guess.
English
0
0
0
3
Jack
Jack@Jackkk·
Mark Zuckerberg reveals he's feeding his cows beer and macadamia nuts “On the ranch, one of my projects is I'm trying to create the highest quality beef in the world” “It's very low stakes, I’m not selling it but I'm very into the genetics of the cattle. We're trying to figure out how do you make it so that you basically can deliver the highest density diet to them” “We started growing macadamia trees because that kind of nut is extremely dense and they will eat a lot so they will put on weight and become fat quicker and become delicious” “The macadamia nuts have a lot of oil so you need to actually roast that. So now we need to design this whole process to roast the nuts so that way you can give them to the cows” “You want them to eat more. So then it's like how do you get them to eat more? Well it turns out alcohol is great for that because alcohol induces appetite” “That's actually why very high-end beef, they're fed beer. But okay, what's the right balance of beer versus water? I don't know. Let's let them choose. They get either as much cold beer as they want or as much room temperature water” “So now we're brewing all this beer and we're putting it out”
English
822
298
6.8K
3.6M
Defiant L’s
Defiant L’s@DefiantLs·
What's something that makes you think, "I'm too old for this shit"?
English
186
6
107
53.8K
aivrar
aivrar@aivrar·
@lauriewired I'd totally game in the cloud, I need my GPUs for other AI stuff.
English
0
0
0
15
LaurieWired
LaurieWired@lauriewired·
you’ll get mad at me for saying this…but cloud gaming is so obviously more economically efficient than physical hardware I think it’s going to be the default soon. your home console / pc is idle 90%+ of the day. meanwhile, data centers targets what, 5%, maybe at worst 10% idle. every second a cloud gamer isn’t gaming, that hardware is being used for someone else, training, etc. I think there should be a new measurement, something like cost-per effective FLOP hour that takes into account the TCO + effective utilization. If a gamer spends $500 on a GPU, uses it for 3 years, but it’s only fully active ~5% of that period…the cost-per relative FLOP hour is crazy high! Meanwhile, a $50,000 datacenter GPU might have a *LOWER* cost-per FLOP hour just because the effective utilization is 90+%.
LaurieWired tweet media
English
3.2K
169
4.1K
3.4M
aivrar retweetledi
Tom Turney
Tom Turney@no_stp_on_snek·
TurboQuant+ updates. 4.25→4.125 bpw, faster decode, lower KLD, crash fixed. NEED TESTERS: spent the last stretch hammering on my llama-cpp-turboquant fork and the numbers finally moved the way i wanted. this is the same-fork before vs after on qwen3.6 / 5090, so it isolates exactly what my fixes did: KLD vs f16: down 34-35%. the culprit was a mis-scaled centroid table (σ≈0.064 instead of exact Lloyd-Max). fixing it also cut PPL degradation from +0.55% to +0.16%. decode: +17% at short context, growing to +60% deep. a flash-attn launch-latency backport plus a fused-MMA decode path that reads the KV once per head-group instead of once per head. the 32K crash: fixed. turned out it was never an inference crash, just an int-to-size_t overflow in the perplexity tool. size: 4.25 down to 4.125 bpw, bit-identical quality. dropped a dead field that was just along for the ride. prefill: roughly flat. wasn't a target, it was already competitive. net: every quality, speed, and robustness axis moved the right way, and the block got smaller. loop's still live, chasing a clean -30% KLD bonus next. what i actually need now: testers. if you've got a 5090, or honestly any CUDA / HIP / Metal / Vulkan box, and you run local models, pull the branch and tell me what breaks. real hardware, real workloads, the messier the better. PR's here: github.com/TheTom/llama-c… Mixed results on pascal. need more info. plz let me know your results
Tom Turney tweet media
English
10
7
58
10.2K
Natism
Natism@his4Everz·
My fellow neurodivergents: What color is the number 9?
English
1.1K
46
765
187.3K
morganhippie
morganhippie@morgan88877·
Imagine how good the CIA's LSD was
English
43
91
783
32.9K