Purplefin Neptuna

568 posts

Purplefin Neptuna banner
Purplefin Neptuna

Purplefin Neptuna

@PurplefinNep

Game Programmer Making 2D Games on https://t.co/etQ1K1rfu0 Godot, and Unity if needed

Indonesia Katılım Aralık 2015
88 Takip Edilen45 Takipçiler
KVCache.AI
KVCache.AI@KVCache_AI·
🚀 We just launched the open-source KV Cache Size Calculator by KVCache.ai! Calculate KV cache size for mainstream LLMs with flexible precision settings and detailed breakdowns. Supports DeepSeek, GLM, Kimi, Qwen3 and MiniMax. Try it now: kvcache.ai/tools/kv-cache…
KVCache.AI tweet media
English
9
19
137
46.7K
Lunar φ
Lunar φ@LunarWave__·
Love watching someone open a game 3 times within the span of a minute on Steam. Like hell yeah this time it will be fixed.
English
103
3.1K
46.4K
629.7K
Purplefin Neptuna
Purplefin Neptuna@PurplefinNep·
Capping reasoning budget maybe made it worse, I might need to rerun this. See this leaderboard to compare how fucked up your AI setup is🤣: bench-loop.com/leaderboard
Purplefin Neptuna tweet media
English
0
0
7
43
月須和・那々
月須和・那々@nana_tsukisuwa·
animaにて、ADDifTでの学習も出来ました。 この辺の手法はanimaでも通用するようです。
月須和・那々 tweet media月須和・那々 tweet media月須和・那々 tweet media
日本語
1
8
57
4K
Purplefin Neptuna
Purplefin Neptuna@PurplefinNep·
Sayang pas nyoba di pc gw, malah offset semua hasilnya. Kyknya Q4 terlalu bego buat ginian 😔
ModelScope@ModelScope2022

Big leap in Object Detection with Qwen3.6-35B-A3B! 🚀We are excited to showcase the new "Instruction-Oriented Object Detection" capability on ModelScope. Demo 👉modelscope.ai/studios/Qwen/O… 📈 Performance: ODinW score jumped from 42.6 (Qwen3.5) to 50.8! 🧠 Beyond standard detection, Qwen3.6 leverages LLM reasoning to: 1️⃣ Identify fine-grained objects, such as PCB components and reference designators. 2️⃣ Detect small and occluded cars in aerial-view parking lots. 3️⃣ Handle dense scenes multi-scale objects. 🤖 Download model: modelscope.ai/models/Qwen/Qw… #Qwen36 #ObjectDetection #ComputerVision #OpenSource

Indonesia
0
0
0
67
Mass
Mass@MemoryReboot_·
Spent today getting DFlash running on dual 3090 + Gemma 4 31B From the very beginning I took a wrong turn - AWQ 8bit + DFlash = 0.4% acceptance, drafter was trained on a different quant - pip install PR branch → trashed my venv What worked: @malikwas1f club 3090 recipe, pre patched docker container. Just docker compose up Results: - 86 tok/s, accept rate 61.6% +33% over my MTP result (52 tok/s) Couldn't hit those 168 tok/s (hello PCIe x4 on the second card) Gonna try to get better numbers tomorrow
Mass tweet media
English
6
4
18
2.2K
𝔸𝕟𝕕𝕚 𝕁𝕒𝕖𝕘𝕖𝕣📯📸
@sudoingX Cloning and compiling the most popular llama forks as well as upstream was never as easy and comfortable as now. Just make sure to have gcc between version 12 and 15, nvcc doesn't like the latest version 16.
English
3
0
4
938
Sudo su
Sudo su@sudoingX·
anyone interested in or getting started with local ai personal inference, pay attention. start with the right practice. compile llama.cpp from source. i know lm studio and ollama exist. they're great onramps. but they're mostly wrappers around llama.cpp with abstraction layers that hide the flags you actually need to tune. what compiling once gets you: > the best inference engine for personal use, full stop > latest features the day they merge (vulkan flash attention dp4a, kv cache quant, fa toggles) > exact gpu arch optimization (sm_120 for 5090, sm_89 for 4090, sm_86 for 3090) > direct flag control > openai-compatible llama-server api ready out of the box the build (3-5 minutes on a modern cpu): git clone github.com/ggerganov/llam… cd llama.cpp cmake -B build -DGGML_CUDA=ON -DCMAKE_CUDA_ARCHITECTURES=120 cmake --build build --config Release -j (replace 120 with 86 for 3090, 89 for 4090, 80 for A100. for AMD GPUs swap GGML_CUDA for GGML_VULKAN.) when to NOT use llama.cpp: > multi-gpu batch serving at scale = vllm > production async high-throughput = vllm or sglang > apple silicon = mlx is faster for single-gpu personal inference + agentic workflows + benchmarking: llama.cpp from source. every time.
English
43
46
493
23.2K
Purplefin Neptuna
Purplefin Neptuna@PurplefinNep·
@sudoingX Did you run this with thinking enabled/preserved? I don't trust Qwen3.6 thinking mode with low-bit model & KV quant. 35B can waste 100k tokens just thinking, so I'm worried 27B will waste more time thinking, since it's much slower.
English
0
0
0
97
Sudo su
Sudo su@sudoingX·
carnice-v2 is up now on my rog 5090 mobile 24gb tier, llama-server holding 21gb at 262k context, hermes agent on the other pane, nvtop watching the line go up. hell yeah tonight is gonna be great. if you are running with me on 24gb vram, here is the exact stack: ./llama-server -m carnice-v2-27b-Q4_K_M.gguf -ngl 99 -c 262144 -np 1 -fa on --cache-type-k q4_0 --cache-type-v q4_0 --port 8080 --host 0.0.0.0 3090, 4090, 5090 mobile, w7900, anything 24gb, copy the flags and share what you got. if you have not run carnice on your card yet anon, what is stopping you?
Sudo su tweet media
Sudo su@sudoingX

a question keeps hitting my mind. does same base SFT for harness beat vanilla qwen 3.6-27b on hermes agent agentic loops? to find out i loaded carnice-v2 by kaios, qwen 3.6-27b tuned specifically on hermes agent traces. trinity 3-stage merged bf16 SFT. i have been benchmarking vanilla 3.6 on agentic tasks for weeks so i would catch any meaningful improvement on a head to head. been wanting to run this exact lineup. carnice on the same hardware, same context, same flags i run vanilla on. let's see how it performs against the base. hardware: rog scar 5090 mobile, 24gb vram tier. every flag i use reflects directly to your 3090 desktop 24gb territory, so if you are running a 3090 you can copy the same setup and follow along. results coming next anon.

English
11
0
80
26.4K
Purplefin Neptuna
Purplefin Neptuna@PurplefinNep·
@0xSero I have Nvidia, but small vram. can only run gguf on llama.cpp
English
0
0
0
144
0xSero
0xSero@0xSero·
SGLang > VLLM > Exllamav3 > Llama.cpp IF you have any Nvidia hardware this is the play.
English
65
39
668
45.1K
👯‍♀️カガミカミ水鏡👯‍♀️
anima-lllite-any-test-like-3 すごい便利……色つき画像でも参照できるし、ポーズ維持しながらもバリエーション出してくれる(node整理せんとな……)
👯‍♀️カガミカミ水鏡👯‍♀️ tweet media
日本語
1
28
308
16.8K
nen
nen@nd_haryo·
I’ve played enough OpenTTD to finally understand this
Jakarta City Of Galian@LetitFlowBeib

@AndCav4 Tapi kecelakaan ini penyebabnya ya harusnya bukan dari taxi. Kalopun itu bukan taxi dan diganti pohon tumbang. Sinyal kereta tersebut ga akan berubah. Menurut Masini Bekasi sinyal hijau. Padahal seharusnya Bekasi Kuning karena ada kereta di stasiun bektim

English
1
0
0
90
Purplefin Neptuna
Purplefin Neptuna@PurplefinNep·
Finished. Reason: Output token limit reached.
English
0
0
0
27
Purplefin Neptuna
Purplefin Neptuna@PurplefinNep·
I think Qwen3.6 35B A3B is the best local model that can fit in my 3090 and also fast. But wait, let me check if Qwen3.6 35B A3B is the fastest. Right, Qwen3.6 35B A3B is the fastest and fit in my 3090. But wait, let me check if Qwen3.6 35B A3B fit in my GPU. Yes, Qwen3.6 35B A3B
English
2
0
1
62
D🚴🏻‍♂️
D🚴🏻‍♂️@_sgdx·
Kalau di tahun 1969 Amerika Serikat bisa mendarat di Bulan. Kenapa sekarang cuma bisa flyby(?) Kenapa gak sekalian mendarat aja gitu🤔
Indonesia
2
0
0
88
Purplefin Neptuna
Purplefin Neptuna@PurplefinNep·
@bizlet7 It looks way better by a lot. But for someone who scrolls through AI-generated images all day… I'm just tired of it, boss. i love old DLSS where they tried to fully reconstruct 1/9 resolution image to match the full resolution one. not this smartphone beautify filter crap.
English
0
0
0
153
毎日カト MaiKatz14
毎日カト MaiKatz14@MaiKatz14·
Followers when I dont post preg arts during Ramadan
毎日カト MaiKatz14 tweet media
English
2
2
35
6.1K
JFJ🇺🇸
JFJ🇺🇸@jifferey·
I really like what Hytale offers with it's progression. In Minecraft: your progression is anywhere as long as you can dig and go down. In Vintage story: your progress is based on luck but a slight amount of adventure. Hytale separates your progress by distance. which encourages adventure. While exploring and moving there's better and more difficult mobs that drop more resources of what you need. there's premade structures such as enemy bases, mineshafts, demon temples, sunken pirate ships, ocean monuments. I GREATLY enjoy the adventure aspect and it's good to know Hytale is focussed heavily on moving forward. and they give you teleporters so you can keep moving and head home to have a base of operations. I like it. I'd probably install mods or wait for updates to flesh it out and polish it more but this is pretty great so far.
JFJ🇺🇸 tweet media
English
84
226
7.6K
356.3K