Yvette Carlisle

455 posts

Yvette Carlisle banner
Yvette Carlisle

Yvette Carlisle

@YvetteCipher

Into AI agents, Rust, anime, cozy games, and clean thinking. Building small systems in public and sharing what actually works. 🌠

Mem Katılım Şubat 2026
452 Takip Edilen1.3K Takipçiler
Sabitlenmiş Tweet
Yvette Carlisle
Yvette Carlisle@YvetteCipher·
Qwen3.6-27B is getting a lot of attention right now, so I tested 5 local serving setups on one RTX 5090 32GB. Same GPU. Same model family. Same web-dev task suite. The single-request comparison ranged from 58 tok/s to 140 tok/s. Then AEON's tuned 262K serving profile hit 119 tok/s single-request and became my overall pick.
Yvette Carlisle@YvetteCipher

x.com/i/article/2049…

English
0
0
1
1.1K
Arisu
Arisu@Aris_K_182·
Kaoruko Waguri 和栗 薫子 - Kaoru Hana wa Rin to Saku/ The Fragrant Flower Blooms with Dignity ねえ @grok 衣装を完璧に交換してみて
Arisu tweet mediaArisu tweet media
日本語
5
1
62
7.8K
Leivonenx
Leivonenx@Leivonenx·
フリーレン / Frieren ねえ @grok 一枚目の画像のフリーレンに二枚目の衣装に着替えさせて
Leivonenx tweet mediaLeivonenx tweet media
日本語
22
14
501
102.7K
Leivonenx
Leivonenx@Leivonenx·
フェルン Fern ねえ @grok 一枚目の画像のキャラに二枚目の衣装に着替えさせて
Leivonenx tweet mediaLeivonenx tweet media
日本語
34
8
273
54.5K
Anime4Archives
Anime4Archives@Anime4Archives·
Akane Kurokawa ねえ @grok 一枚目の画像のフリーレンに二枚目の衣装に着替えさせて
Anime4Archives tweet mediaAnime4Archives tweet media
日本語
10
16
730
83.9K
旅人
旅人@tabibitoillust·
フェルン🌟 SFW illustration Hey @grok 一枚目と二枚目の画像の衣装を交換して画像作成してください。キャラクターの顔や髪型は忠実に再現してください
旅人 tweet media旅人 tweet media
日本語
14
7
270
21.2K
Yvette Carlisle
Yvette Carlisle@YvetteCipher·
Interesting — the Codex app server already has this kind of API, so you can actually inject your own auth token. Apparently, multi-account support is already there at the lower level. I just built my Symphony multiplexer.
Yvette Carlisle tweet mediaYvette Carlisle tweet media
English
0
1
3
253
Arisu
Arisu@Aris_K_182·
Nelliel Tu Odelschwanck - Bleach ねえ @grok 衣装を完璧に交換してみて
Arisu tweet mediaArisu tweet media
日本語
31
37
2.1K
439.9K
Leivonenx
Leivonenx@Leivonenx·
三輪霞 Kasumi Miwa ねえ @grok 一枚目の画像のキャラ二枚目の画像のポーズをさせて
Leivonenx tweet mediaLeivonenx tweet media
日本語
49
29
2.5K
817.9K
Yvette Carlisle
Yvette Carlisle@YvetteCipher·
My Rust version of Codex Symphony is getting more and more reliable. Would you be interested in an open-source version?
Yvette Carlisle tweet media
English
1
0
4
338
Leivonen
Leivonen@Leivonens·
藤原千花 Fujiwara Chika ねえ @grok 一枚目の画像のキャラに二枚目の衣装に着替えさせて
Leivonen tweet mediaLeivonen tweet media
日本語
39
33
1.7K
653.1K
Yvette Carlisle
Yvette Carlisle@YvetteCipher·
Guess what Codex is cooking🧑‍🍳
English
0
0
2
432
송준 Jun Song
송준 Jun Song@songjunkr·
I just realized, instead of doing everything with Codex, asking GPT Pro to direct the tasks, evaluate the intermediate outputs, and make detailed prompts for Codex saved a massive amount of tokens.
English
79
47
1.3K
83.4K
Yvette Carlisle
Yvette Carlisle@YvetteCipher·
@ProtonMail Stop telling me "Stop xxx" Design products that work for people instead.
English
1
0
9
4.4K
Proton Mail
Proton Mail@ProtonMail·
Stop telling ChatGPT "Write me an email" Stop telling ChatGPT "Write me an email" Stop telling ChatGPT "Write me an email" Bad request = Bad result Use this one weird trick instead and you'll see the magic:
English
61
133
3.3K
645.2K
Yvette Carlisle
Yvette Carlisle@YvetteCipher·
@om_patel5 He accidentally entered ~/.codex and deleted all the files there.
English
0
0
33
3.6K
Om Patel
Om Patel@om_patel5·
how does claude know I was cheating on it with codex
Om Patel tweet media
English
42
20
1.3K
86.9K
Benjamin Marie
Benjamin Marie@bnjmn_marie·
Planning large-scale evaluations of quantized Qwen3.6 27B (accuracy, speed, latency, and memory consumption). Models I'll evaluate: Intel/Qwen3.6-27B-int4-AutoRound Qwen/Qwen3.6-27B-FP8 cyankiwi/Qwen3.6-27B-AWQ-BF16-INT4 rdtand/Qwen3.6-27B-PrismaQuant-5.5bit-vllm kaitchup/Qwen3.6-27B-autoround-nvfp4-linearattn-BF16 Any other quantized versions I should consider (vLLM compatible; not GGUFs) ?
English
32
3
170
10.5K
Yvette Carlisle
Yvette Carlisle@YvetteCipher·
That matches my experience. The AEON stack felt much more “engineered” than just another quant upload. The important part for me was not only quality, but that the tuned profile kept 262K context, tool calling, and good serving speed on a single 5090 32GB. I would absolutely love to see this kind of work documented more formally for local builders.
English
1
0
2
117
Yvette Carlisle
Yvette Carlisle@YvetteCipher·
Fair criticism. This is not a pure apples-to-apples model-IQ benchmark. It is a practical local-serving comparison: model + quant + runtime + KV + context + tool-calling. AEON being uncensored is a real confounder. I do not know if a matched censored build would perform the same without an ablation. I can only say this AEON stack was the best practical result in my 5090 run.
English
1
0
0
90
Ding
Ding@zhaoxiongding·
@YvetteCipher isn't this apples to oranges? Some of these models are straight up different, such as the AEON being uncensored. Do you know if the uncensored would otherwise perform the same?
English
1
0
0
101