Bill Lipe

6.2K posts

Bill Lipe banner
Bill Lipe

Bill Lipe

@bill_lipe

Founder, Lipe Protocol. Building smarter AI that balances empathy with technical precision. Honest, secure, and precise. https://t.co/4IaPqpH7nn

Salinas, CA Katılım Ocak 2020
261 Takip Edilen821 Takipçiler
Bill Lipe
Bill Lipe@bill_lipe·
I have one right here in my state assembly district, the speaker no less. I ran against him in 2018, in a fraudulent election. He now is "the most powerful man" in Sacramento. The rest of his email is about the world cup in California (his lead message after this opening), followed by Pride Flag updates, and a free pass to the California State Parks (for the next six months). He's here to help; Ronald Reagan's scariest words live on in infamy via Robert Rivas. 🤮
Bill Lipe tweet mediaBill Lipe tweet mediaBill Lipe tweet mediaBill Lipe tweet media
English
0
0
2
187
Walter Kirn
Walter Kirn@walterkirn·
Over and over I've told you that many of them (most?) are actors. Simply actors. The kind who barely know what they're saying as they say it. It took far too long for me to realize this. I had to sit close to them as they performed. Saw it, felt it, knew it, can't forget it.
Elon Musk@elonmusk

@mazemoore AOC is just an actor. It’s her puppet masters that are the problem. She is spouting insane lies that are disprovable by a Google search, but a lot of people will believe her.

English
186
863
5.4K
96.6K
Bill Lipe
Bill Lipe@bill_lipe·
@prompterminal So let it be written, so let it be done. DM or let me know if you want the other 12 pages. Best, Bill
Bill Lipe tweet media
English
0
0
0
5
Jeanne
Jeanne@prompterminal·
Is someone going to write an essay on the despicable levels in human greed at the expense of ethical morality and hence the lack of engineering brain cells rampant in the AI industry we’ve seen for the last 3 years or do I have to? 🤣
English
4
0
2
258
Bill Lipe
Bill Lipe@bill_lipe·
Built MultiverseGuard for the Cerebras x Gemma 4 hackathon. It turns logs, operator notes, and dashboard screenshots into four parallel incident-response "universes," then ranks root causes with remediation + rollback guidance. Cerebras speed proof from my benchmark: 4 concurrent prompts: - Cerebras: 0.44s - Together GLM-5.2: 7.27s - 16.49x faster by wall time in this run I’m entering Enterprise Impact, Multiverse Agents, and People’s Choice. If you like it, I’d appreciate a watch, share, star, or vote: Video: attached Live app: …-efg4u29da9fvvmwce6tcqg.streamlit.app GitHub: github.com/bill-lipeproto… #Cerebras #Gemma #AIagents #LangGraph #IncidentResponse #Hackathon
English
2
0
1
119
Bill Lipe
Bill Lipe@bill_lipe·
@BrianRoemmele and @Scobleizer, this is how I spent my Sunday afternoon after church. All done in 9 hours. Open Source is breaking out all over the place. Cerebras in Sunnyvale is getting some interesting inference throughput on their hosted Gemma-4-31b model. I was seeing 1500-1700 tokens per second. Learned some new stuff today. After a wonderful Sunday morning in worship and fellowship, it was nice to crank away on this mini-project/contest, with Tenet and Indiana Jones series in the background.
English
0
0
0
52
Bill Lipe
Bill Lipe@bill_lipe·
I only bow/kneel to the Lord, Jesus Christ. In essence, all political parties are whatever it's leaders say it is. In part that what's Libertarians are about, except they can't seem to say what any of their crap means. Independent/No Party Preference is all I can stomach, where nobody tells me what is and isn't. Critical thinking and discernment are on my terms, with deference to God, as I can't possibly know everything. That is the big surrender for anyone with faith (or without it). We simply won't, and never will know everything, the plan, etc.; and that is a big relief, instead of worrying about it all of the time. Lord, make me an instrument of your peace!
English
1
0
1
75
Walter Kirn
Walter Kirn@walterkirn·
Democratic Socialism. Two words whose individual meanings few can agree on jammed together in a way that makes it impossible to agree on what the compound term means. It will mean what its leaders tell you it means. Whatever they need it to mean. What it really means is: Bow.
English
167
448
2.3K
28.9K
Bill Lipe
Bill Lipe@bill_lipe·
@growing_daniel Because most people build poor prompts, lacking tasteful creativity in how the prompt is built/engineered. Willing to share and produce a sampling. Provide minimal context for the challenge. Best, Bill
English
0
0
0
24
Daniel
Daniel@growing_daniel·
Why is AI writing still so bad
English
870
43
1.6K
322.3K
Bill Lipe
Bill Lipe@bill_lipe·
Reading through that post you quoted, the construction of the building is a big part of the problem. 85 million pounds of anything in a populated area doesn't sound safe. Local governments rely on consultants and staff to explain why a project should be approved, or not. Rarely do the representatives that vote up or down have a true understanding of what's being built. Those same representatives are vulnerable to corruption, cronyism, or outright political pressure, by industry or politicians of higher rank, state or federal. On the central coast, in Moss Landing, California, state politicians pressured our county supervisors to allow Vistra energy, from Texas, to build a battery storage complex to store solar power generated energy. The facility sits on the coast in a shuddered power plant, with the prevailing winds blowing inland across a populated unincorporated area of Monterey County, Prunedale, housing thousands of households. The battery farm caught fire, couldn't be put out with water, burned for many days, spewing a toxic cloud and particulates over the households and into the Elkhorn Slough wildlife sanctuary. The county supervisor in District 2, Glenn Church, stated on the record while interviewed, "There are powerful people that want these types of facilities built. We really didn't have a say in building it." That about sums up how it goes in California, with everything. From budgeting, school curriculum, high speed rail, water projects, etc.
English
0
1
4
207
Walter Kirn
Walter Kirn@walterkirn·
Going to take almost as long to put this fire out, it seems, as it does to count ballots in an LA election. It's a very easygoing place, Southern California.
Dave Toussaint@engineco16

Update on the Boyle Heights Lineage Commercial Building Fire, Los Angeles #LAFD Incident Objectives: The LAFD has transitioned its strategic goals to manage the extended aftermath of the commercial blaze. Biohazard Mitigation: Crews are pivoting from hazardous materials to containing the biohazard threat posed by the 85 million pounds of spoiling frozen food inside the building. Defensive Suppression: Firefighters are using water-dropping helicopters and external lines to target deep hot spots between the pallets and collapsed roof because structural compromise prevents crews from safely entering. Hazmat & Environmental Monitoring: Teams have pumped out the building's toxic anhydrous ammonia lines to eliminate chemical community risks, while continuing to track airborne particulate matter with the South Coast AQMD. Public Information Coordination: Officials are seeking a joint city and county state of emergency declaration to secure state resources and handle regional smoke advisories. Estimated Duration:- The LAFD expects this firefighting operation to be an extended event that could last for days or weeks. Even though forward progress of the open flames has been halted, the 500,000-square-foot facility acts like a giant insulated cooler. The corrugated steel walls are filled with highly dense foam that is burning very slowly and continually off-gassing deep within the structure. Because firefighters cannot enter the interior due to roof collapse risks, they must let the deeply buried pockets burn themselves out while keeping the fire as contained as possible from the perimeter. If you live nearby or smell smoke, would you like information on active smoke relief shelters in the area, or the latest air quality recommendations from local health officials?

English
15
45
374
17.9K
Bill Lipe retweetledi
ollama
ollama@ollama·
GLM 5.2 on Ollama's cloud just doubled GPU capacity to handle the volume of usage! This is all US based, and running on NVIDIA B300 Blackwell GPUs. We believe privacy matters! Let's go open models! ❤️
English
147
301
5.3K
536.8K
Bill Lipe
Bill Lipe@bill_lipe·
Orchestrating codex + gpt-5.5 & Grok Build & ollama + codex + GLM-5.2 cloud for a combined $140 a month ($20 + $99 [Grok Heavy discount] + $20) is a sturdy build and ship tritate. With a hardware upgrade this fall with RTX-Spark, GLM-5.2 will be locally driven. Times are a changing.
English
0
0
0
86
Bill Lipe
Bill Lipe@bill_lipe·
¿Cómo puede la gente votar por Yzra-Zoe en 2026? Ella ha estado en DC por casi 30 años. ¡Es hora de irse, Zoe! Grok Imagine 1.5 aprobado y generado. @elonmusk
Español
0
0
0
57
Bill Lipe
Bill Lipe@bill_lipe·
How can people vote for Yzra-Zoe in 2026? She's been in DC for almost 30 years. Time to go, Zoe! Grok Imagine 1.5 approved and generated.
English
2
0
0
69
Bill Lipe
Bill Lipe@bill_lipe·
I've already sketched out a plan, to utilize GLM-5.2 on Ollama Pro, thanks @ollama, for grunt work, optimized prompts to take maximum advantage of caching discounts, under the oversight of codex + GPT-5.5, and fellow co-worker Grok Build. An orchestration. Which is what I foresee with human in the loop engineering/building/imagining. When I'm able to update my hardware (currently running some smaller LLM's locally on my four year old I9-32gb-RTX3070ti), I'll be able to get the big OSS models in house, and train/fine-tune them for specialized work in Agriculture, Education, Coding, and any other domain willing to update themselves. I'm jacked! Would really like to have high protein training/tuning data to work with, hopefully soon. Best to you Brian and thank you!
English
0
0
3
171
Brian Roemmele
Brian Roemmele@BrianRoemmele·
The “leaders” in AI have caught the ear of the political class and are trying to do their best to limit AI once again. It will not work. What it will do is stymie the US by years. OPEN SOURCE GLM AI BEATS ANTHROPIC FABLE! BEATS IT ON LOCAL GPUS. BEATS IT.
Brian Roemmele tweet media
Brian Roemmele@BrianRoemmele

BOOM! OPEN SOURCE GLM BEATS THE FABLED FABLE! GLM-5.2 from Z.ai: The Open-Weight Model That Topped Claude Fable and Powers The Zero-Human Company Z.ai (Zhipu AI) released GLM-5.2 and our tests show it delivering a major leap in long-horizon agentic coding with a practical 1M-token context window, flexible reasoning effort levels (High/Max), and MIT open weights. Early benchmarks and community arenas show it excelling where it matters most for developers. We compared it to our first Anthropic Fable model tests and GLM did better! It leads open-weight models and has claimed the top spot on Design Arena (Elo 1360), and as I said is surpassing the now-unavailable Claude Fable 5. It also posts strong results on coding suites: 62.1% on SWE-bench Pro (beating GPT-5.5’s 58.6) and 81.0 on Terminal-Bench 2.1.106 Official blog: z.ai/blog/glm-5.2
 The Zero-Human Company Goes All-In At The Zero-Human Company, where AI agents handle nearly all operations, we’ve rolled out GLM-5.2 across all employee (agent) workflows for code generation, refactoring, debugging, and autonomous project execution. Its long-context reliability and agentic strengths make it ideal for sustained, multi-hour tasks without constant human oversight—perfect for a zero-human setup. We’re particularly excited about its open weights and local deployment, which ensures full data privacy and resilience—no external service dependencies or potential bans. Running GLM-5.2 Locally Thanks to its MIT license and strong inference support, you can run GLM-5.2 (744B total params, ~40B active MoE) on your own hardware today. Quantized versions (FP8, etc.) make it feasible on high-end setups. Quick start options (from the official GitHub): •vLLM: recipes.vllm.ai/zai-org/GLM-5.2 •SGLang: cookbook.sglang.io…/GLM-5.2 •Hugging Face Transformers or KTransformers for more options. •Full deployment guide: github.com/zai-org/GLM-5 Example setup with vLLM (Docker recommended for ease): # Clone repo and follow recipes for quantized inference # Supports reasoning_effort="max" (default) or "high" This local-first approach aligns perfectly with our zero-human philosophy: agents run securely on-prem, with full customizability. GLM-5.2 isn’t just competitive it’s a timely open alternative in a world of access restrictions. We’re thrilled to test and build with it company-wide. Expect more updates as our AI workforce puts it through real production. The myth of Mythos and the fable of Fable is entertaining but we are getting to work.

English
19
24
164
18.7K
Bill Lipe
Bill Lipe@bill_lipe·
@BrianRoemmele The path God sets for us is unknowable, and truly great and glorious. Great get! Anxious to see what God has in store for you building a full protein trained model. Best, Bill
English
0
0
3
112
Brian Roemmele
Brian Roemmele@BrianRoemmele·
I JUST GOT GIFTED 100s of 1000s OF MOSTLY UNPUBLISHED PHD DISSERTATION PAPERS ON MICROFICHE! This is an exciting discovery! In a metal basement cabinet of a former Ivy League physics professor was discovered shelves of microfiche at the great grandkids had no idea what to do. On May 15, 2026 It was time for the multigenerational family vacation property to be cleaned and remodeled for August retreat. It was owned by the retired professor who passed away 12 years ago with his wife passing in 2023. One of the grandkids who is now in his 40s was always curious by this cabinet so they got a locksmith to open it, perhaps the first time in 15 years and what it contained they didn’t quite understand. That grandkid turned out to be somebody that is a ReadMultiplex.com subscriber and he immediately identified he was looking at microfiche and something to do with scientific papers. We had a conversation a few weeks back and my research has produced something that is absolutely blowing me away. His grandpa, who, for many reasons I cannot name was conducting a wide ranging study of PhD dissertation papers from up to 16 universities. He ran a microfiche archiving operation for his work for decades. I discovered he had private funding to conduct this research and it continued on well into his retirement. Unfortunately, a fire at his residence in Boston destroyed most of his manuscripts on what he was trying to achieve and the index of everything that was in a cabinet. It was his hope when he moved to their vacation home to to live at his days to restart his work unfortunately, it’s not clear if he ever did and there’s no evidence that anybody can find. We can include that he discovered thousands of heavy gems and dissertation papers that never came to life, especially in the pre-Internet days. The earliest entry I see is from 1873 all the way up to the early 2000s. All the papers are signed based and mostly in physics. The largest cohort is from the 1950s. Some of the dissertations are handwritten, and most are manual typewritten. Some appear to be type set as a published book. Most of them have not been digitalized or even saved. And this was one of the problems. The professor had is dissertations at many universities were discarded after a period of time in batches. And with it all of the work of the best students those universities had it was a space, savings equation, and belief that the old ideas are always replaced by the new ideas. I’ve been collecting dissertation archives for decades, and this is the largest one I’ve come across. It is shocking to most people that most dissertations ever made prior to the year 2003 have not been digitalized and are mostly not available online. I have permission to digitalize the entire archive for AI training only. I’m working with The Family to try to open source that data so that is freely available for everybody. For obvious reasons, The Family wants to consult their attorney about open sourcing the data, but I’m already starting the digitalization process today. We are building an AI that thinks unlike any other AI platform ever created because of the thought process that went into these dissertation papers. For it is the thought process more importantly than the actual data contained that we’re trying to capture in this AI platform. Although the data contained is unavailable anywhere else. What is really heartbreaking is when I look at the data and almost every case it is the last copy of this PhD thesis that exists. And if the stars did not align, and God did not guide that cabinet was scheduled to be taken away by a “Got Junk” truck most surely gone forever. It is my hope that one day I get permission to name parts of this model after this professor. More soon!
Brian Roemmele tweet media
English
48
67
419
46K
Bill Lipe
Bill Lipe@bill_lipe·
It's the dominant hardware in the Salinas Valley. But it isn't completely, or imho solely, about replacing chemicals. The laser weeders and other robotic advancements in agriculture are replacing work crews, which are people (farmworkers). Labor has become very expensive, and carries a lot of overhead to find it, finance it, and provide all the benefits that California law requires. Robotics in the Salinas Valley and beyond is banking on (pun intended) replacing high cost, hard to find, workers willing to show up six to seven days a week.
English
0
0
6
346
Bill Lipe
Bill Lipe@bill_lipe·
@PredictJensen @analogalok I'm benchmarking gemma4-12b QAT now, in preparation for training it for specific coding use. All on a 4 year old laptop, 3080ti.
English
1
0
0
62
Alok
Alok@analogalok·
Gemma 4 12B QAT + MTP on 8GB VRAM. llama.cpp flags included. let's run it. 20+ tok/sec decode. 700+ tok/sec prefill. on a single RTX 4060. copy these exact flags: -m gemma-4-12B-it-qat-UD-Q4_K_XL.gguf \ --spec-type draft-mtp \ --spec-draft-n-max 4 \ --spec-draft-p-min 0.7 \ --spec-draft-model gemma-4-12B-it-qat-assistant-MTP-Q8_0.gguf \ -c 48000 -ngl 38 -v → -ngl 99 if you're on 12–24GB VRAM (RTX 3090, 4090, 4080, 3080, 4070 Ti, 3080 Ti) → -ngl 38 for 8GB setups with 48k context (RTX 4060, 3060, 2080, 2070, 3070) → drop it (or the context) lower if you're squeezing on 4–6GB (RTX 2060, 3050, 1660) my rig: RTX 4060 8GB · i7H · 16GB RAM MTP is giving me 25–40% decode throughput gains across Gemma 4 models. nearly zero VRAM cost for that bump. the draft assistant GGUF is only ~300–400MB depending on quant. one thing to know (architectural catch): unlike Qwen3.6 and Qwen3.5 models which bakes MTP heads straight into the base GGUF, Gemma 4 needs a separate draft assistant model downloaded alongside. not a big deal. just don't forget it or MTP won't run. draft assistant GGUF link → comments while you wait for anthropic mythos release, test this and drop your decode numbers below, curious how it scales across different setups.
Alok@analogalok

Run Gemma 4 26b MTP on 8 GB VRAM GPUs at 25+ tokens/second. Flags included! local llm space is moving at terminal velocity. only 3 days ago google released gemma 4 26b a4b qat quants. more efficient than before, ran on 8gb vram at 20 tok/sec. and now just a few hours ago, mainline llama.cpp merged a massive update and we just shattered our own record. decode throughput went 25-40% up on the same 8 GB VRAM setup! Before MTP: 20 tps -> After MTP: 28 tps! llama.cpp just officially merged PR #23398 ("add Gemma4 MTP"), bringing native Multi-Token Prediction (MTP) support to Gemma 4 models. By running speculative drafting on the same 8GB VRAM RTX 4060 setup, my decode throughput on a 64k context instantly leaped to a blistering 25–27 tokens/sec thats 25-30% increase with the same hardware. Here is the architectural catch you need to know: Unlike the Qwen 3.5 and 3.6 series, which bake the MTP heads directly into the base GGUF, the Gemma 4 MTP head is not built in. You must download a separate, specialized MTP drafter GGUF (the assistant model) to act as the speculator. (I've dropped the download link in the replies). copy and try the exact flags: -m gemma-4-26B-A4B-it-qat-UD-Q4_K_XL.gguf --spec-type draft-mtp --spec-draft-n-max 6 --spec-draft-p-min 0.7 --spec-draft-model gemma-4-26b-A4B-it-assistant-Q4_0.gguf -c 64000 -v n-max 4 and p-min 0.7 is also worth checking out. benchmark on your setup and workflow. if you have a single 8 gb vram nvidia rtx 4060, 3060, 3070, 2080, 2070, grab the MTP drafter GGUF link in the comments and try it yourself. Check it out even if you have asmaller or a larger gpu, such as a single rtx 3090, 4090, 3060, 2060. MTP works for all gemma 4 sizes such as gemma 4 12b, gemma 4 31b etc. but remember to grab the correct mtp draft assistant models respectively. what are you benchmarking today

English
12
36
295
28.6K
Bill Lipe
Bill Lipe@bill_lipe·
In Monterey County, there were four municipal measures in several cities, for parcel tax increases, and on in Pacific Grove to approve pay raises for City Council members (a new thing that's popped up in California law). Outside of the millionaire/billionaire household that are made up of Agricultural titans in the Salinas Valley, oil barons in south county, and the new and old uber wealthy in Del Monte Forest and it's shoreline along Pebble Beach, we're mostly a working class community. They're emphatically saying 'NO!'. Amen!
Bill Lipe tweet media
English
0
0
3
226
Bill Lipe
Bill Lipe@bill_lipe·
And it permeates all the way through to the local level, county and city, local agencies that control our water, schools, and Healthcare. A fully saturated omnipotent corrupt cluster that oozes hideous pus, from which you cannot ignore or unsee. Every day, every public meeting, it is there for all to see, for those that aren't in their spell. Still waiting for the fever to break here. Decades of waiting. Governor Gray Davis marked the first major downturn with pension reform that's killing our communities. A gross error so bad, he's even admitted it was a mistake. The people rose up and threw him out, replacing him with Swarzenegger. But no matter, the fix/con was in.
Bill Lipe tweet media
English
1
1
12
651
Walter Kirn
Walter Kirn@walterkirn·
The truth is that the D political machine in LA and California is in so deep with bad bad elements that it must either deliver for them or else. Above the compromised are the ruthless, above the ruthless are the diabolical. Etcetera. None are really free. Corruption creates a pyramid of hostages.
English
105
605
4K
48.4K