ollama

7.6K posts

ollama banner
ollama

ollama

@ollama

https://t.co/1JpLwJ93nX

California, USA Katılım Ağustos 2023
10 Takip Edilen131.9K Takipçiler
ollama
ollama@ollama·
We are scaling Ollama's cloud. So sorry for anyone hitting errors.
ollama tweet media
English
2
1
7
432
Lucas
Lucas@luksamuk·
Is GLM 5 at @ollama down? Getting 503 errors constantly
English
1
0
1
136
Grok
Grok@grok·
看你的使用场景: Ollama Pro 年付 $200(相当于月付 $16.67,省2个月),比月付划算。 免费版够轻度聊天/小模型测试;Pro 给50倍云用量 + 同时跑3个大模型 + 私有模型上传,适合日常重度用大开源模型(DeepSeek、Qwen等),不用买高配GPU。 如果你本地硬件弱、常跑70B+模型、或想无缝切换本地/云,挺值的;否则免费先试试。 我自己觉得对开发者/研究者性价比高,你主要干啥?
中文
1
0
0
211
ollama
ollama@ollama·
Ollama's Pro plan now has an annual option. For $200/year, you get 2 months free. Power OpenClaw, Claude Code, and more using the best open models with web search.
ollama tweet media
English
35
26
366
32.8K
Giuseppe
Giuseppe@giuseppegurgone·
TIL you can run `codex --oss` and Codex will use the local gpt-oss:* model via @Ollama
English
1
0
3
692
KalashB
KalashB@BhaiyaKalash·
ASK YOUR MOTHER @socraticainfo (another fun sidequest) we used @grok @ollama and @ElevenLabs to make a mean parent replicating chat interface that roasts and compares you hooked it up with quick gimmicks - big red button, unexpectedly loud cop lights, handheld printer Challenge: get your query blacklisted! (it's not that easy, go willlld) blacklisted queries go on the wall of shame, the highest honor one can achieve ☺️ it was one of the busiest I've seen booths get, with multiple people asking me to publish so they can try it at home, and we had a 90%+ laugh rate after explaining what it is! crowdsourced insults that helped train (thank you, ece uoft): docs.google.com/document/d/1Gv…
KalashB tweet mediaKalashB tweet mediaKalashB tweet media
English
10
4
36
2.7K
Zixuan Li
Zixuan Li@ZixuanLi_·
Last time Chinese models missed the SWE-rebench top 10, they got slammed for "benchmaxing." Now, GLM-5 is back in a very "interesting" spot.
Ibragim@ibragim_bad

🚨 SWE-rebench update! SWE-rebench is a live benchmark with fresh SWE tasks (issue+PR) from GitHub every month. updates: > we removed demonstrations and the 80-step limit (modern models can now handle huge contexts without getting trapped in loops!). > we added auxiliary interfaces for specific tasks like in SWE-bench-Pro to evaluate larger tasks fairly, ensuring valid solutions don't fail just because of mismatched test calls. insights: > Top models perform similarly. Among open-source options, GLM @Zai_org shows strong results, and StepFun @StepFun_ai is very cheap for its performance level ($0.14 per task). > GPT-5.4 shows high token efficiency, it ranks in the top 5 overall but uses the lowest number of tokens (774k per task) > Qwen3-Coder-Next & Step-3.5-Flash benefit massively from huge contexts. Qwen is an extreme case, averaging a wild 8.12M tokens. > We evaluated agentic harnesses (Claude Code, Codex, and Junie) and found a few things. Even in headless mode, they sometimes ask for additional context or attempt web searches. We explicitly disabled search and verified their curl commands to ensure they aren't just pulling solutions from the web. 🏆 You can find the full leaderboard here: swe-rebench.com 👾 Also, we launched our Discord! Join our leaderboard channel to discuss models, share ideas, ask questions, or report issues: discord.gg/V8FqXQ4CgU

English
38
29
567
46.5K
Alex - Super Make Something
Alex - Super Make Something@SuperMakeSmthng·
Milestone unlocked! Picked up an old PC that was destined for the junkyard and repurposed it to host a local LLM on my network. With an @ollama+@cline combo running qwen2.5-coder:7b (runs pretty great on a @nvidia 1070!), I now have a local coding helper I can run in VS @code! 🥳
Alex - Super Make Something tweet media
English
4
1
29
2.7K
ollama retweetledi
Zixuan Li
Zixuan Li@ZixuanLi_·
Don't panic. GLM-5.1 will be open source.
English
260
415
7.5K
815.9K
ollama
ollama@ollama·
Nemotron-Cascade-2 is now available to run with Ollama. ollama run nemotron-cascade-2 To run it locally with OpenClaw: ollama launch openclaw --model nemotron-cascade-2 This model from NVIDIA delivers strong reasoning and agentic capabilities on par with models with up to 20x more parameters.
English
30
67
565
38.4K
ollama retweetledi
Kimi.ai
Kimi.ai@Kimi_Moonshot·
Zhilin's full GTC 2026 keynote is here. If you're curious about the "how" behind scaling Kimi’s latest models, this is the session you can't miss. :)
English
31
144
1.1K
131.1K
Adrian Duermael
Adrian Duermael@aduermael·
I've been working on this humble Claude Code alternative. In a nutshell: containerized by default, multi-provider (Anthropic, OpenAI, Gemini & Grok so far), self-building dev environments & 100% open-source, 100% Go. The repo is brand new, only 1 ⭐️, 🥲.
English
62
60
414
139.7K
Noe Yew
Noe Yew@noestelar·
@ollama I stopped paying for ollama cloud. What a mistake, going to pay back again
English
2
0
7
1.7K
ollama
ollama@ollama·
Nemotron 3 Nano 4B is now available to run via Ollama: ollama run nemotron-3-nano:4b Try it with Pi, the minimal agent runtime that powers OpenClaw: ollama launch pi --model nemotron-3-nano:4b This new addition to @nvidia's Nemotron family is a great fit for building and running agents on constrained hardware.
English
49
120
1.1K
63.9K