@iddar@ggerganov@erusev I've downloaded and run a quantized model with LlamaBarn, and now I want to run llama-server with an API key. How do I find the model's name, so I can run the same model without downloading it again?
@erusev@fishright@ggerganov I thought mentioning the API endpoint http://127.0.0.1:8080/v1 would be helpful for newbies like me. Okay, maybe it's not necessary.
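For what it's worth, the server's OpenAI-compatible API also includes a /v1/models route that reports the name of the loaded model. A minimal sketch (assuming the default llama-server address; the helper names are mine, not part of llama.cpp):

```python
import json
import urllib.request


def model_ids(payload: dict) -> list[str]:
    """Extract model names from an OpenAI-style /v1/models response."""
    return [m["id"] for m in payload.get("data", [])]


def list_models(base_url: str = "http://127.0.0.1:8080") -> list[str]:
    """Ask a running llama-server which model(s) it is serving."""
    with urllib.request.urlopen(f"{base_url}/v1/models") as resp:
        return model_ids(json.load(resp))


# With a model running in LlamaBarn you would call:
#   list_models()  # the exact name depends on the model you loaded
```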
@MamillAI@fishright@ggerganov I'm afraid I still don't understand. Can you share a specific use case that you have in mind? What is it that you would like to achieve? Thanks!
@erusev@fishright@ggerganov Sorry, I mean I have to provide a local LLM endpoint (and API key) to execute a workflow. Could LlamaBarn keep those details handy?
@iddar@ggerganov macOS often deletes files in /tmp, perhaps this is what happened.
Can you try to run it again and see if the .log file appears?
Also, is this Qwen3-VL-specific or does it freeze for other models as well?
Thanks 🙏
@ggerganov@erusev “Curated list” — I see the model downloader and the memory calculator, but nothing jumps out that says “don’t worry, you won’t fall into the ollama Modelfile/hf repo disconnect hell”
@iddar@ggerganov Could you please take a look at /tmp/llama-server.log and see if you can spot any issues there? It might give us some clues about why it froze.
@ggerganov@erusev I installed it from brew and when I tried to launch Qwen3-VL 2B on a 24GB Macbook Pro M4, it froze. Using llama.cpp compiled from Git works fine.
@jayrodge15@ggerganov LlamaBarn doesn't replace webUI, it builds on top of it — it's a thin wrapper of llama.cpp — when you run a model in LlamaBarn it starts the llama.cpp server and the llama.cpp webUI.
@kanwisher@ggerganov The idea is to make it easy to run an LLM on your device and then connect that LLM to whatever you want — similar to how you connect to a Wi-Fi network and use that connection in any app you want.
@kanwisher@ggerganov > Can we hook in some tools
Can you elaborate on that? LlamaBarn runs the llama.cpp server, and while it's running you can connect to it via the same OAI-compatible API.
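To illustrate the "connect to it" part — a minimal sketch, assuming the default llama-server address and the standard /v1/chat/completions route; the helper names here are mine:

```python
import json
import urllib.request


def build_chat_payload(prompt: str, model: str = "local") -> dict:
    """Build a minimal OpenAI-style chat request body."""
    return {"model": model, "messages": [{"role": "user", "content": prompt}]}


def chat(prompt: str, base_url: str = "http://127.0.0.1:8080") -> str:
    """Send one chat turn to the local server and return the reply text."""
    req = urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(build_chat_payload(prompt)).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["choices"][0]["message"]["content"]


# With a model running in LlamaBarn:
#   chat("Hello!")
```

Any client that speaks the OpenAI API (SDKs, workflow tools, etc.) can point at the same base URL.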
A couple of quick pieces of feedback:
1) Lots of people mentioned finding it
2) An initial onboarding screen that tells the user what it does and installs their first model
3) For the web GUI, there are a few missing features from librechat.ai that I would copy:
3a) Artifacts for code, so if I have 10 files they group together
3b) Forking a discussion
4) Can we hook in some tools?
4a) Memory
4b) Web search
4c) RAG
5) MCPs?
It looks like a good start. Looking forward to it taking over other web UIs.