mango

1K posts

mango

mango

@mango3771345

Pije sobie piwko

Katılım Ağustos 2024
86 Takip Edilen5 Takipçiler
mango
mango@mango3771345·
@KeitaroTR I need to fucking kill myself holy fuck
English
0
0
0
10
KeitaroTR
KeitaroTR@KeitaroTR·
Porque le sonríes a la pantalla...
Español
192
1.4K
21.7K
864.2K
mango
mango@mango3771345·
@0xDesigner Literal retard who thought a diet would cure cancer btw
English
0
0
1
14
0xDesigner
0xDesigner@0xDesigner·
i read the steve jobs biography like over a decade ago. i hardly remember much about the book but there was one part where old steve is on vacation in istanbul and a tour guide is explaining the history of turkish coffee and steve interrupts him with “why would anyone care about that?” and i think about that every time i read a viral ai post like this.
Andrej Karpathy@karpathy

LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating knowledge (stored as markdown and images). The latest LLMs are quite good at it. So: Data ingest: I index source documents (articles, papers, repos, datasets, images, etc.) into a raw/ directory, then I use an LLM to incrementally "compile" a wiki, which is just a collection of .md files in a directory structure. The wiki includes summaries of all the data in raw/, backlinks, and then it categorizes data into concepts, writes articles for them, and links them all. To convert web articles into .md files I like to use the Obsidian Web Clipper extension, and then I also use a hotkey to download all the related images to local so that my LLM can easily reference them. IDE: I use Obsidian as the IDE "frontend" where I can view the raw data, the the compiled wiki, and the derived visualizations. Important to note that the LLM writes and maintains all of the data of the wiki, I rarely touch it directly. I've played with a few Obsidian plugins to render and view data in other ways (e.g. Marp for slides). Q&A: Where things get interesting is that once your wiki is big enough (e.g. mine on some recent research is ~100 articles and ~400K words), you can ask your LLM agent all kinds of complex questions against the wiki, and it will go off, research the answers, etc. I thought I had to reach for fancy RAG, but the LLM has been pretty good about auto-maintaining index files and brief summaries of all the documents and it reads all the important related data fairly easily at this ~small scale. Output: Instead of getting answers in text/terminal, I like to have it render markdown files for me, or slide shows (Marp format), or matplotlib images, all of which I then view again in Obsidian. You can imagine many other visual output formats depending on the query. Often, I end up "filing" the outputs back into the wiki to enhance it for further queries. So my own explorations and queries always "add up" in the knowledge base. Linting: I've run some LLM "health checks" over the wiki to e.g. find inconsistent data, impute missing data (with web searchers), find interesting connections for new article candidates, etc., to incrementally clean up the wiki and enhance its overall data integrity. The LLMs are quite good at suggesting further questions to ask and look into. Extra tools: I find myself developing additional tools to process the data, e.g. I vibe coded a small and naive search engine over the wiki, which I both use directly (in a web ui), but more often I want to hand it off to an LLM via CLI as a tool for larger queries. Further explorations: As the repo grows, the natural desire is to also think about synthetic data generation + finetuning to have your LLM "know" the data in its weights instead of just context windows. TLDR: raw data from a given number of sources is collected, then compiled by an LLM into a .md wiki, then operated on by various CLIs by the LLM to do Q&A and to incrementally enhance the wiki, and all of it viewable in Obsidian. You rarely ever write or edit the wiki manually, it's the domain of the LLM. I think there is room here for an incredible new product instead of a hacky collection of scripts.

English
91
179
6.4K
980.2K
mango
mango@mango3771345·
@yengyosi_ Deniers, falsifiers, modernizers, activists all of them
English
0
0
0
440
Dimitri
Dimitri@thedimitri·
The evil that is female intrasexual competition never ceases to amaze me
Dimitri tweet media
English
232
534
22.8K
967.3K
mango
mango@mango3771345·
@vikhyatk It's literally just spatial optimization
English
0
0
0
59
vik
vik@vikhyatk·
you are not a "first principles contrarian thinker" if you fold your clothes. completely unnecessary, useless ritual
English
71
33
759
35.6K
mango
mango@mango3771345·
@MattZeitlin But physics is german. Chemistry is more of a french thing really
English
0
0
0
231
Matthew Zeitlin
Matthew Zeitlin@MattZeitlin·
slightly goofy take i'm trying out: american culture structurally overrates physics and its importance because it's the "good" "liberal" science, meanwhile chemistry is german and therefore bad and we're less knowledgable and appreciative of it
English
55
25
708
48.1K
mango
mango@mango3771345·
@growing_daniel The poverty that let me eat raisins and butter cookies instead of "food" primarily consisting of HFCS and palm oil
English
0
0
0
74
Daniel
Daniel@growing_daniel·
what kind of poverty do you have to be raised in to want an oatmeal raisin cookie. A treat for 19th century english orphans
English
665
457
7.1K
435.8K
avy
avy@avycadotoast·
planned the coolest first date ever but i had to cancel because my mom was insisting i go shopping with her and GUESS WHAT he went with someone else instead ??😭😭😭😭
avy tweet media
English
494
1.4K
162.9K
4.7M
Rick Prime
Rick Prime@Ricckc137·
Mówię, że czegoś nigdy nie jadłam i nie zjem bo tego nie lubię. Ktoś odpowiada, że skoro nigdy nie jadłam to skąd wiem, że nie lubię. Człowieku, mówię, że tego nie lubię, a nie, że mi nie smakuje. Faktura, wygląd, zapach, kolor, kształt wystarczy żeby danej potrawy nie lubić.
Polski
26
9
163
7.9K
mango
mango@mango3771345·
@bunnisaki I don't even deserve to live
English
0
0
0
167
doll
doll@bunnisaki·
everyone deserves a gf who cosplay their fav characters for them
English
137
712
7.4K
214.6K
mango
mango@mango3771345·
@nyxanaa_ Użytkownik "kokainowa kurwa"
Polski
0
0
1
896
cocainewhore🦴
cocainewhore🦴@nyxanaa_·
czemu dziewczyny na siłę są body positive i jak inna laska ma otyłość to jej piszą body goal i body tea kurwa xd nie jest body goal tylko jest spasłą cysterną i powinna się wziąć za siebie a nie jeszcze jej wmawiacie że dobrze wygląda
Polski
61
36
1.3K
55K
mango
mango@mango3771345·
@sknerus_ Ma racje. Cwele typu zapiekanka bez pieczarek to powinni dwa razy tyle płacić
Polski
0
0
1
1.3K
𝚂𝚔𝚗𝚎𝚛𝚞𝚜
Polskie gastro to już do reszty pojebało. 4 zł dopłaty za odjęcie pieczarek z zapiekanki XDDD
Polski
33
13
1.7K
123.2K
tadano! 🪄
tadano! 🪄@x00ge_ii63·
the easiest way to tell if someone read marx is literally just to ask them what "Capital" is supposed to mean to marx
English
70
50
4.7K
334.8K
Angie Kulus
Angie Kulus@angiekulus·
@KacperDamian1 @akiberru No ja np. mam uczulenie na jabłka i człowiek chce coś na szybko, jakiś mus owocowy gdzie jest MALINATRUSKAWKAJAGODY, a patrzy w skład, a tam większość to jabłko 😅
Polski
2
0
0
4.6K
mango
mango@mango3771345·
@SamMacD86958750 @Empty_America You just neet a pot for deep frying. It's the same technology as boiling just with oil instead of water.
English
0
0
0
15
Smac
Smac@SamMacD86958750·
@Empty_America Also kind of requires a metal pan? Or something close? I remember going to a presentation about Native Americans boiling sap. They did it in birch-bark boxes. Incredible that they managed to survive. The metal cook pot was some kind of revelation, I think. Complete wizardry.
English
1
0
6
501
mango
mango@mango3771345·
@Empty_America Deep frying is an ancient culinary technique (much older than pan-frying and sauteeing) and was probably fairly common for feasts. Fat were way less expensive than the spices they used anyways.
English
0
0
1
1.1K
mango
mango@mango3771345·
@zielonaherbacia Po chuj wy wszyscy piszecie xeety wpół po angielsku?
Polski
0
0
1
182
seraph!!
seraph!!@zielonaherbacia·
Nienawidze jak ludzie w ogóle akceptują taki koncept jak mozliwosc bycia “too woke”. Brother we need u to be woke during these times
Polski
8
20
176
2.5K
mango
mango@mango3771345·
@hagen028 Nie kurwa nie będziemy kolegami bo ostateczny cel Niemców jest zniszczenie Europy, Polski, i wszelkich kultur z którymi mają styczność.
Polski
0
0
0
326