DinDin

220 posts

DinDin banner
DinDin

DinDin

@Dindin97101

iOS Developer

Taipei City, Taiwan Katılım Haziran 2013
507 Takip Edilen181 Takipçiler
大軍軍❤️
大軍軍❤️@DAGINGINGIN·
第一次真的離職🏆(1/1)
大軍軍❤️ tweet media
中文
11
0
180
7.9K
DinDin retweetledi
Andrej Karpathy
Andrej Karpathy@karpathy·
LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating knowledge (stored as markdown and images). The latest LLMs are quite good at it. So: Data ingest: I index source documents (articles, papers, repos, datasets, images, etc.) into a raw/ directory, then I use an LLM to incrementally "compile" a wiki, which is just a collection of .md files in a directory structure. The wiki includes summaries of all the data in raw/, backlinks, and then it categorizes data into concepts, writes articles for them, and links them all. To convert web articles into .md files I like to use the Obsidian Web Clipper extension, and then I also use a hotkey to download all the related images to local so that my LLM can easily reference them. IDE: I use Obsidian as the IDE "frontend" where I can view the raw data, the the compiled wiki, and the derived visualizations. Important to note that the LLM writes and maintains all of the data of the wiki, I rarely touch it directly. I've played with a few Obsidian plugins to render and view data in other ways (e.g. Marp for slides). Q&A: Where things get interesting is that once your wiki is big enough (e.g. mine on some recent research is ~100 articles and ~400K words), you can ask your LLM agent all kinds of complex questions against the wiki, and it will go off, research the answers, etc. I thought I had to reach for fancy RAG, but the LLM has been pretty good about auto-maintaining index files and brief summaries of all the documents and it reads all the important related data fairly easily at this ~small scale. Output: Instead of getting answers in text/terminal, I like to have it render markdown files for me, or slide shows (Marp format), or matplotlib images, all of which I then view again in Obsidian. You can imagine many other visual output formats depending on the query. Often, I end up "filing" the outputs back into the wiki to enhance it for further queries. So my own explorations and queries always "add up" in the knowledge base. Linting: I've run some LLM "health checks" over the wiki to e.g. find inconsistent data, impute missing data (with web searchers), find interesting connections for new article candidates, etc., to incrementally clean up the wiki and enhance its overall data integrity. The LLMs are quite good at suggesting further questions to ask and look into. Extra tools: I find myself developing additional tools to process the data, e.g. I vibe coded a small and naive search engine over the wiki, which I both use directly (in a web ui), but more often I want to hand it off to an LLM via CLI as a tool for larger queries. Further explorations: As the repo grows, the natural desire is to also think about synthetic data generation + finetuning to have your LLM "know" the data in its weights instead of just context windows. TLDR: raw data from a given number of sources is collected, then compiled by an LLM into a .md wiki, then operated on by various CLIs by the LLM to do Q&A and to incrementally enhance the wiki, and all of it viewable in Obsidian. You rarely ever write or edit the wiki manually, it's the domain of the LLM. I think there is room here for an incredible new product instead of a hacky collection of scripts.
English
2.9K
7.1K
58.5K
21M
13
13@ethanhuang13·
話說,我跟 @janechao_dev 還有幾個朋友 打算在今年暑假為台灣的學生籌備一場 iOS AI Coding 夏令營 有興趣出錢出力的朋友可以跟我說 細節我之後會寫在 13 報
中文
2
7
43
3.6K
DinDin
DinDin@Dindin97101·
原本都是用notion存ai日記與ai週記,結果這週ai太會寫,字數多到無法直接貼上,要先創md檔再匯入,想說倒不如轉成本地端作業,因此發現另一個好方法,除了開啟速度快,還可以使用AI IDE來協助我做筆記跟查詢內容了 youtu.be/IlNOhNeWGgY?si
YouTube video
YouTube
中文
1
0
1
217
大軍軍❤️
大軍軍❤️@DAGINGINGIN·
我不太會投資 現在每天聽財經皓角通勤 希望可以慢慢培養一些投資的概念
中文
3
0
12
818
DinDin retweetledi
13
13@ethanhuang13·
如果每天早上 6 點,你的信箱都有一篇新文章,帶來工具技巧、策略方法、系統思考、人生哲理,各種層次的刺激。持續累積,一年之後,你會發現自己的成長。 現在就是訂閱 13+ 的最好時機! (請幫忙推薦與轉發,謝謝!)
13 tweet media13 tweet media13 tweet media13 tweet media
中文
1
7
31
6.9K
DinDin
DinDin@Dindin97101·
@ethanhuang13 在美國放在胸前可當另一種保險?
中文
0
0
1
159
13
13@ethanhuang13·
Joz 就這樣把 iPhone Air 丟給主持人 還現場試折 這麼有自信 Apple 一定做了很多測試 好,買了
中文
7
3
35
5.9K
大軍軍❤️
大軍軍❤️@DAGINGINGIN·
入職之後好久沒有接到HH電話了 要不要去試試呢?
中文
1
0
12
657
大軍軍❤️
大軍軍❤️@DAGINGINGIN·
快要34了 今年是認真感到變老的一年 各種肩頸酸痛、失眠 第一條皺紋出現 白頭髮越來越猖狂 要維持一個人樣的成本逐漸上升中
大軍軍❤️ tweet media
中文
4
0
91
1.4K
DinDin
DinDin@Dindin97101·
天氣超好,感謝大神這次准許入山 #富士山 #fuji_mountain
中文
1
0
5
210
DinDin
DinDin@Dindin97101·
日本機師盤旋畫圈比我用筆畫的還準還用心
DinDin tweet media
中文
1
0
2
189
DinDin retweetledi
Dong
Dong@dongdong867·
恭賀今年成為 iPlayground 2025 順利結束 🎉 感謝 @theiPlayground 的邀請, 讓我第一次作為講者在這麼多人前面分享 在這兩天裡面遇到了各種人, 不管是各種資深的前輩們還是少年有成的新血, 都讓我獲益匪淺 (社交能量歸零), 希望我的小小演講能夠為各位帶來一些收穫 #iPlayground
Dong tweet mediaDong tweet mediaDong tweet mediaDong tweet media
中文
4
3
27
1.1K
DinDin retweetledi
Chiao
Chiao@Chiao3145·
年度發文記錄一下~終於忙完了🎉🎉🎉 這是我iPlayground的頭投影片,也有更新連結到共筆上,有興趣的朋友歡迎載回去看看: speakerdeck.com/chiaoteni/mcp-… #研討會後又忙了兩天TvT #杯子感覺到時代的演進 #研討會的一切很有趣 #iPlayground
Chiao tweet media
中文
2
3
19
811
DinDin
DinDin@Dindin97101·
@ethanhuang13 大家一人愛心鼓勵我們13老師好好放假,大家說好不好?
中文
1
0
2
85
13
13@ethanhuang13·
雖然這樣說有點對不起明天開始要上我課的同學們 但是我現在好想手刀買機票飛去九州九週
中文
1
0
18
1.5K
DinDin
DinDin@Dindin97101·
@DAGINGINGIN 能穿上就是智慧跟力量的象徵
中文
0
0
2
32
大軍軍❤️
大軍軍❤️@DAGINGINGIN·
這個磅數有點尬(?
大軍軍❤️ tweet media
中文
6
0
66
1.3K
DinDin
DinDin@Dindin97101·
@iamkuni 擺了就可以讓日本講者說研討會終於換菜色了(誤
中文
0
0
2
68
Ben
Ben@iamkuni·
iPlayground心得還來不及寫 結果香菇今天在說,如果可以接iplayground的餐點,好像很酷。 然後一直在旁邊跟我討論要怎麼做,我就跟她說400份難度很高,就算一半也還有200份,算了一下肉和醬還有飯讓她覺得困難。不過,最後根據場地限制,我們還是討論了一個可行的餐點,大概就是市集的咖哩熱狗堡之類的,也還問我上個月限定的口味優格雞咖哩的醬是不是可以做成熱狗堡的口味。 講著講著,在旁邊自己很開心的說,她明年想要參戰,要跟 @hokilaJ 說。(我好怕) #iplayground
中文
12
1
31
2.3K