Ayan Das

2.7K posts

Ayan Das banner
Ayan Das

Ayan Das

@dasayan05

AI Research @Huawei UK | PhD | Generative AI | Theory & Applied | Blogs @ https://t.co/wlzHeUFhk2 | Motorcyclist 🏍️ | Traveller 🏝️ | Cat 🐱 lover |🇬🇧 🇮🇳

UK | India 가입일 Ekim 2016
1.1K 팔로잉992 팔로워
Tesla Inside
Tesla Inside@TSLA_inside_·
The most difficult roundabout in the Netherlands, Keizer Karelplein in Nijmegen. No effort for FSD. $TSLA
English
238
292
3.9K
12.3M
Massimo
Massimo@Rainmaker1973·
Can you believe it? 1,500 calories can look completely different.
English
953
892
6.4K
1.1M
Ayan Das
Ayan Das@dasayan05·
@aaronp613 why/how tf this app is still on your phone ?
English
1
0
284
67K
Aaron
Aaron@aaronp613·
How/why the fuck is this app still getting bug fix updates
Aaron tweet media
English
77
88
21.4K
862.4K
Ayan Das
Ayan Das@dasayan05·
@iTejasJagtap but we want info about his daughter, specifically her instagram ID 🤣
English
0
0
0
42
Tejas
Tejas@iTejasJagtap·
Jameel Jamali's character is actually based on Altaf Hussain, not Nabil Gabol Altaf has only one daughter. He got married at the age of 47 with a 27 year old Baloch woman. He was accused of working for R&AW. Little to no public info about his daughter.
Tejas tweet media
English
115
564
7.3K
685.3K
Ayan Das
Ayan Das@dasayan05·
@twholman also true for any cool threejs projects
English
0
0
3
952
Tim Holman
Tim Holman@twholman·
The first rule of html-in-canvas api, never share anything but a video of your html-in-canvas demo.
English
8
10
322
27.7K
Jargon
Jargon@ajansJargon·
Doğum sırasında amniyotik kesesi yırtılmayan bir bebek, doktorların müdahalesiyle dış dünyayla tanıştı.
Türkçe
151
75
3.4K
4.7M
Ryan Els
Ryan Els@ryanels·
Can you help me figure out why my code isn't working? 🤔
Ryan Els tweet media
English
12
1
22
2K
Ayan Das
Ayan Das@dasayan05·
@sciencegirl I skipped the last part and almost concluded that the bird uploaded the video
English
0
0
0
70
Science girl
Science girl@sciencegirl·
A bird snatched a phone from a girl’s hands right in the middle of her recording
English
159
292
3K
113.7K
Liora
Liora@Liora_quotes·
she had ONE job i’m in tears😭
English
361
190
14.8K
11.4M
Ayan Das
Ayan Das@dasayan05·
@Samaytwt let me guess it asks for min 8 yrs of experience
English
0
0
0
76
Samay
Samay@Samaytwt·
wtf I thought Vibe Coding is just a meme, you guys were serious?
Samay tweet media
English
72
30
1.2K
50.5K
Ifran Bhuiyan 🇧🇩
Ifran Bhuiyan 🇧🇩@mohammad_i77884·
This map relates to the history, prosperity, and glory of the Muslims of the Bengal region. This is where the glorious "Sultanat-e-Bangalah" was located. Greater Bangladesh will be established one day, InshaAllah. 🇧🇩
Ifran Bhuiyan 🇧🇩 tweet media
English
380
20
242
43.5K
Vintage Maps
Vintage Maps@vintagemapstore·
Countries whose local names are extremely different from the names they're referred to in English
Vintage Maps tweet media
English
133
233
4.1K
430.2K
Ayan Das
Ayan Das@dasayan05·
@karpathy no shit, I was doing the same thing for a while now
English
0
0
0
81
Andrej Karpathy
Andrej Karpathy@karpathy·
LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating knowledge (stored as markdown and images). The latest LLMs are quite good at it. So: Data ingest: I index source documents (articles, papers, repos, datasets, images, etc.) into a raw/ directory, then I use an LLM to incrementally "compile" a wiki, which is just a collection of .md files in a directory structure. The wiki includes summaries of all the data in raw/, backlinks, and then it categorizes data into concepts, writes articles for them, and links them all. To convert web articles into .md files I like to use the Obsidian Web Clipper extension, and then I also use a hotkey to download all the related images to local so that my LLM can easily reference them. IDE: I use Obsidian as the IDE "frontend" where I can view the raw data, the the compiled wiki, and the derived visualizations. Important to note that the LLM writes and maintains all of the data of the wiki, I rarely touch it directly. I've played with a few Obsidian plugins to render and view data in other ways (e.g. Marp for slides). Q&A: Where things get interesting is that once your wiki is big enough (e.g. mine on some recent research is ~100 articles and ~400K words), you can ask your LLM agent all kinds of complex questions against the wiki, and it will go off, research the answers, etc. I thought I had to reach for fancy RAG, but the LLM has been pretty good about auto-maintaining index files and brief summaries of all the documents and it reads all the important related data fairly easily at this ~small scale. Output: Instead of getting answers in text/terminal, I like to have it render markdown files for me, or slide shows (Marp format), or matplotlib images, all of which I then view again in Obsidian. You can imagine many other visual output formats depending on the query. Often, I end up "filing" the outputs back into the wiki to enhance it for further queries. So my own explorations and queries always "add up" in the knowledge base. Linting: I've run some LLM "health checks" over the wiki to e.g. find inconsistent data, impute missing data (with web searchers), find interesting connections for new article candidates, etc., to incrementally clean up the wiki and enhance its overall data integrity. The LLMs are quite good at suggesting further questions to ask and look into. Extra tools: I find myself developing additional tools to process the data, e.g. I vibe coded a small and naive search engine over the wiki, which I both use directly (in a web ui), but more often I want to hand it off to an LLM via CLI as a tool for larger queries. Further explorations: As the repo grows, the natural desire is to also think about synthetic data generation + finetuning to have your LLM "know" the data in its weights instead of just context windows. TLDR: raw data from a given number of sources is collected, then compiled by an LLM into a .md wiki, then operated on by various CLIs by the LLM to do Q&A and to incrementally enhance the wiki, and all of it viewable in Obsidian. You rarely ever write or edit the wiki manually, it's the domain of the LLM. I think there is room here for an incredible new product instead of a hacky collection of scripts.
English
2.7K
6.6K
55.7K
19.8M
Ayan Das
Ayan Das@dasayan05·
@YearOfTheKraken Bro is playing the long game, as he should, as he was trained for. 🫡
English
0
0
0
324
Sensei Kraken Zero
Sensei Kraken Zero@YearOfTheKraken·
Real Life Jameel Jamali, Nabil Gabol, is out here posting reels on Instagram with Dhurandhar 2 Music "Dil me Zakhm khaate hain, Jaan se guzar jaate hain." 😭😭😭
हिन्दी
33
119
954
67.1K