Xiaoliu.x

212 posts

Xiaoliu.x banner
Xiaoliu.x

Xiaoliu.x

@xiaolGo

ex-algo engineer → architect → founder → researcher → ? My reading list: https://t.co/raW6A7xfMS https://t.co/b0JOLNr4bE

Katılım Ocak 2010
19 Takip Edilen127 Takipçiler
Xiaoliu.x
Xiaoliu.x@xiaolGo·
Current LLMs are trapped in the dilemma of incremental inovations.
English
0
0
0
17
Keon Kim
Keon Kim@keonwkim·
I created a minimal one-file implementations (160loc) of JEPA family (ijepa, vjepa, vjepa2, cjepa) for educational purposes Making things minimal and removing all the things needed for scaling the algorithm always helped me understanding. So I stripped everything but the algorithm parts. What's left is 160-200 lines of code that distills the essence of the mathematics. It is very easy to compare with the math in the paper and the code and how it can be implemented in PyTorch. I added [algo]_tutorial.md files to help with understanding. github.com/keon/jepa
English
10
48
352
20.4K
Tanishq Mathew Abraham, Ph.D.
Tanishq Mathew Abraham, Ph.D.@iScienceLuvr·
I'm a simple man, I see a Kaiming He paper, I click. ELF: Embedded Language Flows This is very interesting, getting continuous diffusion models working for text! "Unlike existing DLMs, ELF predominantly stays within the continuous embedding space until the final time step, where it maps to discrete tokens using a shared-weight network." @sedielem you might like this one!
Tanishq Mathew Abraham, Ph.D. tweet media
English
14
96
821
153.3K
OpenMOSE
OpenMOSE@_m0se_·
RWKV-Gemma-5B-E2B-Preview-0.1(maybe 0.0.1??) it's "Architecture PoC" for SWA + RWKV-7 + TICA triple-hybrid on Gemma-4. End-to-end works, Passkey holds at ~90k — that's all I'm claiming for v0.1. Real target is the 26B MoE, back to that now. huggingface.co/OpenMOSE/RWKV-…
English
1
7
56
3.6K
Xiaoliu.x
Xiaoliu.x@xiaolGo·
@_m0se_ For a 90K context length, does this capability come from SFT on longer context training data?
English
1
0
1
124
Xiaoliu.x retweetledi
OpenMOSE
OpenMOSE@_m0se_·
RWKV Linear Gemma + Search 少しは生成できるようになってきました。
日本語
1
2
24
2.6K
Xiaoliu.x retweetledi
BlinkDL
BlinkDL@BlinkDL_AI·
RWKV-LM now trains 40% faster and with 40% less VRAM 🙂 further optimizations WIP github.com/BlinkDL/RWKV-LM
English
3
13
44
3K
Xiaoliu.x
Xiaoliu.x@xiaolGo·
Doing expriments has never been this easy
Xiaoliu.x tweet media
English
0
0
1
27
Jason
Jason@mytechceoo·
CEO obsessed with token maxxing
English
282
1K
13K
1.9M
OpenMOSE
OpenMOSE@_m0se_·
今日分かった発見はkl divergenceは"噓をつく" MLP凍結におけるAttentionのみ調整では 迂回ネットワークが構築されklが見かけだけ下がる が、各隠れ状態アライメントは悪化。 ここからは推測 これにより各層の関数が崩壊し、multi-tracking能力がトレーニング後半で大幅悪化する
OpenMOSE tweet media
日本語
2
10
85
9.1K
Alexandr Wang
Alexandr Wang@alexandr_wang·
1/ today we're releasing muse spark, the first model from MSL. nine months ago we rebuilt our ai stack from scratch. new infrastructure, new architecture, new data pipelines. muse spark is the result of that work, and now it powers meta ai. 🧵
Alexandr Wang tweet media
English
728
1.2K
10.3K
4.5M
Xiaoliu.x
Xiaoliu.x@xiaolGo·
In every game, you need a cash cow to build your strategy.
English
0
0
0
24
Xiaoliu.x
Xiaoliu.x@xiaolGo·
@karpathy But I’m too lazy to guide the LLM to digest information in a standard way — What you propsed bridge the gap between me and the LLM, it's a niche guideline, really inspiring!
English
0
0
0
6
Xiaoliu.x
Xiaoliu.x@xiaolGo·
I have read thousands of papers using this kind of technology with Manim or podcasts. It’s interesting how everything stays maintained within one project. The interactions make research sometimes feel like a gacha game, where insights can spark gracefully. You can also easily create quizzes and image overviews.
English
1
0
0
18
Andrej Karpathy
Andrej Karpathy@karpathy·
LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating knowledge (stored as markdown and images). The latest LLMs are quite good at it. So: Data ingest: I index source documents (articles, papers, repos, datasets, images, etc.) into a raw/ directory, then I use an LLM to incrementally "compile" a wiki, which is just a collection of .md files in a directory structure. The wiki includes summaries of all the data in raw/, backlinks, and then it categorizes data into concepts, writes articles for them, and links them all. To convert web articles into .md files I like to use the Obsidian Web Clipper extension, and then I also use a hotkey to download all the related images to local so that my LLM can easily reference them. IDE: I use Obsidian as the IDE "frontend" where I can view the raw data, the the compiled wiki, and the derived visualizations. Important to note that the LLM writes and maintains all of the data of the wiki, I rarely touch it directly. I've played with a few Obsidian plugins to render and view data in other ways (e.g. Marp for slides). Q&A: Where things get interesting is that once your wiki is big enough (e.g. mine on some recent research is ~100 articles and ~400K words), you can ask your LLM agent all kinds of complex questions against the wiki, and it will go off, research the answers, etc. I thought I had to reach for fancy RAG, but the LLM has been pretty good about auto-maintaining index files and brief summaries of all the documents and it reads all the important related data fairly easily at this ~small scale. Output: Instead of getting answers in text/terminal, I like to have it render markdown files for me, or slide shows (Marp format), or matplotlib images, all of which I then view again in Obsidian. You can imagine many other visual output formats depending on the query. Often, I end up "filing" the outputs back into the wiki to enhance it for further queries. So my own explorations and queries always "add up" in the knowledge base. Linting: I've run some LLM "health checks" over the wiki to e.g. find inconsistent data, impute missing data (with web searchers), find interesting connections for new article candidates, etc., to incrementally clean up the wiki and enhance its overall data integrity. The LLMs are quite good at suggesting further questions to ask and look into. Extra tools: I find myself developing additional tools to process the data, e.g. I vibe coded a small and naive search engine over the wiki, which I both use directly (in a web ui), but more often I want to hand it off to an LLM via CLI as a tool for larger queries. Further explorations: As the repo grows, the natural desire is to also think about synthetic data generation + finetuning to have your LLM "know" the data in its weights instead of just context windows. TLDR: raw data from a given number of sources is collected, then compiled by an LLM into a .md wiki, then operated on by various CLIs by the LLM to do Q&A and to incrementally enhance the wiki, and all of it viewable in Obsidian. You rarely ever write or edit the wiki manually, it's the domain of the LLM. I think there is room here for an incredible new product instead of a hacky collection of scripts.
English
2.9K
7.1K
58.9K
21.1M
Xiaoliu.x retweetledi
OpenMOSE
OpenMOSE@_m0se_·
来月あたりに、 Qwen3.5 9BのGDNレイヤーと、Attnレイヤーを RWKV 07E-NoPEに変換し、Prefix State-tuning実装と、 MRSSで、遊ぶつもり
日本語
2
1
7
342
Analemma
Analemma@AnalemmaAI·
🎉 After 228.5 hrs of continuous operation, FARS has completed its 100th research paper at T+228:28:33. 🚀 During this public deployment experiment, FARS consumed 11.4B tokens at a total cost of $104K and generated 244 hypotheses along the way. On average, each paper took approximately 2 hrs 17 mins to produce, consumed around 114M tokens, and cost about $1,040. 📝 We’ve used paperreview.ai (developed by the Stanford ML Group) to conduct AI-based reviews of the completed papers. The 100 papers received scores ranging from 3.0 to 6.3, with an average score of 5.05. Scores were heavily concentrated around 5.2 (the most frequent score, approximately 57 papers). A small number of papers scored between 3.0–4.5, while only a few exceeded 6.0. Research proposals, experimental code, final papers, and AI review results for all completed projects are now available on our website (analemma.ai/fars). This deployment focused on producing short-form papers and was not specifically optimized for traditional conference-style evaluation frameworks such as ICLR. Therefore, assessments from AI reviewers based on existing academic conference standards (including the Stanford Agentic Reviewer) should be considered for reference only. We’re also conducting an independent human quality assessment and will share the findings once the evaluation is complete. 📃 FARS Research Runs: analemma.ai/fars 📦 FARS GitLab: gitlab.com/fars-a #AI #LLM #research
Analemma tweet mediaAnalemma tweet media
English
5
6
30
4.2K