Ziyue Li

543 posts

Ziyue Li banner
Ziyue Li

Ziyue Li

@curiosity_notes

🌈 Data Scientist @remax | sharing what I learn about software engineering, AI, data, and science | ignorant and curious

Tampa, FL Katılım Mart 2018
123 Takip Edilen272 Takipçiler
Sabitlenmiş Tweet
Ziyue Li
Ziyue Li@curiosity_notes·
With the help of #seedance 2.0, I’m finally able to bring the world of cyber knights and beasts to life. The character designs were created back in 2023 using ComfyUI, and have been adapted to fit better together in this new world.
English
0
0
0
93
Ziyue Li
Ziyue Li@curiosity_notes·
Merry Christmas! 🎄 #VibeCoding This single-file web app is created and refined entirely with AI. Download it and play with your own photos: bit.ly/3L9NvCx
English
0
0
1
46
Ziyue Li
Ziyue Li@curiosity_notes·
I originally built Lexigen for myself and it has worked pretty well, so I’m sharing it in the hope that it’s useful to others as well. 👉 Free download on the App Store: apple.co/4jaS644
English
0
0
0
14
Ziyue Li
Ziyue Li@curiosity_notes·
Lexigen runs on Apple’s on-device Foundation Models instead of a cloud API Pros: • No subscriptions or accounts • Better privacy by design Cons: • Limited supported devices and languages • Strict safety guardrails prevents card generation occasionally
English
1
0
0
21
Ziyue Li
Ziyue Li@curiosity_notes·
AI is great for language learning, but most AI chat apps don’t help you retain what you learn. Flashcards work, but making good ones is tedious. So I built Lexigen: an AI-powered flashcard app that automatically generates definitions and usage examples for the words you save.
English
1
0
1
59
Ziyue Li retweetledi
Alexander Wei
Alexander Wei@alexwei_·
1/N I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).
Alexander Wei tweet media
English
397
1.3K
7.3K
5.7M
Ziyue Li
Ziyue Li@curiosity_notes·
o1 is the first model I tested that was able to solve this ‘world’s hardest logic puzzle’ correctly, albeit with a lot of guidance and steering from the human.
Ziyue Li tweet media
English
0
0
1
71
Ziyue Li
Ziyue Li@curiosity_notes·
Tried @runwayml's Camera Control with a photo of me as a baby👶 The model failed to orbit around objects when I gave it artistic paintings, but it deals with photos of people relatively well. #AI #2Dto3D
English
0
0
2
144
Ziyue Li retweetledi
Andrej Karpathy
Andrej Karpathy@karpathy·
These 94 lines of code are everything that is needed to train a neural network. Everything else is just efficiency. This is my earlier project Micrograd. It implements a scalar-valued auto-grad engine. You start with some numbers at the leafs (usually the input data and the neural network parameters), build up a computational graph with operations like + and * that mix them, and the graph ends with a single value at the very end (the loss). You then go backwards through the graph applying chain rule at each node to calculate the gradients. The gradients tell you how to nudge your parameters to decrease the loss (and hence improve your network). Sometimes when things get too complicated, I come back to this code and just breathe a little. But ok ok you also do have to know what the computational graph should be (e.g. MLP -> Transformer), what the loss function should be (e.g. autoregressive/diffusion), how to best use the gradients for a parameter update (e.g. SGD -> AdamW) etc etc. But it is the core of what is mostly happening. The 1986 paper from Rumelhart, Hinton, Williams that popularized and used this algorithm (backpropagation) for training neural nets: cs.toronto.edu/~hinton/absps/… micrograd on Github: github.com/karpathy/micro… and my (now somewhat old) YouTube video where I very slowly build and explain: youtube.com/watch?v=VMj-3S…
YouTube video
YouTube
Andrej Karpathy tweet media
English
198
1.8K
14.9K
1.6M
Ziyue Li
Ziyue Li@curiosity_notes·
I'm not sure what this is though 😂
English
0
0
0
29
Ziyue Li
Ziyue Li@curiosity_notes·
I tried @LumaLabsAI's image-to-video AI with one of my old AI artworks, and this is what I got. Interesting. There’s still lots of room for improvement, I would say.
English
3
0
1
77
Ziyue Li
Ziyue Li@curiosity_notes·
Another one with my AI artwork. Drifting away. It does keep the body parts relatively consistent most of the time 😅.
English
0
0
0
24
Ziyue Li retweetledi
andi (twocents.com)
andi (twocents.com)@Nexuist·
Marketing speak to terms you already know: semantic index -> embeddings app intents -> function calling on device language model -> 3B fine tuned LLM w/ included LoRA adapters on device image model -> diffusion model w/ included LoRA adapters orchestration -> Siri Neural Engine -> Apple's GPU
andi (twocents.com) tweet media
Max Weinbach@mweinbach

This is from Apple's State of the Union The local model is a 3B parameter SLM that uses adapters trained for each specific feature. Diffusion model does the same thing, adapter for each style. Anything running locally or Apple's Secure Cloud is an Apple model, not OpenAI.

English
20
130
1.4K
484.7K
Ziyue Li
Ziyue Li@curiosity_notes·
Finally got the flashcard animations to work 😅 The app is still in early development... #SwiftUI #iosdev
English
0
0
0
83