Marc Kelechava

20 posts

Marc Kelechava

Marc Kelechava

@marcmuon

[email protected]

San Francisco, CA Katılım Kasım 2024
114 Takip Edilen19 Takipçiler
Marc Kelechava retweetledi
Georgios Konstantopoulos
> The real story of this era will be who manages to avoid harming themselves in their AI psychosis. this checks out and i see this in a few programmers and non-programmers i have met over the years. programmers feel they can do everything themelves. non-programmers suddenly feel they can program production code. LLM psychosis induces hybris, should be met with humility, IMO. > Without fully endorsing all their ideas, I’m now in the LeCun/Marcus camp on LLMs. I don’t think models like this will ever be able to program, I think the process matters. I think that deep learning is still the solution, but real programming agents will need world models, not some RLVR shit that comments out the failing test and tells you all the tests are now passing. i dont know enough about world models (but i will study up now!) but i think the part about "LLMs get 80% right and then screw up the polish" is consistent with my experience and i wonder if it's a fundamental limit i don't know if i resonate so much with george's writings because we have the same name or believe similar things, or because i used limera1n as a kid but the last few writings hit the spot for me geohot.github.io/blog/jekyll/up…
English
19
14
223
15.2K
Marc Kelechava retweetledi
Daniel
Daniel@growing_daniel·
You can tell AI is a net good for society because mark zuckerberg is bad at making it
English
92
209
5.1K
135.3K
Marc Kelechava retweetledi
Lee Robinson
Lee Robinson@leerob·
You might believe you should spend less time thinking about code because of AI. I strongly disagree! We’re watching this play out live where tons of AI generated code becomes a liability. At the end of the day, an engineer needs to be responsible / on call for code that gets shipped to production. If you don’t understand the system you’re trying to debug, you’re probably going to have a bad time. Yes, AI can help with all of this, if you set up the proper systems. You can have agents triage prod logs, look at errors, etc. You can speed up parts of the investigation, but an engineer needs to make the call. There might be serious customer or financial implications from that change. I expect the trend continue for trimming dependencies, vendoring code so you can modify it directly, preferring simpler systems with fewer abstractions, and spending waaaay more time thinking about system design and code maintenance. I’ve said this before, but it’s a great time to get familiar with CS fundamentals and some of the history behind what great software looks like. Many parts will be different in the coming years as AI progresses, but also a lot more than people realize will stay the same.
English
258
518
4K
562.1K
Marc Kelechava retweetledi
Susan Zhang
Susan Zhang@suchenzang·
there is no better time in tech than now to be a jack of all trades, master of a few. just make sure to keep adding to the few year over year, such that the cumulative breadth of expertise you collect becomes an increasingly rare combo. remember, if you're top 10% in 3 different areas, that already makes you top 0.1%. keep switching it up until you get to "your best", and then switch it up again (great for a particular flavor of people who don't enjoy resting on laurels, maybe not so great for others). question all institutional value and pedigrees, all traditional career paths or corporate ladders: the college industrial complex is getting shaken up, alongside a disappearing managerial class, so if you're pursuing either make sure you are fully internally aligned with why. social/political capital in a particular institution can feel incredible, but if you're spending all your energy on complex political people games, you're not a technologist anymore, you're an unelected politician. if you're ok with that, then all's well. critical thinking is more important than ever: take nothing at face-value, question everything and everyone. the equivalent of ai slop can be found in humans operating under misaligned incentives and interests. the sooner you're clued into disambiguating the talkers/larpers from the doers, the better off you'll be figuring out where and who to invest your time in. the anxiety of job displacement is very real, since a surprising amount of white collar work/prestige is built on a performative house of cards, significantly lacking in correlation with technical breadth, depth, and skill. as long as you keep learning, keep building, keep producing receipts, you will be fine. if all that sounds ok to you, welcome to the world of technology! it's truly one of the few places you can experience child-like wonder every few years, and be constantly humbled & excited by new adventures, as scary as they may seem at first. don't give up, drink your water, get your sunlight, and take breaks as needed. tech careers are notoriously nonlinear, so you might as well embrace it and enjoy the ride!
English
51
329
2.7K
383.8K
alisa rae .☘︎ ݁˖
alisa rae .☘︎ ݁˖@RaeAlisa_·
to celebrate 3 months since lauching @lucent_ai, we're giving away 5 Codex Pro / Claude Max plans 🎁 to enter, like this post + comment which one you'd pick (codex vs claude) winners will be selected from comments in 5 days 🫶
alisa rae .☘︎ ݁˖ tweet media
English
2.1K
124
2.8K
149.2K
Marc Kelechava retweetledi
baby keem
baby keem@babykeem·
how do u fix openclaw internal reasoning leaking
English
651
1.7K
18.6K
3.6M
Marc Kelechava retweetledi
ayman nadeem
ayman nadeem@aymannadeem·
sf dudes be like "taste is all that matters" while having one patagonia fleece and a datadog summit 2022 t-shirt to their name
English
12
8
411
13.7K
Marc Kelechava retweetledi
Andrej Karpathy
Andrej Karpathy@karpathy·
I quite like the new DeepSeek-OCR paper. It's a good OCR model (maybe a bit worse than dots), and yes data collection etc., but anyway it doesn't matter. The more interesting part for me (esp as a computer vision at heart who is temporarily masquerading as a natural language person) is whether pixels are better inputs to LLMs than text. Whether text tokens are wasteful and just terrible, at the input. Maybe it makes more sense that all inputs to LLMs should only ever be images. Even if you happen to have pure text input, maybe you'd prefer to render it and then feed that in: - more information compression (see paper) => shorter context windows, more efficiency - significantly more general information stream => not just text, but e.g. bold text, colored text, arbitrary images. - input can now be processed with bidirectional attention easily and as default, not autoregressive attention - a lot more powerful. - delete the tokenizer (at the input)!! I already ranted about how much I dislike the tokenizer. Tokenizers are ugly, separate, not end-to-end stage. It "imports" all the ugliness of Unicode, byte encodings, it inherits a lot of historical baggage, security/jailbreak risk (e.g. continuation bytes). It makes two characters that look identical to the eye look as two completely different tokens internally in the network. A smiling emoji looks like a weird token, not an... actual smiling face, pixels and all, and all the transfer learning that brings along. The tokenizer must go. OCR is just one of many useful vision -> text tasks. And text -> text tasks can be made to be vision ->text tasks. Not vice versa. So many the User message is images, but the decoder (the Assistant response) remains text. It's a lot less obvious how to output pixels realistically... or if you'd want to. Now I have to also fight the urge to side quest an image-input-only version of nanochat...
vLLM@vllm_project

🚀 DeepSeek-OCR — the new frontier of OCR from @deepseek_ai , exploring optical context compression for LLMs, is running blazingly fast on vLLM ⚡ (~2500 tokens/s on A100-40G) — powered by vllm==0.8.5 for day-0 model support. 🧠 Compresses visual contexts up to 20× while keeping 97% OCR accuracy at <10×. 📄 Outperforms GOT-OCR2.0 & MinerU2.0 on OmniDocBench using fewer vision tokens. 🤝 The vLLM team is working with DeepSeek to bring official DeepSeek-OCR support into the next vLLM release — making multimodal inference even faster and easier to scale. 🔗 github.com/deepseek-ai/De… #vLLM #DeepSeek #OCR #LLM #VisionAI #DeepLearning

English
559
1.6K
13.3K
3.3M
Marc Kelechava retweetledi
Speculator
Speculator@TheSpeculator0·
one of my favorite parts of AI is never having to write matplotlib again
Speculator tweet media
English
38
115
3K
124.3K
Marc Kelechava retweetledi
hayden
hayden@haydendevs·
codex just made a python script to edit one of my tsx files, what are we doing man
English
114
23
1.2K
96K
Marc Kelechava retweetledi
Can Vardar
Can Vardar@icanvardar·
americans saw water freeze at 0°C and said “let’s make that 32”
English
2.4K
7.4K
225.5K
29.4M
Marc Kelechava retweetledi
JNS
JNS@_devJNS·
👀?
JNS tweet media
78
84
1.1K
33K
Marc Kelechava retweetledi
Mathieu
Mathieu@miniapeur·
What do you think?
Mathieu tweet media
English
169
1.3K
17.3K
1M
Marc Kelechava retweetledi
Mathieu
Mathieu@miniapeur·
Mathieu tweet media
ZXX
7
61
698
19.9K
Marc Kelechava retweetledi
Mathieu
Mathieu@miniapeur·
Mathieu tweet media
ZXX
34
393
5.2K
166.4K
Marc Kelechava retweetledi
Nassim Nicholas Taleb
Nassim Nicholas Taleb@nntaleb·
Some people are so dumb that no AI model could ever replicate them.
English
208
672
6.6K
507.6K
Marc Kelechava retweetledi
gabriel
gabriel@gabriel1·
if you are so smart why are you not only doing things you enjoy
English
139
204
2.7K
116K
Marc Kelechava retweetledi
Patty
Patty@pattybuilds·
yo bro I have a cool idea for an app ill give you 50% equity if you build everything and do all the work
English
551
949
31.2K
2.6M