Vivek

613 posts

Vivek

Vivek

@vikx01

Engineer

เข้าร่วม Temmuz 2008
2.5K กำลังติดตาม149 ผู้ติดตาม
Ronak Malde
Ronak Malde@rronak_·
My favorite paper of 2026 so far 🔥 They took On-Policy Distillation (ie the Thinking Machines blog post), but then showed that the policy can be both the teacher and the student model. The idea is to condition the teacher off of a golden trajectory, and then train on the conditioned logprobs of the same model. The crazy part is, you can literally condition the teacher on anything!! This opens up an entire pandora's box of bridging prompt optimization/ICL + weight optimization that I'm very excited about for continual learning Authors: @IdanShenfeld @MehulDamani2 Jonas Hübotter @pulkitology
Ronak Malde tweet media
English
16
43
425
34.2K
Vivek
Vivek@vikx01·
@PeterBerezinBCA Barrels! 🤣🤣🤣 If it's "under condition of anonymity" it must be true.
English
0
0
4
483
Peter Berezin
Peter Berezin@PeterBerezinBCA·
Wait, they think that oil is actually shipped in barrels? The stagecoach is ready…
Peter Berezin tweet media
English
310
737
5.9K
369K
James Chanos
James Chanos@RealJimChanos·
Ben Sigman@bensig

Launched a new Bitcoin fund with @dotkrueger - here's how it works. Accredited investors put in min $100K. Fund buys Bitcoin. We hold. We use Bitcoin as collateral to borrow stablecoins. Every month we target sending investors ~1% of their capital as a distribution. These distributions are "return of capital" - tax-deferred until you exit. Meanwhile, you keep 100% exposure to Bitcoin's upside. Over the years, friends and family have asked: "Why can't I just give you my money and you buy Bitcoin?" Now you can. We don't trade. We don't time the market. Buy and hold. Simple. Who is this for? → You have Bitcoin and want income without selling → You want Bitcoin exposure with professional custody → You want tax-efficient distributions Accredited investors only. Target distributions not guaranteed.

QHT
12
3
101
25.2K
Ivan Burazin
Ivan Burazin@ivanburazin·
If you’re taking a break for the holidays your ngmi.
English
298
11
289
2.3M
Vivek
Vivek@vikx01·
@CliffordAsness Newsflash: RFK Jr. claims breathing air causes cancer. Anyone who's had cancer has breathed air in the past year. So... 🙃
English
1
0
2
449
Clifford Asness
Clifford Asness@CliffordAsness·
From the article: “There are still no studies that support the claim that any of the 20 doses of the seven infant vaccines recommended for American children before the first year of life do not cause autism.” Not serious people in charge of serious stuff. “When did the vaccines stop beating their spouses?”
Clifford Asness tweet media
English
19
18
163
26.5K
Vivek
Vivek@vikx01·
@Dan_Jeffries1 @akshay_pachaar This logic fails under the fact that that paper wasn't earth shattering at the time it was published in NIPS. Significant, yes. Earth shattering, no.
English
0
0
1
425
Daniel Jeffries
Daniel Jeffries@Dan_Jeffries1·
@akshay_pachaar Cool paper but the reason we know this is not Attention is All You Need v2 is because Google will never ever ever ever ever let their researchers publish something as broad and impactful as AIAYN ever again. :) They only let them publish lesser papers now.
English
23
9
293
29.4K
Akshay 🚀
Akshay 🚀@akshay_pachaar·
Google just dropped "Attention is all you need (V2)" This paper could solve AI's biggest problem: Catastrophic forgetting. When AI models learn something new, they tend to forget what they previously learned. Humans don't work this way, and now Google Research has a solution. Nested Learning. This is a new machine learning paradigm that treats models as a system of interconnected optimization problems running at different speeds - just like how our brain processes information. Here's why this matters: LLMs don't learn from experiences; they remain limited to what they learned during training. They can't learn or improve over time without losing previous knowledge. Nested Learning changes this by viewing the model's architecture and training algorithm as the same thing - just different "levels" of optimization. The paper introduces Hope, a proof-of-concept architecture that demonstrates this approach: ↳ Hope outperforms modern recurrent models on language modeling tasks ↳ It handles long-context memory better than state-of-the-art models ↳ It achieves this through "continuum memory systems" that update at different frequencies This is similar to how our brain manages short-term and long-term memory simultaneously. We might finally be closing the gap between AI and the human brain's ability to continually learn. I've shared link to the paper in the next tweet!
Akshay 🚀 tweet media
English
257
1K
6K
511K
Vivek
Vivek@vikx01·
@Abude_al_asaad On this note, let's change the personal finance rule: invest 100 + your age in equities funded by leverage. Wouldn't that be great?
English
0
0
0
160
Abdul Al-Asaad
Abdul Al-Asaad@Abude_al_asaad·
Our first post about Basic Capital reached over 25M people. The conversations that followed made one thing clear: the way Americans save for retirement is fundamentally broken. (1/
English
17
7
39
95.5K
Vivek
Vivek@vikx01·
@TMTMoats Soooo..."this time it's different" argument or "we're not there yet" argument? Likely the latter.
English
0
0
0
12
Conch Shell Capital
Conch Shell Capital@TMTMoats·
Eric Sheridan (GS Internet Analyst), thoughtful response when asked if we are in an AI Bubble: "Do I see signs that point me back to the late 90s or the 2007 time period? Sure. Private market valuations are well ahead of public market valuations. Public market valuations are above historical norms, but they're also below the peak of public market valuations that you saw in 1999 and 2000. Capital market activity is still well below the levels that were seen in 2020/2021, 2007/2008, & 1998/1999. I would argue there are signs of exuberance - there are signs that rhyme with past periods of time, but I wouldn't necessarily align it perfectly with the lessons we've learned in prior periods, at least not yet. That's arguably a duration narrative, that we're just not there yet - that is one framing I would give. The other thing that is a little bit different than prior periods is that typically the companies that generate no profits, and in 1999, it was companies that generated no revenue, are the ones that were driving the most exuberant valuations in the market. The Mag 7, most of those companies, generate free cash flow, buy back their stock, and pay dividends. There were very few companies buying back their stocks and paying dividends in 1999, so there are some key distinctions I would keep in mind as opposed to just drawing a straight analogy."
English
5
11
176
33.3K
Vivek
Vivek@vikx01·
@ALEXEIMARTOV Great points. This is the gem: "but it is hard to tell if the data center capacity will be better used with new ones at that point especially considering the cost of power / the power efficiency of the new gpus."
English
0
0
1
244
Martin Bradstreet
Martin Bradstreet@ALEXEIMARTOV·
people act like the depreciation on GPUs is really difficult, so i made this simple guide. you're the gpu maker: we've gone from a 2 year replacement cycle to an 18 month one, pump the stock pls. you're a growth ceo talking to investors: those gpus last 12 years, lend us money pls. you're the growth ceo talking to your CFO: depreciate those gpus as fast as possible, fuck paying taxes. you're the investor shorting the stock: those gpus only last 2 years, look at what the CFO just said! you're a mega cap CEO trying to protect your P/E ratio / bond rating, which is more important to you than tax optimization: those GPUs last 8 years you're an investor longing the stock: those gpus last 8 years. the reality: the GPUs probably last a little over 5 years but it is hard to tell if the data center capacity will be better used with new ones at that point especially considering the cost of power / the power efficiency of the new gpus.
Kakashii@kakashiii111

CoreWeave depreciated US$0.63 billion in one quarter (~US$2.5 billion per year), implying an effective annual depreciation rate of around 12% if crudely annualized. That’s roughly consistent with their stated 8 to 12 year useful life for data center infrastructure. Good to know CoreWeave might still be offering Blackwells well into 2037.

English
19
49
473
97.1K
Vivek
Vivek@vikx01·
@FromValue Doesn't make sense. There doesn't seem to be any correlation with and without any lag.
English
0
0
0
19
From Growth To Value
From Growth To Value@FromValue·
If you hear someone say the S&P 500 PE ratio is historically high, show them this. Context and knowledge.
From Growth To Value tweet media
English
41
59
441
53.5K
Vivek
Vivek@vikx01·
@junkbondinvest @verdadcap Aren't PE owned companies more leveraged than S&P or Nasdaq companies? Deleveraging to make it on par with S&P or Nasdaq would make PE performance even worse.
English
0
0
1
32
junkbondinvestor
junkbondinvestor@junkbondinvest·
Private equity vs public markets scorecard: PE 1-year: 8.1% S&P 500: 25.0% VC 1-year: 6.2% Nasdaq: 29.4% But hey, at least you can't check the price daily
junkbondinvestor tweet media
English
25
49
456
55K
Vivek
Vivek@vikx01·
@curiouswavefn Who in this world refers to themselves by their last name?
English
0
0
0
42
Ash Jogalekar
Ash Jogalekar@curiouswavefn·
Amazing story about how intensely Lincoln studied Euclid's "Elements" to find out the precise meaning of the word 'demonstrate'.
Ash Jogalekar tweet media
English
41
298
2.9K
267.2K
Vivek
Vivek@vikx01·
@LOGOinvestor @bgurley Bill very likely gets this. But it gets in the way of his IPO outrage narrative.
English
0
0
0
35
Eric Clark
Eric Clark@LOGOinvestor·
@bgurley I doubt. Heard in the boardroom pitch “we price it too low, we only release a small amount of stocks, and it goes vertical…then you can do multiple secondaries and pad your balance sheet”!!
English
1
0
13
2.3K
Bill Gurley
Bill Gurley@bgurley·
With $FIG we have another massive "POP" highlighting the gross inefficiency in the modern IPO process. It's very simple. They REFUSE to match supply/demand (that happened today). They brag about the mis-match - "30X oversubscribed." The outcome is expected & fully intentional.
English
137
253
4.3K
736.3K
Vivek
Vivek@vikx01·
@greg_ip Doesn't AI bubble being a confounding factor make this comparison quite a bit less kosher?
English
0
0
2
35
Vivek
Vivek@vikx01·
@george__mack That attitude seems to be a sign of insecurity.
English
0
0
0
7
George Mack
George Mack@george__mack·
If someone asks you “what do you do?” within seconds of meeting you — don’t be annoyed. Smile. You’ve just identified a transactional player. Instead, give them the lowest status answer. Then drop your highest IQ insights for the rest of the conversation. This breaks brains.
English
819
695
14.8K
2.8M
Vivek
Vivek@vikx01·
@eigensteve Love to proofread sections of it.
English
0
0
1
59
Steven Brunton
Steven Brunton@eigensteve·
New Book & Video Series!!! (late 2025) Optimization Bootcamp: Applications in Machine Learning, Control, and Inverse Problems Comment for a sneak peak to help proofread and I'll DM (proof reading, typos, HW problems, all get acknowledgment in book!)
English
243
205
2K
102.4K
Vivek
Vivek@vikx01·
@davidasinclair But is is causation or correlation? Which type of muscle: type 1 or type 2 or both? Is it the most important factor? Compared to VO2Max?
English
0
0
0
25
David Sinclair
David Sinclair@davidasinclair·
Muscle mass predicts lifespan better than BMI or weight - so get moving and get lifting
English
221
627
7.8K
807.5K
Vivek
Vivek@vikx01·
@InvestingVisual How did you reach the conclusion that "AI chip market is set to grow from $96B to $927B by 2034"? Any sources? I'm just curious regarding the veracity of this statement.
English
0
0
0
31
Investing visuals
Investing visuals@InvestingVisual·
The AI chip market is set to grow from $96B to $927B by 2034 — a massive 29% CAGR. Businesses best positioned to benefit: • Tier 1 - AI Foundation: $ASML, $TSM, $AMAT • Tier 2 - AI Chips: $NVDA, $AMD, $AVGO • Tier 3 - AI Infra : $NBIS, $CRWV, $CDNS
Investing visuals tweet media
English
45
221
1.1K
217.7K
Vivek
Vivek@vikx01·
@davidasinclair Why 14 hours and not 13? What's the mechanism? Yeah you do the research but where is the research paper that shows this?
English
0
0
0
25
David Sinclair
David Sinclair@davidasinclair·
Fasting for 14+ hours activates autophagy—your body’s cleanup mechanism
English
178
288
3.1K
371.5K