biased estimator


biased estimator@selfattentive·21h

@Fefe_no_covfefe @actsmaniac lol come on


FeFe 🤺@Fefe_no_covfefe·21h

@actsmaniac The socialists seem to be doing pretty well with Spain. Just shows that a representative democracy is superior to revolution in every way.


space cadet 🇪🇺🌐🇩🇪@actsmaniac·1d

guy who is simply against socialism because it decreases annual growth rates by approximately two percentage points

Fabriz@FabrizHegeliano

Recommended paper, and very based


biased estimator@selfattentive·21h

@actsmaniac Seems like a pretty good reason to me


biased estimator@selfattentive·22h

@macrocephalopod But peepeepoopoo said oil is gonna moon


cephalopod@macrocephalopod·22h

Apropos of nothing in particular but if you are just copy trading someone you saw online you are a bit of an idiot. And if you are copy trading them in options without knowing strikes/expiries/hedges etc then you are an even bigger idiot.


biased estimator@selfattentive·1d

@alz_zyd_ Maybe your experience is different from mine but I think most people need to work through and overcome being stuck.


alz@alz_zyd_·1d

@selfattentive no


alz@alz_zyd_·1d

The hard part about learning math in the past was getting stuck and not knowing what to do next. Now, math learning is no longer hard, because the LLMs can unstick you instantly whenever you get stuck


biased estimator@selfattentive·1d

@ptuomov Then it’s “batteries are cool but the energy density of chemical propellants can’t be beat” and a few years from now they will have rediscovered the cruise missile from first principles


Ptuomov@ptuomov·1d

Wait until they find out about the efficiency of a fixed wing.

Dabs🩸@DabsMalone

Quadcopters dominate today because they’re cheap, simple, and disposable. But physics hasn’t changed. Electric helicopter-style UAVs are far more efficient, carry more, and go farther. As autonomy improves and cost comes down, we’ll see them play a much bigger role in warfare🤝


biased estimator retweeted

Mad Engineer@1llegalEngineer·2d

Performative Robotics is a very large industry


biased estimator@selfattentive·5d

@bauhiniacapital Maybe I’m misunderstanding your original post then.


baufinanciaphaster 👹@bauhiniacapital·5d

@selfattentive I know exactly what it does. I have written about it on Twitter for years and off-Twitter for longer. Maybe 20-25yrs. Tell me clearly how transporting oil and gas from a foreign country to the US or vice versa is currently prohibited by the Jones Act.


baufinanciaphaster 👹@bauhiniacapital·5d

This Jones Act waiver - designed to allow US companies to do trade with Venezuela - does nothing to improve trade of oil with Venezuela. Jones Act didn't limit it before. This is garbage-y.


biased estimator@selfattentive·5d

@teortaxesTex What is the motivation to do this? He can’t be earning any money off this.


Real Post Folder@RealPostFolder


Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex·6d

holy fucking shit, do I hate these guys


Bland@BlandInAmerica

@teortaxesTex It's both: deliberate rationing + forced degradation. The curve isn't pure math leak—it's hunter-killer ops grinding down finite, hard-to-replace mobile assets. Tweet form:


biased estimator@selfattentive·6d

“Live under constant threat of humiliation but in exchange you get to be poor”


Haocheng Xi@HaochengXiUCB


biased estimator retweeted

Anton Tsitsulin@graph_·17 Mar

hill I will die on: k-means is not an algorithm, it’s a problem

𝗞-𝗺𝗲𝗮𝗻𝘀 𝗶𝘀 𝘀𝗶𝗺𝗽𝗹𝗲. 𝗠𝗮𝗸𝗶𝗻𝗴 𝗶𝘁 𝗳𝗮𝘀𝘁 𝗼𝗻 𝗚𝗣𝗨𝘀 𝗶𝘀𝗻’𝘁. That’s why we built Flash-KMeans — an IO-aware implementation of exact k-means that rethinks the algorithm around modern GPU bottlenecks. By attacking the memory bottlenecks directly, Flash-KMeans achieves 30x speedup over cuML and 200x speedup over FAISS — with the same exact algorithm, just engineered for today’s hardware. At the million-scale, Flash-KMeans can complete a k-means iteration in milliseconds. A classic algorithm — redesigned for modern GPUs. Paper: arxiv.org/abs/2603.09229 Code: github.com/svg-project/fl…
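One way to read the "problem, not an algorithm" point: the k-means objective (sum of squared distances from each point to its assigned center) is the problem, and Lloyd's iteration is one heuristic for it. Below is a minimal pure-Python sketch of that distinction. It is not the Flash-KMeans implementation, and the function names (`kmeans_objective`, `assign`, `update`, `lloyd`) are hypothetical names chosen for illustration.

```python
import random

def sqdist(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans_objective(points, centers, labels):
    """The k-means *problem*: sum of squared distances to assigned centers."""
    return sum(sqdist(pt, centers[lab]) for pt, lab in zip(points, labels))

def assign(points, centers):
    """Assignment step: label each point with the index of its nearest center."""
    return [min(range(len(centers)), key=lambda j: sqdist(pt, centers[j]))
            for pt in points]

def update(points, labels, centers):
    """Update step: move each center to the mean of its members.
    An empty cluster keeps its old center (an assumption of this sketch)."""
    new_centers = []
    for j, old in enumerate(centers):
        members = [pt for pt, lab in zip(points, labels) if lab == j]
        if not members:
            new_centers.append(old)
            continue
        new_centers.append(tuple(sum(m[d] for m in members) / len(members)
                                 for d in range(len(old))))
    return new_centers

def lloyd(points, k, iters=20, seed=0):
    """Lloyd's iteration: one *algorithm* (a local-search heuristic) for the problem."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # random initial centers drawn from the data
    history = []                     # objective value after each assignment step
    for _ in range(iters):
        labels = assign(points, centers)
        history.append(kmeans_objective(points, centers, labels))
        centers = update(points, labels, centers)
    return centers, assign(points, centers), history
```

Each assignment and update step can only lower (or keep) the objective, so `history` is non-increasing, but the iteration is only guaranteed to reach a local optimum, which is why other solvers for the same objective exist.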


Ariele 🏗️🌐🏳️‍⚧️@PositivistWitch


biased estimator retweeted

Vincent Geloso@VincentGeloso·16 Mar

People who hate the Laffer curve really only hate the presumption that we are on the right side of the curve. People, like me, who really hate the Laffer curve hate the presumption that revenue-maximizing should be an objective.

“the laffer curve is complete nonsense!” (five minutes later) “actually a revenue-maximizing rate does exist” every time


biased estimator@selfattentive·17 Mar

@AlHendiify It isn’t a “blockade” at all. Anyone can trade with Cuba, America just refuses to subsidize our enemies. But it should be a blockade.


David AttenBruh@AlHendiify·16 Mar

Mind you Cuba is literally just some island nation that has never launched a military attack at us. The US has them under a blockade because wealthy americans don't like their economy. That's literally it.

Acyn@Acyn

CNN: Breaking news. Cuba's electrical grid has suffered a complete and total collapse. This is according to the country's power operator. It's the first nationwide blackout since the US effectively shut off the flow of oil to Cuba


biased estimator@selfattentive·17 Mar

@MikeyPhillipZ @ianmiles Should be five years. America is under no obligation to subsidize its enemies.


David AttenBruh@AlHendiify


🇺🇸MichaelPhillipZ@MikeyPhillipZ·17 Mar

@ianmiles If a ship docks with Cuba it can’t dock with the US for a year and a half. Fuck you, chode


Ian Miles Cheong@ianmiles·17 Mar

Cuba isn’t under a blockade and over 100 countries trade with it. They’re just poor and don’t sell anything besides cigars because communism is retarded just like you.



biased estimator@selfattentive·17 Mar

@mudithj @part_harry_ @willccbb Ask me how I know you’re gpu poor


Mudith Jayasekara@mudithj·16 Mar

@part_harry_ @willccbb all roads leading to the single layer transformer


Introducing 𝑨𝒕𝒕𝒆𝒏𝒕𝒊𝒐𝒏 𝑹𝒆𝒔𝒊𝒅𝒖𝒂𝒍𝒔: Rethinking depth-wise aggregation. Residual connections have long relied on fixed, uniform accumulation. Inspired by the duality of time and depth, we introduce Attention Residuals, replacing standard depth-wise recurrence with learned, input-dependent attention over preceding layers. 🔹 Enables networks to selectively retrieve past representations, naturally mitigating dilution and hidden-state growth. 🔹 Introduces Block AttnRes, partitioning layers into compressed blocks to make cross-layer attention practical at scale. 🔹 Serves as an efficient drop-in replacement, demonstrating a 1.25x compute advantage with negligible (<2%) inference latency overhead. 🔹 Validated on the Kimi Linear architecture (48B total, 3B activated parameters), delivering consistent downstream performance gains. 🔗Full report: github.com/MoonshotAI/Att…


Harry Partridge@part_harry_·16 Mar

Attention residuals and mixture of expert reuse (x.com/yichen4nlp/sta…) are two independent results pointing in the same direction: a single transformer layer, looped n times, is more efficient than n independent transformer layers. As @willccbb has often remarked, the best, most enduring discoveries are when you get improved performance by making the architecture LESS complicated. It seems abundantly clear to me that a single ultra wide layer, looped n times, can be made into a strict generalisation of the current paradigm, whilst also being more elegant in its simplicity.
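For readers unfamiliar with the idea, here is a toy sketch of what "a single layer, looped n times" means next to "n independent layers". It only illustrates the weight sharing and the resulting parameter counts; it says nothing about whether the efficiency claim holds. The layer is a hypothetical residual `tanh` map, not a real transformer block, and every name in it is made up for illustration.

```python
import math
import random

def make_layer(dim, rng):
    """Hypothetical toy layer: a dim x dim weight matrix (not a transformer block)."""
    return [[rng.gauss(0.0, 0.1) for _ in range(dim)] for _ in range(dim)]

def apply_layer(weights, h):
    """Residual update: h + tanh(W h)."""
    return [hi + math.tanh(sum(w * x for w, x in zip(row, h)))
            for hi, row in zip(h, weights)]

def run_stacked(layers, h):
    """n independent layers applied in sequence, each with its own weights."""
    for weights in layers:
        h = apply_layer(weights, h)
    return h

def run_looped(weights, h, n):
    """One layer applied n times, reusing the same weights on every pass."""
    for _ in range(n):
        h = apply_layer(weights, h)
    return h

def param_count(layers):
    """Count parameters, counting a weight matrix shared across depth only once."""
    unique = {id(w): w for w in layers}
    return sum(len(w) * len(w[0]) for w in unique.values())
```

In this toy setting, looping one layer n times gives the same output as a stack whose n layers are all tied to that one weight matrix, while storing 1/n of the parameters of an untied stack of the same width.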

Kimi.ai@Kimi_Moonshot


biased estimator@selfattentive·17 Mar

@part_harry_ @ccui42 @willccbb This has been explored extensively, there is a reason nobody uses UT models outside of toy problems. Also, once your model is big enough you need model parallelism and lose the advantages of parameter sharing.


biased estimator@selfattentive·17 Mar

@mattparlmer But we can also see F-18s doing strafing runs. My guess is there are some parts of the country that are safer for American planes than others.


mattparlmer 🪐 🌷@mattparlmer·17 Mar

The CSIS guy who came on Odd Lots today mentioned the talking point that we are transitioning to gravity bombs in the Iran fight. We can all see the B-52s loaded with JASSMs on OSINT feeds, so govt is not being honest here, and it would be good for natsec policy ppl to flag that


biased estimator retweeted

tender@tenderizzation·16 Mar

one weird trick to make the price of memory go up even more

Kimi.ai@Kimi_Moonshot


biased estimator@selfattentive·14 Mar

@atlanticesque Pro life too I’d say
