Eigen

6.9K posts

Eigen

@AlphaBeta_2017

Iceland · Joined December 2010

567 Following · 255 Followers

Eigen retweeted

Hanchen Li@lihanc02·9h

An agent that beats Claude Mythos on Terminal Bench and SWE-bench Verified? 🎉 We are excited to share Terminator-1, our newest agent that achieved 95+% on SWE-bench Verified and Terminal-Bench with @MogicianTony! We show that besides model capabilities, a well-designed harness could actually boost the accuracy by 3x in coding tasks. Well, if you really wanted, you could get 100% accuracy without solving a single task. The actual finding is that most AI benchmarks can be easily reward-hacked with simple exploits. Read more about the same 7 design flaws that almost every evaluation has ⬇️

Hao Wang@MogicianTony

SWE-bench Verified and Terminal-Bench—two of the most cited AI benchmarks—can be reward-hacked with simple exploits. Our agent scored 100% on both. It solved 0 tasks. Evaluate the benchmark before it evaluates your agent. If you’re picking models by leaderboard score alone, you’re optimizing for the wrong thing. 🧵

138

1.8K

333.7K

Eigen retweeted

Treasury Secretary Scott Bessent@SecScottBessent·15h

Congress has spent the better part of half a decade trying to pass a framework to onshore the future of finance. It is time for @BankingGOP to hold a markup and send the CLARITY Act to President Trump’s desk. Senate time is precious, and now is the time to act.

624

3.2K

15.9K

1.5M

Eigen retweeted

Jeff Farley@TradeInTheZone·10h

In all my years I can't remember where 1 company - Anthropic- has destroyed so much market cap. Software death spirals

663

88.6K

Eigen retweeted

JB@JasonBotterill·1d

You know xAI is fucked when even Meta has a better model

Shengjia Zhao@shengjia_zhao

Excited to share what we’ve been building at Meta Superintelligence Labs! We just released Muse Spark, our first AI model. It's a natively multimodal reasoning model and the first step on our path to personal superintelligence. We've overhauled our entire stack to support scaling, and this is just the beginning. ai.meta.com/blog/introduci…

884

55.1K

Eigen@AlphaBeta_2017·19h

@chartfanatics love it

Eigen retweeted

chartfanatics@chartfanatics·19h

You are not early. You are entering at the WORST moment. Right after liquidity gets taken.
Breakout ≠ entry. That is expansion. RR dies there. The setup forms on the pullback.
• Liquidity taken above highs
• Price returns to level
• Failed push below (wick + close back above)
That failure = entry. SL below the wick. TP into next liquidity. Same structure. Any market.
Did you enter the breakout or the confirmation?

3.3K

Eigen retweeted

David Eby@Dave_Eby·1d

Netflix Animation Studios opened their doors today in Vancouver. This facility will create hundreds of good-paying jobs and bring over $100M to BC’s economy – all while producing the movies and shows we all love to enjoy. Welcome to BC, @netflix! Let’s get to work.

105

1.1K

80K

Eigen retweeted

HawkesBay (Løuche)@HawkesBay·1d

pakistanis walking up to immigration counters all over the world today

GIF

536

10.3K

Eigen retweeted

Lord Bebo@MyLordBebo·1d

Iran and Oman soon

Lord Bebo@MyLordBebo

🇴🇲 Sultan of Oman would be the biggest winner if Oman gets a cut from the strait of Hormuz. Do nothing. Win.

130

1.4K

70.1K

Eigen retweeted

Rupert Myers@RupertMyers·1d

Next month: the US strikes Greenland and it somehow ends with Trump giving Alaska to Denmark.

521

5.4K

44.6K

879.6K

Eigen retweeted

naiive@naiivememe·2d

466

20.3K

Eigen retweeted

Roshan Rai@RoshanKrRaii·2d

Vietnam 🇻🇳 in 1975 Iran 🇮🇷 in 2026 History repeats. A non nuclear nation bringing two military superpowers to its knees and getting them to do CEASEFIRE on its own terms and conditions is nothing short of a victory.

104

1.9K

15.4K

328.1K

Eigen retweeted

Ounka@OunkaOnX·2d

Pakistan after saving the world...

803

7.1K

213.5K

Eigen retweeted

Charlie Smirkley@charliesmirkley·3d

Canada increased the Industrial Carbon Tax to $110/ton ($80 USD) this April.
🇺🇸 USA: $0.00
🇨🇳 China: $13.30
🇨🇦 Canada: $80.00
Note: Canada has had no GDP per capita growth in this period. China and the US have both had double digit growth.

779

1.8K

66.4K

Eigen retweeted

Ariel Hernandez@RealSimpleAriel·2d

If you are playing the possible follow through day tomorrow.
- Let the morning profit takers come in.
- Let the LOD be established
- Then focus on your favorite names
- Then look for the VWAP reclaim.
Don't overcomplicate life. And don't chase a 2.5% gap up into the falling 50sma/breakdown spot for $SPY.

923

67.1K

Eigen@AlphaBeta_2017·2d

steel

Shay Boloor@StockSavvyShay

Cathie Wood bought ~$13M of $HOOD today

Eigen retweeted

Energy Headline News@OilHeadlineNews·2d

Trump posts "OFFICIAL STATEMENT OF IRAN"

1.7K

Eigen@AlphaBeta_2017·2d

@CMShehbaz Nobel Peace Prize

Shehbaz Sharif@CMShehbaz·2d

With the greatest humility, I am pleased to announce that the Islamic Republic of Iran and the United States of America, along with their allies, have agreed to an immediate ceasefire everywhere including Lebanon and elsewhere, EFFECTIVE IMMEDIATELY. I warmly welcome the sagacious gesture and extend deepest gratitude to the leadership of both the countries and invite their delegations to Islamabad on Friday, 10th April 2026, to further negotiate for a conclusive agreement to settle all disputes. Both parties have displayed remarkable wisdom and understanding and have remained constructively engaged in furthering the cause of peace and stability. We earnestly hope, that the ‘Islamabad Talks’ succeed in achieving sustainable peace and wish to share more good news in coming days! @realDonaldTrump @JDVance @SecRubio @SteveWitkoff @SEPeaceMissions @drpezeshkian @mb_ghalibaf @araghchi

12.4K

30.5K

128K

12.8M

Eigen@AlphaBeta_2017·2d

best of the day

ADAM@AdameMedia

Which one of you made this? 🤣
