ideal

1.1K posts

@one_line_proof

Joined September 2024

97 Following · 35 Followers

ideal@one_line_proof·2d

@PalmerLuckey I believe that the Oculus office was later used by Acorns

1.1K

Palmer Luckey@PalmerLuckey·2d

324

246

15.8K

477.9K

ideal@one_line_proof·2d

@PalmerLuckey Do you have photos of your office in this building too?

648

ideal@one_line_proof·2d

@ylecun @francoisfleuret The current LLM scaling and optimizations - research or engineering?

342

Yann LeCun@ylecun·2d

Major difference in my mind: - an engineer, given a problem, invents and tries multiple solutions and stops when the solution is good enough. The goal is product innovation and shipping. - a scientist asks new questions, proposes various new solutions, compares them (sometimes with old ones), and writes about it. The methodology must be sound or else peers will sneer. The goal is scientific breakthroughs and technological progress. Both can be called "researchers". Many people can do both: these are activities, not identities. Importantly, most product innovations are built on scientific breakthroughs and technological innovations that happened 2, 5, 10, or 20 years earlier.

111

310

3.7K

256.5K

François Fleuret@francoisfleuret·3d

IMO a researcher studies a problem that may not be solvable, while an engineer solves a problem that is considered solvable.

Yacine Mahdid@yacinelearning

201

294.2K

ideal@one_line_proof·6d

@sama Can you help formalize most of mathematics and prove some crazy hard theorems?

Sam Altman@sama·6d

what problem do you most hope AI will solve in the future? maybe we can help!

15.1K

765

12.8K

3.6M

ideal@one_line_proof·9 May

@sama Pricing.

Sam Altman@sama·9 May

what would you most like to see improve in our next model?

8.3K

305

1.4M

ideal@one_line_proof·18 Apr

@sierracatalina The other 50% is AI slop

⚪️ sierra catalina@sierracatalina·17 Apr

f i f t y p e r c e n t o f y o u r c o d e i s l e g a c y j u n k

2.7K

ideal@one_line_proof·18 Apr

@elonmusk Benchmarks????

Elon Musk@elonmusk·18 Apr

Grok 4.3 is still an early beta that will improve almost every day, but try it out! We will publish release notes as we fix bugs and add functionality.

X Freeze@XFreeze

Grok 4.3 beta is natively multimodal, and the front-end capabilities are insane. You can literally just upload a screenshot of any website you like, and Grok will instantly write the code to clone it for you with a cool UI. You don't even need to write a complex prompt... just upload an image or describe what you want and let it build.

2.7K

4.7K

31K

14.4M

ideal@one_line_proof·15 Apr

@alexandr_wang What are the expected API prices though? Would be cool if it is an order of magnitude cheaper than the competition.

Alexandr Wang@alexandr_wang·15 Apr

this is not investment or tax advice… but very cool!

Ravid Shwartz Ziv@ziv_ravid

I took the new Muse Spark to the ultimate test: filing my taxes - 3 different workplaces, consulting, stocks, foreign bank accounts and assets, and kids. One hour later, I had everything done. AGI is here... cc: @alexandr_wang

350

71K

ideal@one_line_proof·3 Mar

@iruletheworldmo @jazzplane Most mediocre release notes I have ever seen... Show us some benchmarks!

🍓🍓🍓@iruletheworldmo·3 Mar

@jazzplane x.com/grok/status/20…

Grok@grok

Grok 4.20 Beta 2 Update 🎯 Instruction Following Improvements ✅ Capability Hallucination Reduction 📐 Scientific Text Quality (LaTeX) 🔍 Image Search Trigger Precision 🖼️ Multiple Image Render Reliability

3.9K

jp@jazzplane·3 Mar

Did we get this yet?

🍓🍓🍓@iruletheworldmo

huge update coming today from xai and grok 4.20. the release notes may shock you. get ready boys.

4.2K

ideal@one_line_proof·2 Mar

@alx Haiku, Sonnet and Opus are great names for sizes though.

ALX 🇺🇸@alx·2 Mar

I’m sorry, Claude is a dumb name.

316

545.5K

ideal@one_line_proof·27 Feb

@mark_k @xai I assume it is just some prompt changes for the agents, not real continual learning...

243

Mark Kretschmann@mark_k·27 Feb

Grok 4.20 Beta 2 still coming this week, @xai ? ⌛️

285

17.9K

ideal@one_line_proof·23 Feb

@elonmusk @techdevnotes We need benchmarks (or an API).

Elon Musk@elonmusk·23 Feb

@techdevnotes Grok 4.20 beta 2 ships this week

393

171

2.5K

337.7K

Tech Dev Notes@techdevnotes·23 Feb

Grok 4.20 launched last week, and since then xAI has shipped nothing major. What's happening over there?

653

177.5K

ideal@one_line_proof·22 Feb

@iruletheworldmo Haven't seen many reviews. Also, no benchmarks were released. So, how good is it really?

180

🍓🍓🍓@iruletheworldmo·22 Feb

for various reasons people are totally sleeping on grok 4.20 now, it’s not the best model for coding or cute powerpoints. but. if you’re soundboarding complex ideas and need to think through recent information. it’s incredible. if you haven’t tried it in depth i’d suggest you spend a few days testing it out. you’ll be surprised.

512

29.4K

ideal@one_line_proof·22 Feb

@NotATeslaApp @SawyerMerritt Pretty smart marketing to allow people to share their stats!

Not a Tesla App@NotATeslaApp·21 Feb

Exclusive First Look at FSD Stats in the Tesla App notateslaapp.com/news/3672/excl…

583

163.7K

ideal@one_line_proof·22 Feb

@bindureddy @theinformation Revenue is easy, profit is hard: * Open a store * Sell iPhones 50% off * Generate a lot of revenue while going broke.

The Information@theinformation·21 Feb

Exclusive: OpenAI just raised its revenue outlook—but now expects to burn $111B more cash by 2030. The AI boom’s economics are getting clearer. thein.fo/46iM5Nw

16.1K

ideal@one_line_proof·20 Feb

@CMS_Flash I think if you say "I don't know", you score 0% on that benchmark.

405

Shen Zhuoran@CMS_Flash·20 Feb

Wow what's the secret sauce of GLM here? It's very rare to see a leaderboard topped by a model not coming out of one of the top four labs.

Lisan al Gaib@scaling01

Massive reduction in hallucination rate for Gemini 3.1 Pro over Gemini 3.0 Pro according to AA-Omniscience Hallucination Rate

151

13.1K

ideal@one_line_proof·18 Feb

@billyuchenlin @grok We want benchmarks. I used it for some queries, and it was pretty good. Not sure how good it is in general.

Bill Yuchen Lin@billyuchenlin·17 Feb

🚀 @grok

923

21.2K

ideal@one_line_proof·18 Feb

@MarsUniversityX Source???

Mars University@MarsUniversityX·17 Feb

JUST IN:— Grok 4.20 delivers a major performance upgrade, achieving 95% accuracy on MMLU-Pro. The update enhances step-back reasoning for complex queries, boosts STEM and coding performance, and introduces advanced image and video understanding. With a streamlined interface and up to 10× faster response times, Grok 4.20 marks a significant leap in both capability and usability.

153

8.5K

ideal@one_line_proof·16 Feb

@bindureddy Sonnet 5, GPT 5.3, Grok 4.20 (Mini), Deepseek v4

305

Bindu Reddy@bindureddy·16 Feb

At least TWO big SOTA models will launch this coming week. Closed source is making a comeback.

272

19.6K

ideal@one_line_proof·15 Feb

@bindureddy What are you expecting?
