2.3K posts

RB

@530RB

Joined July 2011

9 Following 40 Followers

RB@530RB·7h

@burkov Based on everyone’s comments, I think your argument would be stronger with a solid reason why it also fails for CoT/thinking models

English

BURKOV@burkov·11h

Always remember that when an LLM prints the beginning of a text, it has no idea what the end will be. Therefore, when it says "The answer is yes, and this is why:" the text after "why" would most likely be a very elaborate lie combined with gaslighting in case "yes" was the wrong answer.

English

483

53.1K

RB@530RB·7h

@burkov Or it explains why the LLM does a 180 after saying yes

English

RB@530RB·8h

@profithuntercfo @collin_ruth89 It’s not X, it’s Y. It’s not that you can’t speak, it’s that you speak like an AI

English

318

Tom Dillon, CFA@profithuntercfo·12h

@collin_ruth89 it’s not that hybrids don’t work, it’s just that they’re not always worth the extra cost and complexity.

English

544

60.5K

Collin Rutherford@collin_ruth89·13h

I don’t understand why every vehicle isn’t a hybrid. Why not use regenerative braking to charge a battery in every car? Huge increase in gas mileage. Why do they still make non-hybrids?

English

1.1K

2.7K

623.2K

RB@530RB·8h

@OlexGameDev Variable name has it

English

Olex (Solo gamedev Diablo-like)@OlexGameDev·13h

I wonder why new "fancy" coding languages refuse to provide user-defined literals. I find them very handy.

Olex (Solo gamedev Diablo-like) tweet media

English

283

45.2K

RB@530RB·9h

@KIRI_Engine_App But where did the lamp come from

English

122

KIRI Engine - 3D Scanner App@KIRI_Engine_App·21h

AI-Enhanced LiDAR. Left vs right. A real LiDAR device costs thousands of dollars. What's in your iPhone Pro is a "baby LiDAR". Limited depth resolution, noisy output, not really built for high-precision 3D. You can't change the hardware. So we built an ML layer on top. Denoising, geometry completion, detail recovery. Processed server-side. Same sensor. Very different result.

English

610

42.9K

RB@530RB·9h

@redtachyon Just don’t ask it to count r’s

English

Ariel@redtachyon·17h

Ok look. Maybe it can generate plausible-looking text. Maybe it can answer general knowledge questions. Maybe it can generate code snippets. Maybe it can answer simple math questions. Maybe it can autonomously research complex topics. Maybe it can do research with a feedback loop. Maybe it can build entire applications in one go. Maybe it can solve open problems in mathematics. But it's not *really* intelligent, you just have AI psychosis

English

197

12.5K

RB@530RB·9h

@RyanClogg There are a few occasions I would have paid for this

English

274

Ryan Clogg@RyanClogg·13h

If you think about it... This is still absolutely insane.

English

14.9K

2.2M

RB@530RB·14h

@1Hassium That was awesome when signals were sent back to the start to continue growth

English

525

108Hassium@1Hassium·19h

#cellularautomaton #セル・オートマトン
x = 15, y = 15, rule = B2e3aeij4a/S1c2-i3-a4-ajrw
o$2o$b2o10$12bo$12b2o$13b2o!
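
The pattern in the post above is written in the run-length-encoded (RLE) format commonly used for cellular automaton patterns: `b` is a dead cell, `o` is a live cell, `$` ends a row, `!` ends the pattern, and a leading number repeats the next tag. A minimal sketch of a decoder follows. The function name `decode_rle` is an assumption made for illustration, and the sketch only lists the live cells. It does not simulate the rule `B2e3aeij4a/S1c2-i3-a4-ajrw`, which is a non-totalistic rule and would need a neighbourhood table not given here.

```python
import re


def decode_rle(body: str) -> set[tuple[int, int]]:
    """Decode an RLE pattern body into a set of live (x, y) cells."""
    cells = set()
    x = y = 0
    # Each token is an optional run count followed by one tag character.
    for count, tag in re.findall(r"(\d*)([bo$!])", body):
        n = int(count) if count else 1
        if tag == "b":        # run of dead cells: just advance the column
            x += n
        elif tag == "o":      # run of live cells: record each one
            for i in range(n):
                cells.add((x + i, y))
            x += n
        elif tag == "$":      # end of row(s): move down, reset the column
            y += n
            x = 0
        else:                 # "!" marks the end of the pattern
            break
    return cells


# Pattern body copied from the post; the header declares a 15 x 15 box.
live = decode_rle("o$2o$b2o10$12bo$12b2o$13b2o!")
print(len(live))
print(sorted(live, key=lambda c: (c[1], c[0])))
```

The decoded cells form two small clusters in opposite corners of the 15 x 15 bounding box.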

398

29.2K

RB@530RB·14h

@Hitchslap1 That’s not true at all lol

English

Hitchslap@Hitchslap1·18h

Vocabulary is way better at measuring intelligence. It is not even close.

English

206

493

61.4K

RB@530RB·15h

@PingStruggles Yes, but Windows 13 will be the first 100% vibe-coded OS, making Windows 12 a partially vibe-coded OS and putting it in the green zone. What the graph is missing is a downward slope of both red and green

English

145

Max@PingStruggles·1d

Windows 12 better not break the cycle just because it’s vibe coded

English

280

128

3.1K

2.8M

RB@530RB·1d

@orion78fra @pikuma “Unfortunately, you did not get the answer we were looking for. The solution was: U(n+1) = 2(U(n) + 1)”

English

Guillaume Turchini@orion78fra·1d

@pikuma U(n+1) = 2 U(n) + 2 or U(n) = 2^(n+3) - 2
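
The recurrence and the closed form quoted in this exchange can be checked against each other numerically. The sketch below assumes the starting value U(0) = 6, which is what the closed form 2^(n+3) - 2 gives at n = 0. The original puzzle image is not available here, so the actual first term and indexing are assumptions. The function names are illustrative.

```python
def u_recursive(n: int, u0: int = 6) -> int:
    """Iterate U(k+1) = 2*U(k) + 2, i.e. 2*(U(k) + 1), starting from U(0) = u0."""
    value = u0
    for _ in range(n):
        value = 2 * value + 2
    return value


def u_closed(n: int) -> int:
    """Closed form U(n) = 2^(n+3) - 2 from the reply."""
    return 2 ** (n + 3) - 2


# Print the first few terms from both definitions side by side.
for n in range(6):
    print(n, u_recursive(n), u_closed(n))
```

Under that assumed starting value the two definitions agree term by term, and 2 U(n) + 2 and 2(U(n) + 1) are the same expression.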

228

pikuma.com@pikuma·1d

If you complain that a company asked you to code a simple palindrome check function during an interview, arguing that it "has nothing to do with the actual job", remember that it's just "cognitive screening." It's similar to asking the simple pattern recognition question below.

Beyza@hicasamadim

If you solve this, you're a genius. Can you solve it?

English

225

26.4K

RB@530RB·1d

@gro_tsen Nice to learn something new, especially when I was about to say this was dumb

English

1.5K

Gro-Tsen@gro_tsen·1d

Surprising math fact of the day: a monkey is hitting keys at random (uniformly, independently & at constant speed) on a keyboard. The expected value of the time T₁ it takes to type “abracadabra” is greater than the expected value of the time T₂ it takes to type “abracadabrz”.
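
The gap between the two expected times can be computed with the standard overlap formula for uniform, independent keystrokes: the expected number of keystrokes until a word first appears is the sum of A^k over every length k at which the word's first k characters equal its last k characters, where A is the number of keys. A minimal sketch follows. The 26-key keyboard is an assumption, since the post does not state a key count, and `expected_keystrokes` is an illustrative name.

```python
def expected_keystrokes(word: str, alphabet_size: int = 26) -> int:
    """Expected keystrokes until `word` first appears in uniform random typing."""
    total = 0
    for k in range(1, len(word) + 1):
        # Each self-overlap (prefix of length k equal to suffix of length k)
        # contributes alphabet_size**k; k == len(word) always counts.
        if word[:k] == word[-k:]:
            total += alphabet_size ** k
    return total


t1 = expected_keystrokes("abracadabra")  # overlaps at k = 1 ("a"), 4 ("abra"), 11
t2 = expected_keystrokes("abracadabrz")  # only the trivial overlap at k = 11
print(t1 - t2)  # 26**4 + 26 = 457002
```

With constant typing speed, time is proportional to keystrokes, so "abracadabra" takes longer on average because it overlaps with itself while "abracadabrz" does not.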

English

2.8K

362.9K

RB@530RB·2d

@NatureUnedited Fear factor was the last time I saw these things

English

Nature Unedited@NatureUnedited·3d

Whip spider attacking with its pedipalps (Euphrynichus amanica)

English

245

1.3K

14.6K

5.8M

RB@530RB·2d

@KarlMuth Could just not include AI and let them see the 0 instead of curving them to 0

English

1.3K

Karl T. Muth 🌐✈️📊@KarlMuth·2d

I know there are many (understatement) approaches to AI use where students are being evaluated, and that there is variation between disciplines and levels of study, but I thought I'd share one and perhaps stoke the debate. Anyone (including my students) is welcome to comment...

Montréal, Québec 🇨🇦 English

478

150.3K

RB@530RB·2d

@Jonathan_Blow Give it a lot of words (say a depth/tree of thesaurus searches) and have it choose the best word from that set. AI is not creative, but it is good at tasks.
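
The suggestion above (expand a seed word through several levels of thesaurus lookups, then ask the model to pick from the resulting set) can be sketched as a breadth-first search. Everything here is hypothetical: `TOY_THESAURUS` is a made-up stand-in for a real thesaurus source, and `collect_candidates` is an illustrative name, not part of any tool mentioned in the thread.

```python
from collections import deque

# Hypothetical toy data standing in for a real thesaurus lookup.
TOY_THESAURUS = {
    "big": ["large", "huge"],
    "large": ["sizable", "big"],
    "huge": ["enormous", "colossal"],
    "enormous": ["gargantuan"],
}


def collect_candidates(seed: str, thesaurus: dict, max_depth: int) -> list:
    """Breadth-first expansion of synonyms up to max_depth lookups from seed."""
    seen = {seed}
    queue = deque([(seed, 0)])
    while queue:
        word, depth = queue.popleft()
        if depth == max_depth:
            continue  # do not expand past the requested depth
        for synonym in thesaurus.get(word, []):
            if synonym not in seen:
                seen.add(synonym)
                queue.append((synonym, depth + 1))
    seen.discard(seed)
    return sorted(seen)


candidates = collect_candidates("big", TOY_THESAURUS, max_depth=2)
prompt = "Choose the best replacement for 'big' from: " + ", ".join(candidates)
print(prompt)
```

The resulting `prompt` turns an open-ended request into a selection task over an explicit candidate list, which is the shift the reply proposes.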

English

Jonathan Blow@Jonathan_Blow·2d

I've been trying to use ChatGPT as a thesaurus but it doesn't seem to be very good ... it keeps making generic suggestions even when prompted like "imagine you are very learned, with a huge vocabulary"... it then just picks older generic words. Any hints? English has 600k words!

English

134

497

57.8K

RB@530RB·2d

@YosarianTwo The lizard brain

English

Yosarian2@YosarianTwo·3d

I am very amused by the "why do people like doordash" discourse. You push a button on the magical rectangle in your pocket and a few minutes later any food you can imagine appears. Sure it's overpriced and mediocre but this is still high magic and the back of your brain knows it

English

1.7K

18.7K

RB@530RB·2d

@RandomSprint No you described how the human is a computer

English

RandomSprint🧭@RandomSprint·3d

If you think about it, a fully mechanical car can be driven to any position on a piece of infinite tape. It can leave marks by spinning out its tires. The driver can follow instructions regarding where to drive based on the tire marks. Every car is a computer.

messed up cars@messedupcars

English

1.3K

58.1K

RB@530RB·2d

@akidderz @nikicaga “It’s not X, it’s Y,” said the AI

English

109

akidderz@akidderz·2d

@nikicaga Scissor stairs weren’t banned because everyone was dumb; they were banned because fire code prized redundancy: two truly separate exits. That safety concern is real. We don’t price the tradeoff: safer vs cheaper. We just act shocked that housing costs exploded.

English

295

30K

Nikolaj🇺🇦🇵🇸@nikicaga·3d

Why would they ever be banned???

Saad Asad@realsaadasad

Washington unanimously legalized scissor stairs, a building code reform that frees up to 56% more living space per floor. Less wasted space means cheaper homes on smaller lots. Most US states banned this since the 1970s for no good reason.

English

209

5.7M

RB@530RB·3d

@cozyblaze265065 What about how many strawberries are in r

English

134

cozyblaze@cozyblaze265065·3d

I redid the multi-digit multiplication experiment, now with gpt-5.5. With medium reasoning and 7 samples each cell, it pretty much aced the test with 99.46% accuracy. The model had no tools to call and had to rely on its reasoning. Can it go further? (1/4)

Yuntian Deng@yuntiandeng

For those curious about how o3-mini performs on multi-digit multiplication, here's the result. It does much better than o1 but still struggles past 13×13. (Same evaluation setup as before, but with 40 test examples per cell.)

English

970

178.6K

RB@530RB·4d

@kittykareninas If you wrote it you can answer questions, in detail, about it. It’s very easy to do if you actually wrote it, but near impossible otherwise.

English

kitty@kittykareninas·4d

i hate ai detection programs so much. my fully HUMAN WRITTEN essay shows up as 87% ai written. what are you even supposed to do if your teacher brings it up? how can you disprove them?

English

1.6K

7.7K

158.2K

4.2M
