Brian Gyss

36.7K posts

Brian Gyss banner
Brian Gyss

Brian Gyss

@BGyss

product guy

Los Angeles, CA Katılım Temmuz 2008
5.7K Takip Edilen712 Takipçiler
Brian Gyss
Brian Gyss@BGyss·
@inductionheads "I actually meant launching missiles at datacenters, not dropping bombs on them"
English
1
0
6
240
Super Dario
Super Dario@inductionheads·
Guys please, airstriking is obviously not bombing, he obviously meant dumping glitter on them, please stop being so hopelessly pedestrian
Super Dario tweet media
English
31
20
271
81.7K
Brian Gyss retweetledi
Brian Gyss retweetledi
Jessy Han
Jessy Han@hjessy_·
Wikipedia editors are now debating if Eric Adams should be described as Albanian-American
Jessy Han tweet media
English
116
1.8K
27.9K
682K
Brian Gyss retweetledi
毎日新聞
毎日新聞@mainichi·
自作ボードゲーム10種超 参加強要の消防士長を懲戒処分 愛知 mainichi.jp/articles/20260… ボードゲームは10種類以上あり、消防士長が白紙に文字を書くなどして全て自作していました。参加は最も多い職員で14回、延べ35時間。金銭のやり取りはなかったとしています。
日本語
639
5.3K
22.1K
15.5M
Brian Gyss retweetledi
New York Mets
New York Mets@Mets·
Absolutely CRUSHED 😤
English
63
362
3.5K
96.8K
Brian Gyss retweetledi
Polymarket
Polymarket@Polymarket·
JUST IN: Massive chimpanzee group in Uganda has reportedly split into rival factions & descended into a deadly “civil war”
English
2.2K
3.7K
37.5K
13M
Brian Gyss retweetledi
Andrej Karpathy
Andrej Karpathy@karpathy·
Judging by my tl there is a growing gap in understanding of AI capability. The first issue I think is around recency and tier of use. I think a lot of people tried the free tier of ChatGPT somewhere last year and allowed it to inform their views on AI a little too much. This is a group of reactions laughing at various quirks of the models, hallucinations, etc. Yes I also saw the viral videos of OpenAI's Advanced Voice mode fumbling simple queries like "should I drive or walk to the carwash". The thing is that these free and old/deprecated models don't reflect the capability in the latest round of state of the art agentic models of this year, especially OpenAI Codex and Claude Code. But that brings me to the second issue. Even if people paid $200/month to use the state of the art models, a lot of the capabilities are relatively "peaky" in highly technical areas. Typical queries around search, writing, advice, etc. are *not* the domain that has made the most noticeable and dramatic strides in capability. Partly, this is due to the technical details of reinforcement learning and its use of verifiable rewards. But partly, it's also because these use cases are not sufficiently prioritized by the companies in their hillclimbing because they don't lead to as much $$$ value. The goldmines are elsewhere, and the focus comes along. So that brings me to the second group of people, who *both* 1) pay for and use the state of the art frontier agentic models (OpenAI Codex / Claude Code) and 2) do so professionally in technical domains like programming, math and research. This group of people is subject to the highest amount of "AI Psychosis" because the recent improvements in these domains as of this year have been nothing short of staggering. When you hand a computer terminal to one of these models, you can now watch them melt programming problems that you'd normally expect to take days/weeks of work. It's this second group of people that assigns a much greater gravity to the capabilities, their slope, and various cyber-related repercussions. TLDR the people in these two groups are speaking past each other. It really is simultaneously the case that OpenAI's free and I think slightly orphaned (?) "Advanced Voice Mode" will fumble the dumbest questions in your Instagram's reels and *at the same time*, OpenAI's highest-tier and paid Codex model will go off for 1 hour to coherently restructure an entire code base, or find and exploit vulnerabilities in computer systems. This part really works and has made dramatic strides because 2 properties: 1) these domains offer explicit reward functions that are verifiable meaning they are easily amenable to reinforcement learning training (e.g. unit tests passed yes or no, in contrast to writing, which is much harder to explicitly judge), but also 2) they are a lot more valuable in b2b settings, meaning that the biggest fraction of the team is focused on improving them. So here we are.
staysaasy@staysaasy

The degree to which you are awed by AI is perfectly correlated with how much you use AI to code.

English
946
2.3K
19.4K
3.8M
Brian Gyss retweetledi
Did Da Yankees Lose?
Did Da Yankees Lose?@DAAAYANKEESLOSE·
FIRST SHUTOUT OF THE YEAR Athletics 1 Yankees 0 DAAAAAAAAAA YANKEES LOSE!!
English
19
140
1.6K
19.3K
Brian Gyss retweetledi
Kevin Roose
Kevin Roose@kevinroose·
I think it is generally a good idea to take AI company claims about unreleased models with a grain of salt, but at this point the deniers are implying there is an industry-wide conspiracy to let Anthropic claim they are finding zero-day exploits in critical software with, I guess, a basement full of world-class white hat hackers pretending to be Claude?
Dean W. Ball@deanwball

It’s crazy that some are just straight up in denial about mythos having the capabilities anthropic says it does. Usually the in-denial-about-AI community is able to cloak their views in at least *some* intellectual garb, but this time it’s just, “it’s not real.” Wild. Also sad.

English
24
26
343
55.2K
Brian Gyss retweetledi
Cinesthesia
Cinesthesia@cinesthesian·
THE CAT (1992) Lam Ngai Kai Source: 88 Films Blu-ray
English
4
61
437
59K
Brian Gyss retweetledi
Allan
Allan@allan_cheapshot·
One of Kodo Fuyuki’s dying wishes was to wrestle Shinya Hashimoto one last time in a barbed wire deathmatch at Kawasaki Stadium on 5 May 2003. Fuyuki passed before it could happen, so Hashimoto and Kintaro Kanemura carried his ashes into the wire as his wife sobbed at ringside.
GIF
English
13
342
2.1K
99.5K
Brian Gyss retweetledi
Peter Wildeford🇺🇸🚀
Peter Wildeford🇺🇸🚀@peterwildeford·
Anthropic running 10,000 Mythos models in parallel to find cutting-edge cyber exploits... meanwhile your sister using Microsoft Copilot with some Haiku-sized model and she thinks AI is just hype. "The future is already here, just not evenly distributed" has never been more apt
English
96
394
5.5K
145.5K
Brian Gyss retweetledi
AI Engineer
AI Engineer@aiDotEngineer·
The queue to get into @mattpocockuk workshop "AI Coding for real engineers" is around the floor. We're re-translating this workshop to the Gielgud room as well. Matt is starting with a smart zone | dumb zone idea from @dexhorthy
AI Engineer tweet media
English
7
6
51
6K
Brian Gyss retweetledi
小島秀夫
小島秀夫@Kojima_Hideo·
「フェノミナ」4KUHD購入。
小島秀夫 tweet media
日本語
8
45
455
37.1K
Brian Gyss retweetledi
Super Dario
Super Dario@inductionheads·
The super important thing I haven’t seen mentioned yet as upshot of this: It’s not just that people won’t HAVE to write code anymore, ITS THAT LITERALLY IT WILL BE UNSAFE TO DO SO
Anthropic@AnthropicAI

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

English
77
132
2.4K
156K