Gil Noh (노태길)

2.7K posts

@tailblues

Machine Learning Research Engineer at OMQ GmbH

Heidelberg, Germany · Joined March 2010
729 Following · 133 Followers
Gil Noh (노태길) reposted
Scott @scottjla
@simonw I feel like we need to stack these tests now
Scott tweet media
English
18
26
363
122.9K
Gil Noh (노태길) reposted
Dr Mo' Flo' Mojo @MoFloMoJo
@ZPostFacto Doesn't need to be toddlers. It's enough to know that some people will press blue to justify pressing blue to save them. That's the only option that saves everyone, not this "everyone just press red" fantasy selfish cunts use to rationalise throwing everyone else under the bus.
English
8
1
90
3.1K
Gil Noh (노태길) @tailblues
@bcherny @GergelyOrosz This easily reproducible thing was not even tried, and the reaction was to say "they must have turned off the search". For anyone who suffers from this: ask Claude to update its memory with "a) 4.7 is buggy and tends not to use search, b) when an unknown word/concept comes up, always search"
English
0
0
0
23
Gil Noh (노태길) @tailblues
@bcherny @GergelyOrosz This is really lame and must be painful for @bcherny, because this was easily reproducible for any user. For me, Opus4.7 mapped "openclaw" to "opencode" without searching. Not a big deal: you can add a memory to "search each time an unknown word comes in". However...
English
1
0
0
413
Gergely Orosz @GergelyOrosz
Claude just keeps regressing for me, day after day. I swear that until a few days ago, when Claude did not know something, it kicked off a web search, figured out, and answered. Now it just refuses to do the work that I pay for. It's like showing you the middle finger. Really?
Gergely Orosz tweet media
English
248
74
2.2K
198.6K
Gil Noh (노태길) reposted
Natalie Khalil @natalienkhalil
Basically
English
81
1.4K
6.9K
1.3M
Gil Noh (노태길) @tailblues
Pretty neat and probably right (?) approach. I mean, if I had to bet, I think the term "Mismanaged Geniuses Hypothesis" will stick to the community and will likely be shown (at least partially) to be true. Exciting times.
alex zhang @a1zhang

x.com/i/article/2041…

English
0
0
0
31
Gil Noh (노태길) @tailblues
@morganlunt Nice feature, but two bugs stopped me from using it. - Bedrock region: typing the region "eu-central-1" is not possible: the char '-' stops the input. - Once configured, the /login command disappears. I cannot get back to my Claude subscription unless I edit the conf file...
English
1
0
0
383
Morgan Lunt @morganlunt
Tired of manually writing config files and env vars to use Claude Code with Amazon Bedrock or Google Vertex? Me too. Just shipped a setup wizard that handles it for you. Bonus: Claude Code now notices if you have an older model pinned and suggests a newer one if you have access!
English
10
14
161
85.4K
Gil Noh (노태길) @tailblues
@vboykis Maybe we have hope. Claude Code/buddy pet gives me meaningless jokes every time it sees an interruption from the corner of his side of the terminal...
English
0
0
0
41
Gil Noh (노태길) @tailblues
@yoavgo Ah, good point. And I predict that just like the RTFM era, people won't read it and we will keep putting a lot of emphatic 'F' in RTF-anything.
English
0
0
1
65
Gil Noh (노태길) reposted
James Garcia Alver @JayAlver
-93 bytes, one of the best Wikipedia line deletions I’ve ever seen.
James Garcia Alver tweet media
English
47
1.2K
11.7K
804.4K
Gil Noh (노태길) reposted
Michael Hla @hla_michael
I trained an LLM from scratch on pre-1900 text to see if it could come up with quantum mechanics and relativity. While the model is too small to do meaningful reasoning, it has glimpses of intuition. When given observations from past landmark experiments, the model can declare that “light is made up of definite quantities of energy” and even suggest that gravity and acceleration are locally equivalent. I’m releasing the dataset + models and leave this as an open problem to the research community. I also include what this project has taught me about intelligence in a mini essay linked below. 🧵(1/n)
English
117
261
2K
310.8K
Gil Noh (노태길) @tailblues
The Wall fell before my time, but its strata turn up in the oddest places.
English
0
0
0
9
Gil Noh (노태길) @tailblues
🇬🇧 From a Korean perspective, Germany is an enviably well-unified country. But 35 years on, the East-West line still snags on the smallest things. Not just election maps — blood glucose meters ship in mg/dL (West) vs. mmol/L (East), still a top return reason on Amazon.
English
1
0
0
20
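Gil Noh (노태길) @tailblues
From Korea's vantage point, Germany is an enviably well-unified country. Yet even 35 years on, the East-West divide still catches like a raised threshold. It is not just the election maps. Blood glucose meters use mg/dL in the West and mmol/L in the East - the number one reason for returns on Amazon. The Wall came down long ago, but its traces live on like this.
Korean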
1
0
0
64
Gil Noh (노태길) @tailblues
"AI can’t think, doesn’t have a mind and, in fact, is inherently untrustworthy." sigh... What was the term for such BS? 'not even wrong'?
Kevin Hartnett @KSHartnett

A strikingly lucid column from @mims. The Turing Test has become the Turing Trap. One of my favorite aspects of AI is that it essentially acts as an invariant which distinguishes between modes of human cognition that were previously harder to tease apart. The nut graf: "At the time, language was thought to be closely associated with reasoning, but modern neuroscience shows us that it’s a separate process. Speaking isn’t the same as thinking, let alone being." wsj.com/tech/ai/ai-too…

English
0
0
0
24
Gil Noh (노태길) reposted
Psyho @FakePsyho
@lossfunk It's a shame, because this was a good concept for a benchmark. At least, thanks to your benchmark, I've learned that Chollet will happily share anything that fits his narrative. That's a useful bit of information.
English
1
1
78
3.2K