Máté Gelei

659 posts

@MateGelei

Experienced #DevOps engineer with an MBA and a background in IT #ServiceManagement. Usually rambling about #cloud, #finops, and #AI. Tweets/opinions are my own.

Budapest, Hungary · Joined July 2024
60 Following · 31 Followers
Máté Gelei
Máté Gelei@MateGelei·
@ClementDelangue "Hey Claude, create a Github workflow: if a user has 2 open PRs and submits a 3rd, close all of their PRs." You're welcome.
0
0
0
35
clem 🤗
clem 🤗@ClementDelangue·
Our biggest open-source repos are getting overwhelmed by AI slop which literally makes Github unusable (~a new pull request every 3 minutes). Fun new challenges in an agentic world!
[image]
160
103
1.2K
184.2K
Máté Gelei retweeted
The Untraceable
The Untraceable@untraceable_the·
@bookercodes You see slop i see free tokens. Hope its using opus tho
4
3
143
10.3K
Brian Roemmele
Brian Roemmele@BrianRoemmele·
Elon, I think it is the best AI model upgrade across all platforms thus far. I have convinced the last OpenAI hold out clients to move to X.ai APIs. The absolute tonnage of @Grok Heavy in lifting power is stunning and closed the last hold out. We will need the space telescope to observe number two so far behind. Thank you and the team!
202
161
1.5K
24.2M
Máté Gelei
Máté Gelei@MateGelei·
@johncrickett Right, about 10 years ago we used Visual Studio, which was an IDE several GBs in size. There are people with several hundred lines in their .vimrc file. But God forbid an AI harness have 5-6 different methods with different purposes and mechanics to influence the underlying agent.
0
0
1
16
John Crickett
John Crickett@johncrickett·
I spent the weekend actually reading the Claude Code docs. It's a rabbit hole. CLAUDE.md files. MCP configs. Skills. Subagents. Hooks. Plugins. Agent Teams. You could spend more time configuring Claude Code than building software. All of it is productivity theatre. The only thing that actually matters: think first, then give it focused, relevant context.
113
29
735
77.8K
zack's lab
zack's lab@zackslab·
@MateGelei @svpino you could have easily said this to a human and they'd have done the same thing. if it kicks off at 8AM but doesn't return results until 8:03AM, it didn't happen at 8AM.
1
0
0
67
Santiago
Santiago@svpino·
Claude is retarded. All of these models are. I wanted to schedule a skill every day at 8:00 am. Claude decided to schedule it at 7:57 am "to avoid the on-the-dot" surge. I SAID 8:00 AM! Do the darn thing the way I asked you to do it! You gotta be crazy to trust these models.
[image]
260
33
877
146.8K
zack's lab
zack's lab@zackslab·
@svpino make your spec explicit. no different than assigning humans tasks.
4
0
16
7.6K
Kartik Sarjine
Kartik Sarjine@Kartikez·
@amritwt If I am right, it's been a while since Google released a new model? When Antigravity came out it came with 3.1 pro.
3
0
0
463
amrit
amrit@amritwt·
This is the AI leaderboard on code. Top five is all anthropic. Then there's 5.4 high. Gemini doesn't even feel like it's anything. Among the beasts, we have two open source models here.
[image]
23
0
134
6.9K
riley.
riley.@lamxnt·
Buying a Gemini subscription is genuinely the most user unfriendly experience I’ve ever had
111
33
1.7K
126.5K
Ashley Peacock
Ashley Peacock@_ashleypeacock·
@MateGelei @somi_ai @cryptopunk7213 If they take humans 2-3 hours and are not trivial, I wouldn’t trust an LLM with them either. You can likely get a steer from an LLM much cheaper anyway
1
0
0
16
Ejaaz
Ejaaz@cryptopunk7213·
this is fucking ridiculous lol - anthropic just killed a $50B industry with a single feature (again):
- companies pay $50K a year to scan their code for vulnerabilities.
- anthropics Code Review does it for you in minutes for a fraction of the cost.
- deploys multiple agents to hunt for bugs in your code.
internal results show its amazing (84% hit rate on 1000+ line code base)
for comparison: anthropic cost = $15-25 PER review, trad competitor cost = $99+
complete fucking no brainer. watch the appsec stocks react to this one
Claude@claudeai

Introducing Code Review, a new feature for Claude Code. When a PR opens, Claude dispatches a team of agents to hunt for bugs.

232
179
3.1K
904.3K
Ashley Peacock
Ashley Peacock@_ashleypeacock·
@MateGelei @somi_ai @cryptopunk7213 That’s a different ball game 😅 OSS isn’t paying for code review from AI, and in an enterprise company, that would typically be reviewed in a timely manner
1
0
0
23
𝐌𝐀𝐉𝐈𝐊/𝕤𝕥𝕦𝕕𝕚𝕠𝕤
xAI commands the long-term timeline. They have more compute than anyone and more ability to add compute than anyone. 1-2yrs is more than enough for them to easily pull ahead without doing anything different. @Grok has the strongest model foundation and the most runway. Model ≠ Pipeline ≠ Tools
3
0
1
532
Máté Gelei
Máté Gelei@MateGelei·
@RhysSullivan This is on you, folks. My doorbell doesn't even require a phone.
1
0
18
1.4K
Rhys
Rhys@RhysSullivan·
I have to pay a monthly subscription to get notifications from my doorbell are you fucking kidding me
[image]
68
25
914
38.1K
Máté Gelei
Máté Gelei@MateGelei·
@_ashleypeacock @somi_ai @cryptopunk7213 It's not about the number of lines changed. I had a PR with 4 lines (four!) to a crypto lib that fixed a bug around salting passwords. It took hours, if not days to validate.
1
0
0
27
Ashley Peacock
Ashley Peacock@_ashleypeacock·
Who are these senior engineers and how are they spending 2-3 hours reviewing one big PR? It would have to be thousands upon thousands of lines across 100’s of files, at which point… it’s way too big, gets broken down, and reviewed in sensible chunks (that won’t take 2-3 hours, even combined across broken down PRs)
1
0
2
113
Máté Gelei
Máté Gelei@MateGelei·
@dudufolio Mistral isn't much behind free-tier ChatGPT, which is the version of ChatGPT that the absolute majority of people interact with.
0
0
0
64
dudu
dudu@dudufolio·
Imagine being this European
[image]
14
0
56
9.4K
Máté Gelei
Máté Gelei@MateGelei·
You need to understand that as a paying customer I can only compare existing products that I can actually buy. Gemini having huge potential is not something I can actually use in my job.
0
0
9
424
Chayenne Zhao
Chayenne Zhao@GenAI_is_real·
gemini hasnt failed, people just judge AI labs on the wrong timeline. google has the deepest research bench in the world, the most compute, and distribution across 2 billion chrome users. the problem is that big company evolution has natural blockers - review cycles, cross-team dependencies, launch approvals - that slow down iteration speed. anthropic and openai move fast because theyre still small enough to. but google has been through this cycle before with search, cloud, android. they start slow and then the institutional gravity kicks in. i would not count them out @pcshipp
pc@pcshipp

Still I’m wondering why Gemini fails against Claude and GPT.
- Owns Chrome
- Backed by Android
- Stores most search results
- Holds ~95% search history
- Google has the biggest user data
- Even incognito data isn’t fully private
So what’s the problem?

23
13
438
49.3K
Máté Gelei
Máté Gelei@MateGelei·
@justbyte_ Yes, I'll just use strings instead of char[]s, who cares anyway
0
0
0
138
Aryan
Aryan@justbyte_·
Dear developers, can you code now??
[image]
384
52
810
54.9K
Sarthak
Sarthak@Sarthak4Alpha·
Interviewer: If cache is faster than database, why not store everything in cache?
187
48
2.3K
413.2K