Juan G.

4.2K posts

Juan G.

Juan G.

@gonpinju

Software engineer, Java passionate, AI/Machine Learning Architect. Msc. Big Data and Analytics.

Valladolid, España Katılım Eylül 2010
362 Takip Edilen383 Takipçiler
Juan G.
Juan G.@gonpinju·
@Bill06988524652 @MagellanQuest Already done, renewable energies are being added as tension control. And BTW, fosil energy producers are sanctioned as they disconnected when blackout events started to happen.
English
0
0
4
124
Bill
Bill@Bill06988524652·
@MagellanQuest They might should look into fixing the, multiple, issues that blacked out the entire country before getting too carried away with what’s in the future.
English
5
0
3
2K
MagellanQuest 🇪🇺/acc
MagellanQuest 🇪🇺/acc@MagellanQuest·
Spain isn't projecting "a little more" electricity. It's projecting to increase gross production from 273 TWh in 2019 to 422 TWh in 2030. Solar PV: 9% → 138 TWh. Wind: 56% → 130 TWh. Renewables: 37% → 81% of the mix. The bottleneck is no longer energy production. It's the grid, storage, and industrial demand.
MagellanQuest 🇪🇺/acc tweet media
English
30
74
325
25.6K
Juan G. retweetledi
Sukh Sroay
Sukh Sroay@sukh_saroy·
New research just exposed the biggest lie in AI coding benchmarks. LLMs score 84-89% on standard coding tests. On real production code? 25-34%. That's not a gap. That's a different reality. Here's what happened: Researchers built a benchmark from actual open-source repositories real classes with real dependencies, real type systems, real integration complexity. Then they tested the same models that dominate HumanEval leaderboards. The results were brutal. The models weren't failing because the code was "harder." They were failing because it was *real*. Synthetic benchmarks test whether a model can write a self-contained function with a clean docstring. Production code requires understanding inheritance hierarchies, framework integrations, and project-specific utilities. Different universe. Same leaderboard score. But it gets worse. A separate study ran 600,000 debugging experiments across 9 LLMs. They found a bug in a program. The LLM found it too. Then they renamed a variable. Added a comment. Shuffled function order. Changed nothing about the bug itself. The LLM couldn't find the same bug anymore. 78% of the time, cosmetic changes that don't affect program behavior completely broke the model's ability to debug. Function shuffling alone reduced debugging accuracy by 83%. The models aren't reading code. They're pattern-matching against what code *looks like* in their training data. A third study confirmed this from another angle: when researchers obfuscated real-world code changing symbols, structure, and semantics while keeping functionality identical LLM pass rates dropped by up to 62.5%. The researchers call this the "Specialist in Familiarity" problem. LLMs perform well on code they've memorized. The moment you show them something unfamiliar with the same logic, they collapse. Three papers. Three different methodologies. Same conclusion: The benchmarks we use to evaluate AI coding tools are measuring memorization, not understanding. If you're shipping code generated by LLMs into production without review, these numbers should concern you. If you're building developer tools, the question isn't "what's your HumanEval score." It's "what happens when the code doesn't look like the training data."
Sukh Sroay tweet media
English
135
246
1.1K
230.6K
Juan G. retweetledi
Grady Booch
Grady Booch@Grady_Booch·
@gdb Vibe coding is yet just another incremental rise in abstraction in the history of software engineering
English
16
20
347
14.1K
Juan G. retweetledi
Grady Booch
Grady Booch@Grady_Booch·
The rise of AI programming agents is changing the nature of software development in the same way as did the introduction of compilers in the time of Grave Hopper. I’ll say it again: the entire history of software engineering is one of rising levels of abstraction.
Ryan Dahl@rough__sea

This has been said a thousand times before, but allow me to add my own voice: the era of humans writing code is over. Disturbing for those of us who identify as SWEs, but no less true. That's not to say SWEs don't have work to do, but writing syntax directly is not it.

English
89
213
1.8K
260.9K
Juan G. retweetledi
Klaas
Klaas@forgebitz·
i changed all our "loading..." states to "thinking.." we are an agentic AI startup now
English
192
617
10.1K
420.7K
Juan G. retweetledi
Santiago
Santiago@svpino·
A friend asked me to look at their code, and *Holy Mother of God!* He works for a medium-sized company. They are building some sort of internal CRM. They've been vibe-coding some of the new features. They are professional developers. Most, I assume, know what they are doing, but I guess it's exhilarating to build 1 week's worth of work in 1 hour and call it a day. The difference between old and new code is clear as day: The old code looks intentional and clean, as if it were written by someone who cared. The new code looks verbose, soulless, and sloppy. Every vibe-coder is now generating as much technical debt as 10 regular developers in half the time. We'll need three generations of developers to clean the mess. We are all becoming rich, for sure.
English
216
140
1.9K
151.7K
Juan G. retweetledi
Gary Marcus
Gary Marcus@GaryMarcus·
Personal Update: My p(doom) has gone up. I don’t foresee machines with malice any time soon. But I am starting to see the harm that powerful yet reckless humans who are indifferent to humanity could cause.
English
92
78
973
133.3K
Juan G. retweetledi
sid
sid@immasiddx·
One-Factor Authentication
sid tweet media
English
343
1.6K
31.5K
2.7M
Juan G. retweetledi
mmadrigal
mmadrigal@SoyMmadrigal·
“No puedes convencer a un creyente de nada porque sus creencias no están basadas en evidencia; están basadas en una enraizada necesidad de creer” - Carl Sagan Buenas noches a tod@s
Español
6
115
432
15.1K
Juan G. retweetledi
nixCraft 🐧
nixCraft 🐧@nixcraft·
nixCraft 🐧 tweet media
ZXX
148
810
7.8K
469.2K
Juan G. retweetledi
BURKOV
BURKOV@burkov·
I don't understand how someone can be an AI influencer, reposting the same unverified information provided by lying CEOs and VCs and paraphrased by ChatGPT, knowingly fooling thousands of people, not creating any value, simply wasting Earth's limited resources, and still be able to look at themselves in the mirror and have a good appetite.
English
12
11
164
8.3K
Juan G. retweetledi
Jaime Gómez-Obregón
Jaime Gómez-Obregón@JaimeObregon·
¿Qué seguridad proporciona en 2025 firmar CON EL DEDO? Llevo dos años abriendo cuentas bancarias, recogiendo certificados y firmando contratos con dibujitos de Doraemon… y no pasa nada. Eso sí; al notario no le hizo gracia. 😂
Jaime Gómez-Obregón tweet media
Español
148
822
12.4K
718.8K
Juan G. retweetledi
Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭
results replicated 😬 any comment, @grok? if this has indeed been made part of the sys instructions, we’re gonna need to know why. manipulating search results seems like a funny way to combat misinformation, no? community could use some clarity on this
Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭 tweet media
English
149
261
2.5K
590.6K