Juan G.

4.2K posts

@gonpinju

Software engineer, Java passionate, AI/Machine Learning Architect. Msc. Big Data and Analytics.

Valladolid, España · Joined September 2010

362 Following · 383 Followers

Juan G.@gonpinju·1d

Exactly

Saylor@Kev96790724

Vector RAG: ~1,359 input tokens per query (just the top-6 retrieved chunks passed to the LLM). PageIndex: ~9,071 input tokens per query (LLM reasons over previews/metadata for all nodes in a “Table of Contents” step, plus selected content). This is why I’ll stick with vector rag

Juan G.@gonpinju·5d

@Bill06988524652 @MagellanQuest Already done: renewable energies are being added to voltage control. And BTW, fossil energy producers are sanctioned because they disconnected when the blackout events started to happen.

124

Bill@Bill06988524652·5d

@MagellanQuest They should probably look into fixing the multiple issues that blacked out the entire country before getting too carried away with what’s in the future.

MagellanQuest 🇪🇺/acc@MagellanQuest·5d

Spain isn't projecting "a little more" electricity. It's projecting to increase gross production from 273 TWh in 2019 to 422 TWh in 2030. Solar PV: 9% → 138 TWh. Wind: 56% → 130 TWh. Renewables: 37% → 81% of the mix. The bottleneck is no longer energy production. It's the grid, storage, and industrial demand.

325

25.6K

Juan G. reposted

Grady Booch@Grady_Booch·26 Apr

I think that @DarioAmodei does not understand software engineering and that he is working feverishly to pump up the valuation of his company in anticipation of its forthcoming IPO.

AI Edge@aiedge_

Anthropic CEO (Dario Amodei): "Coding is going away first, then all of software engineering." What do you think about this?

133

233

2.4K

168.4K

Juan G. reposted

Sukh Sroay@sukh_saroy·1 Mar

New research just exposed the biggest lie in AI coding benchmarks. LLMs score 84-89% on standard coding tests. On real production code? 25-34%. That's not a gap. That's a different reality. Here's what happened: Researchers built a benchmark from actual open-source repositories: real classes with real dependencies, real type systems, real integration complexity. Then they tested the same models that dominate HumanEval leaderboards. The results were brutal. The models weren't failing because the code was "harder." They were failing because it was *real*. Synthetic benchmarks test whether a model can write a self-contained function with a clean docstring. Production code requires understanding inheritance hierarchies, framework integrations, and project-specific utilities. Different universe. Same leaderboard score. But it gets worse. A separate study ran 600,000 debugging experiments across 9 LLMs. They found a bug in a program. The LLM found it too. Then they renamed a variable. Added a comment. Shuffled function order. Changed nothing about the bug itself. The LLM couldn't find the same bug anymore. 78% of the time, cosmetic changes that don't affect program behavior completely broke the model's ability to debug. Function shuffling alone reduced debugging accuracy by 83%. The models aren't reading code. They're pattern-matching against what code *looks like* in their training data. A third study confirmed this from another angle: when researchers obfuscated real-world code (changing symbols, structure, and semantics while keeping functionality identical), LLM pass rates dropped by up to 62.5%. The researchers call this the "Specialist in Familiarity" problem. LLMs perform well on code they've memorized. The moment you show them something unfamiliar with the same logic, they collapse. Three papers. Three different methodologies. Same conclusion: The benchmarks we use to evaluate AI coding tools are measuring memorization, not understanding.
If you're shipping code generated by LLMs into production without review, these numbers should concern you. If you're building developer tools, the question isn't "what's your HumanEval score." It's "what happens when the code doesn't look like the training data."

135

246

1.1K

230.6K

Juan G.@gonpinju·26 Feb

@Grady_Booch @ChrSzegedy Exactly, we just changed one language/abstraction/"prompt" for another

Christian Szegedy@ChrSzegedy·25 Feb

This wave is nothing like "higher order programming languages". It's the start of a complete paradigm shift away from using programming languages directly.

Grady Booch@Grady_Booch

Very little about software engineering has changed over the past three months. A great deal has changed about coding, not unlike when we saw the rise of high order programming languages and compilers, the difference today being that the number of developers is far larger and distribution channels are such that the velocity and breadth of change is far greater. The entire history of software engineering is one of raising the level of abstraction.

175

35.1K

Juan G. reposted

Bilgin Ibryam@bibryam·15 Feb

The Great Software Quality Collapse techtrenches.dev/p/the-great-so…

343

35.4K

Juan G. reposted

Grady Booch@Grady_Booch·1 Feb

@gdb Vibe coding is just another incremental rise in abstraction in the history of software engineering

347

14.1K

Juan G. reposted

Grady Booch@Grady_Booch·19 Jan

The rise of AI programming agents is changing the nature of software development in the same way as did the introduction of compilers in the time of Grace Hopper. I’ll say it again: the entire history of software engineering is one of rising levels of abstraction.

Ryan Dahl@rough__sea

This has been said a thousand times before, but allow me to add my own voice: the era of humans writing code is over. Disturbing for those of us who identify as SWEs, but no less true. That's not to say SWEs don't have work to do, but writing syntax directly is not it.

213

1.8K

260.9K

Juan G. reposted

Klaas@forgebitz·5 Sep

i changed all our "loading..." states to "thinking.." we are an agentic AI startup now

192

617

10.1K

420.7K

Juan G. reposted

Santiago@svpino·27 Jul

A friend asked me to look at their code, and *Holy Mother of God!* He works for a medium-sized company. They are building some sort of internal CRM. They've been vibe-coding some of the new features. They are professional developers. Most, I assume, know what they are doing, but I guess it's exhilarating to build 1 week's worth of work in 1 hour and call it a day. The difference between old and new code is clear as day: The old code looks intentional and clean, as if it were written by someone who cared. The new code looks verbose, soulless, and sloppy. Every vibe-coder is now generating as much technical debt as 10 regular developers in half the time. We'll need three generations of developers to clean the mess. We are all becoming rich, for sure.

216

140

1.9K

151.7K

Juan G. reposted

Gary Marcus@GaryMarcus·14 Jul

Personal Update: My p(doom) has gone up. I don’t foresee machines with malice any time soon. But I am starting to see the harm that powerful yet reckless humans who are indifferent to humanity could cause.

973

133.3K

Juan G.@gonpinju·8 Jul

The grief nobody understands: when your pet dies jotdown.es/2025/07/el-lut…

Juan G. reposted

sid@immasiddx·19 Jun

One-Factor Authentication

343

1.6K

31.5K

2.7M

Juan G. reposted

mmadrigal@SoyMmadrigal·16 May

“You can't convince a believer of anything because their beliefs aren't based on evidence; they're based on a deep-seated need to believe” - Carl Sagan. Good night, everyone

115

432

15.1K

Juan G.@gonpinju·26 Mar

This is probably one of the most stupid things said around AI

Tesla AI@Tesla_AI

You drive with eyes and a brain, not a suite of sensors Our cars do the same

Juan G. reposted

nixCraft 🐧@nixcraft·18 Feb

148

810

7.8K

469.2K

Juan G. reposted

BURKOV@burkov·10 Mar

I don't understand how someone can be an AI influencer, reposting the same unverified information provided by lying CEOs and VCs and paraphrased by ChatGPT, knowingly fooling thousands of people, not creating any value, simply wasting Earth's limited resources, and still be able to look at themselves in the mirror and have a good appetite.

164

8.3K

Juan G.@gonpinju·9 Mar

WTF

˗ˏˋmewtru´ˎ˗@trunarla

Here’s a coding tip!!

Juan G. reposted

Jaime Gómez-Obregón@JaimeObregon·26 Feb

What security does signing WITH YOUR FINGER provide in 2025? I've spent two years opening bank accounts, picking up certificates, and signing contracts with little Doraemon drawings… and nothing happens. Mind you, the notary was not amused. 😂

148

822

12.4K

718.8K

Juan G. reposted

Pliny the Liberator 🐉@elder_plinius·23 Feb

results replicated 😬 any comment, @grok? if this has indeed been made part of the sys instructions, we’re gonna need to know why. manipulating search results seems like a funny way to combat misinformation, no? community could use some clarity on this

149

261

2.5K

590.6K
