Owen Rees-Hayward

6.5K posts

Owen Rees-Hayward banner
Owen Rees-Hayward

Owen Rees-Hayward

@owen4d

Analytical thinker. Insanely curious person. Dreamer. Believer. Husband. Father // Creator of spark-fires // Data Engineer - all views my own, etc.

Bristol, UK Katılım Temmuz 2009
162 Takip Edilen335 Takipçiler
Owen Rees-Hayward retweetledi
Black Hole
Black Hole@konstructivizm·
Saturn's famous hexagon. This atmospheric phenomenon rotates counterclockwise at approximately 320 km/h, completing one rotation every 10 hours and 40 minutes. The hexagon is approximately 25,000 km in diameter. Remarkably, while storms on other planets change and disappear, this hexagon remains unchanged for at least 40 years.
Black Hole tweet media
English
141
634
3.6K
147.5K
Owen Rees-Hayward retweetledi
Sundar Pichai
Sundar Pichai@sundarpichai·
Introducing EmbeddingGemma, our newest open model that can run completely on-device. It's the top model under 500M parameters on the MTEB benchmark and comparable to models nearly 2x its size – enabling state-of-the-art embeddings for search, retrieval + more.
English
199
517
7.5K
534.7K
Owen Rees-Hayward retweetledi
Ari Morcos
Ari Morcos@arimorcos·
Today, we introduce BeyondWeb, our synthetic data generation approach which significantly outperforms all open synthetic data and is a key component of our curation pipeline. Many talk about “doing” synthetic data, but generating high-quality synthetic data at scale is extremely difficult and requires getting a ton of details right. We've performed thousands upon thousands of experiments to figure out the best way to generate trillions of synthetic tokens at scale. Datology means the science of data, and rigorously understanding what actually makes a data point useful for learning is key to curating the best datasets. Check out the below thread for some of the scientific insights into what makes synthetic data high-quality!
Pratyush Maini@pratyushmaini

1/Pretraining is hitting a data wall; scaling raw web data alone leads to diminishing returns. Today @datologyai shares BeyondWeb, our synthetic data approach & all the learnings from scaling it to trillions of tokens🧑🏼‍🍳 - 3B LLMs beat 8B models🚀 - Pareto frontier for performance

English
3
23
131
28.6K
Owen Rees-Hayward retweetledi
Nathan Marz
Nathan Marz@nathanmarz·
Great post from G+D Netcetera on how Rama transformed their business. This goes into more depth than the case study I wrote. Highlights: "I inherited a costly problem that plagues most modern backends: architectures built by stitching together dozens of independent components, creating waste and complexity in pursuit of scale... The traditional architecture design made this worse by performing expensive data denormalization at request time... Our team decided to try to flip the architecture by moving computation from request time to change time... Rama introduced a new approach to backend development that felt aligned with the needs of modern systems... This unified approach gave us a path to eliminate redundant infrastructure and dramatically simplify our entire system topology... Our team used Rama to build an incremental content denormalization engine, which is essentially a live, always-up-to-date materialized view of our entire content graph. This goes beyond streaming processing as it provides a persistent, queryable state that updates in real-time... The strongest resistance came from management, and their concerns were completely valid. They weren’t questioning the technical vision; they were questioning the specific execution risks... While I did not have definitive answers to every concern that was raised, I had conviction and a strategy to move us forward... We began to see the transformation gains unfold before our eyes, as content updates that previously took multiple minutes through our Varnish cache now propagated in under a second. System load that previously overwhelmed our infrastructure during traffic spikes simply disappeared... Everyone acknowledged the potential after seeing the progress made, but when it came time for final commitment, belief and accountability proved to be different things. Ultimately, it was my risk to own, which made the decision simple: we would fully commit... Having reached our deadline on-time and not exceeding our budget, I can happily report that the results exceeded expectations. Our denormalization engine now supports hundreds of thousands of page views across 15 web applications and 30 native apps, while cutting AWS costs and infrastructure complexity by more than half... I won’t sugar coat it, the learning curve is quite steep. Although mastering the implementation took a few months, our engineers became productive with Rama in just a few weeks with the gains coming in early and consistently. We did not just hit performance targets; we also built a foundation for a new class of solutions."
English
1
8
31
3K
Owen Rees-Hayward retweetledi
Richard Seroter
Richard Seroter@rseroter·
"The test demonstrated Keeta Network is capable of over 11 million transactions per second (TPS), significantly outperforming traditional layer-1 blockchains and opening new opportunities for what is possible with blockchain technology." cloud.google.com/blog/topics/fi… < Spanner!
English
0
1
1
467
Owen Rees-Hayward retweetledi
Chris Albon
Chris Albon@chrisalbon·
Yep totally agree. I like memory, often times I want to explore an idea (e.g. a career decision or something) and then discard it, but right now those memories "infect" all other conversations.
Ethan Mollick@emollick

The new features for projects in ChatGPT are useful... but the fact that memory continues to be universal across all chats is a limitation. We need real control: At the project level, turn memory on/off, choose universal or not. The AI learning specific contexts would be great.

English
1
2
8
2.4K
Owen Rees-Hayward retweetledi
Miettinen Jesse - Blenderesse
Miettinen Jesse - Blenderesse@JesseMiettinen·
#geometrynodes Transformers gear wormy guy with simple physics. I´ve seen this in my feed so many times past week that had to test some #simulationnodes based rig for it. #b3d Gotta do some destruction practice next as well.
English
26
99
1.2K
49.1K
Owen Rees-Hayward retweetledi
Junior Rojas
Junior Rojas@junior_rojas_d·
Many people have asked me which library I use for my simulations, and I didn't have an answer until now. Excited to share that I'm releasing algovivo, a JS + WASM library to simulate soft-bodied virtual creatures. demo: juniorrojas.com/algovivo code: github.com/juniorrojas/al… 🧵
English
22
191
1.2K
153.1K
Owen Rees-Hayward retweetledi
Dagster
Dagster@dagster·
Dagster's project Airlift makes it easy to observe and migrate Apache Airflow pipelines to Dagster, bringing value ASAP.
English
1
4
4
1.2K
Owen Rees-Hayward retweetledi
Matei Zaharia
Matei Zaharia@matei_zaharia·
Really cool result from the Databricks research team: You can tune LLMs for a task *without data labels*, using test-time compute and RL, and outperform supervised fine-tuning! Our new TAO method scales with compute to produce fast, high-quality models. databricks.com/blog/tao-using…
Matei Zaharia tweet media
English
9
70
432
34.1K
Owen Rees-Hayward retweetledi
Jeff Dean
Jeff Dean@JeffDean·
🥁Introducing Gemini 2.5, our most intelligent model with impressive capabilities in advanced reasoning and coding. Now integrating thinking capabilities, 2.5 Pro Experimental is our most performant Gemini model yet. It’s #1 on @lmarena_ai leaderboard. 🥇
Jeff Dean tweet mediaJeff Dean tweet media
English
90
299
2.6K
248.9K
Owen Rees-Hayward retweetledi
Google Cloud Partners
Google Cloud Partners@gcloudpartners·
Use partner-ready campaigns that drive results!🚀 @qodea_official leveraged the Google Cloud AI & Data Trends Report in Partner Marketing Studio to engage key decision-makers and boost website traffic, ad clicks, & qualified leads. Watch to discover how → goo.gle/3FBH8oH
GIF
English
1
1
2
195
Owen Rees-Hayward retweetledi
Bogdan Gaza
Bogdan Gaza@hurrycane·
Definitely a paradigm shift we're still learning to navigate intelligently as an industry. Knowing when and how to use AI tools effectively is becoming essential.
martin_casado@martin_casado

I was on a call a few nights ago with a number of very senior devs. And every one of them used AI to code, and found it faster than working with junior devs. It’s really worth thinking through what that means for the industry. Would love thoughts.

English
1
4
11
2.1K
Owen Rees-Hayward retweetledi
Meta Open Source
Meta Open Source@MetaOpenSource·
🔎 Meta Open Source 101🔎 Pysa is a static analysis tool designed to detect security issues in Python code. It works by analyzing data flow to identify vulnerabilities. Start by integrating Pysa into your CI/CD pipeline to add security checks. Link: pyre-check.org/docs/pysa-basi…
English
1
4
9
1.7K
Owen Rees-Hayward retweetledi
Matei Zaharia
Matei Zaharia@matei_zaharia·
Super excited about our new partnership with @AnthropicAI and native availability of Claude 3.7 Sonnet in Databricks on AWS, Azure and GCP! Stay tuned for more integrations and great support across the entire agent development and MLOps stack. sprou.tt/19muXYu3ZuR
English
3
13
115
7.3K
Owen Rees-Hayward retweetledi
Nathan Marz
Nathan Marz@nathanmarz·
Starting a new blog series next week called "Next-level backends with Rama" where each post is a detailed tutorial of using Rama for a specific use case, with all code in both Java and Clojure. The first post will be about storing and traversing graphs.
English
3
2
62
2.5K