Treeverse

16 posts

Treeverse

@Treeverseio

The team behind @lakeFS

เข้าร่วม Haziran 2020

7 กำลังติดตาม58 ผู้ติดตาม

Treeverse@Treeverseio·25 Haz

@samgoodwin89 @coltonpadden @lakeFS Both are available with lakeFS using lakeFS hooks. lakeFS hooks are similar to GitHub actions, just for data...

English

sam@samgoodwin89·19 Haz

@coltonpadden @lakeFS Totally, this is a great fundamental layer for versioned storage. But there’s two other aspects where I think explicit versioning helps: 1. When to re-compute data that is expensive because the code logic changed 2. Communicating published versions for downstream consumption

English

sam@samgoodwin89·19 Haz

Has semantic versioning been applied to data? Here's what I am thinking: +patch for updates without a schema change. +minor for additive changes like a new column, or narrowing a type (nullable->non-null). +major for breaking changes like removing/renaming columns.

English

819

Treeverse@Treeverseio·20 Haz

@samgoodwin89 @FunWithTheCloud @dagster You can use lakeFS for both storage and Catalog.

English

sam@samgoodwin89·20 Haz

@FunWithTheCloud Yeah. We are considering LakeFS for s3 versioning and nessie for catalog versioning. But we use @dagster to materialize our data sets and need to design a per-asset versioning strategy for optimization and it could help with publishing. I think semver would work well.

English

Treeverse@Treeverseio·10 Mar

@hellhax @AdiPolak @lakeFS @ozkatz100 @EinatOrr @datawhisp @vinodhini_sd @databricks @confluentinc @getdbt It exists for both OSS and cloud. Including a listing in Azure marketplace. Check out the documentation for more details lakefs.io

English

Wojciech Jakubowski@hellhax·10 Mar

@AdiPolak @lakeFS @ozkatz100 @EinatOrr @datawhisp @vinodhini_sd @databricks @confluentinc @getdbt Where are microsoft/azure offerings?

English

Treeverse@Treeverseio·7 Mar

@samgoodwin89 One can use lakeFS with a client, so there is no hosted gateway between you and the data. Check out lakeFS python client for example.

English

176

sam@samgoodwin89·6 Mar

There’s also lakefs which includes versioning of s3 objects and not just tables like Nessie. But it requires a hosted gateway in between you and S3 which scares me. lakefs.io

English

151

sam@samgoodwin89·6 Mar

Git-like branching, commits and tags for data lakes is starting to emerge. We are reaching a point where data lakes of any size can be managed just like version controlled code. Make a change, push a PR, preview the change, merge to main if happy. projectnessie.org

English

414

Treeverse รีทวีตแล้ว

lakeFS@lakeFS·30 Mar

Thank you to everyone who helped us reach 1,000 stars on Github!

English

Treeverse รีทวีตแล้ว

lakeFS@lakeFS·23 Şub

When it comes to your data, quality matters! Incorrect data can harm a reputation, misdirect resources, and lead to false insights and missed opportunities 🤦‍♀️. Learn how teams today test data validity and accuracy to ensure #DataQuality. lakefs.io/data-quality-t…

English

Treeverse รีทวีตแล้ว

Startup Stash@startupstash·27 Oca

With more than 45 #unicorns, Israel is one of the world's leading startup hubs in the world. Here are some of the best Israeli startups to watch in 2021! startupstash.com/israeli-startu…

English

Treeverse รีทวีตแล้ว

lakeFS@lakeFS·5 Oca

📣 We’re thrilled to announce a new integration between @Minio & lakeFS. MinIO users can now power their storage environment with Git-like operations to easily version data at scale. Check out our blog to see how easy it is: bit.ly/2LqkFP1

English

Treeverse รีทวีตแล้ว

lakeFS@lakeFS·21 Ara

Season’s greetings from the entire crew at Treeverse. Wishing you the best this coming year. Happy Holidays 🤍

English

Treeverse รีทวีตแล้ว

Einat Orr@EinatOrr·15 Ara

Why Data Versioning as an Infrastructure Matters? This is what I think: lakefs.io/data-versionin…

English

Treeverse รีทวีตแล้ว

lakeFS@lakeFS·24 Kas

Want to level-up your data lake? Join us tomorrow (Wednesday, Nov 25) at @BigDataConfEU to learn best practices and principles in data versioning for big data sets. Grab your seat: bit.ly/3q1YTkP

English

Treeverse รีทวีตแล้ว

lakeFS@lakeFS·11 Kas

Finding creative solutions to big data problems is our thing! We’re looking for passionate data enthusiasts who love all things #opensource to join our team. Open positions: - Solution Architect - Developer Advocate To learn more and apply: bit.ly/3kh8osf

English

Treeverse รีทวีตแล้ว

lakeFS@lakeFS·3 Kas

Check out this week's episode of @DataEngPodcast featuring Einat Orr and @ozkatz100 of lakeFS. 🎧 Tune in: bit.ly/2I36M86

English

Treeverse รีทวีตแล้ว

lakeFS@lakeFS·27 Eki

A data development environment contains everything required to build and deploy data intensive applications. Learn how easy it is to setup: bit.ly/2HE7Xue

English

Treeverse รีทวีตแล้ว

lakeFS@lakeFS·14 Eyl

A rare and entertaining glimpse into the everyday life of a data engineer. Don't miss out on @ozkatz100 first diary entry 👇📖 #BigData #DataEngineering lakefs.io/2020/09/14/dia…

English

Treeverse รีทวีตแล้ว

lakeFS@lakeFS·3 Ağu

It's open! Introducing lakeFS: a powerful open source platform that delivers resilience and manageability to object-storage based data lakes. Check out our new blog, and get started today lakefs.io/2020/08/03/int…

English

ค้นพบ

@samgoodwin89 @coltonpadden @lakeFS @FunWithTheCloud @dagster @hellhax @AdiPolak @ozkatz100