Treeverse

16 posts

@Treeverseio

The team behind @lakeFS

Joined June 2020
7 Following, 58 Followers
sam (@samgoodwin89)
@coltonpadden @lakeFS Totally, this is a great fundamental layer for versioned storage. But there are two other aspects where I think explicit versioning helps: 1. Deciding when to re-compute expensive data because the code logic changed. 2. Communicating published versions for downstream consumption.
sam (@samgoodwin89)
Has semantic versioning been applied to data? Here's what I am thinking: +patch for updates without a schema change. +minor for additive changes like a new column, or narrowing a type (nullable->non-null). +major for breaking changes like removing/renaming columns.
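The bump rules proposed in the post above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the schema shape (column name mapped to a type name and a nullable flag) and the helper names `classify_change` and `bump` are hypothetical, and treating any type change or a non-null to nullable widening as breaking is an added assumption that the post does not spell out.

```python
from typing import Dict, Tuple

# Hypothetical schema shape: column name -> (type name, nullable flag).
Column = Tuple[str, bool]
Schema = Dict[str, Column]


def classify_change(old: Schema, new: Schema) -> str:
    """Return 'major', 'minor', or 'patch' for a change from old to new."""
    # A removed (or renamed) column breaks downstream readers.
    if any(name not in new for name in old):
        return "major"

    minor = False
    for name, (old_type, old_nullable) in old.items():
        new_type, new_nullable = new[name]
        if new_type != old_type:
            return "major"  # assumption: any type change is breaking
        if not old_nullable and new_nullable:
            return "major"  # assumption: non-null -> nullable is breaking
        if old_nullable and not new_nullable:
            minor = True  # narrowing: nullable -> non-null

    # Additive change: at least one new column.
    if any(name not in old for name in new):
        minor = True

    return "minor" if minor else "patch"


def bump(version: str, change: str) -> str:
    """Apply a 'major', 'minor', or 'patch' bump to a MAJOR.MINOR.PATCH string."""
    major, minor, patch = (int(part) for part in version.split("."))
    if change == "major":
        return f"{major + 1}.0.0"
    if change == "minor":
        return f"{major}.{minor + 1}.0"
    return f"{major}.{minor}.{patch + 1}"
```

A data refresh with an unchanged schema classifies as a patch, so `bump("1.2.3", classify_change(schema, schema))` yields `1.2.4`, while adding a column moves the same version to `1.3.0`.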
sam (@samgoodwin89)
@FunWithTheCloud Yeah. We are considering lakeFS for S3 versioning and Nessie for catalog versioning. But we use @dagster to materialize our datasets and need to design a per-asset versioning strategy for optimization, which could also help with publishing. I think semver would work well.
Treeverse (@Treeverseio)
@samgoodwin89 You can use lakeFS through a client, so there is no hosted gateway between you and the data. Check out the lakeFS Python client, for example.
sam (@samgoodwin89)
There’s also lakeFS, which versions S3 objects and not just tables like Nessie. But it requires a hosted gateway between you and S3, which scares me. lakefs.io
sam (@samgoodwin89)
Git-like branching, commits and tags for data lakes are starting to emerge. We are reaching a point where data lakes of any size can be managed just like version-controlled code. Make a change, push a PR, preview the change, merge to main if happy. projectnessie.org
Treeverse retweeted
lakeFS (@lakeFS)
Thank you to everyone who helped us reach 1,000 stars on GitHub!
Treeverse retweeted
lakeFS (@lakeFS)
When it comes to your data, quality matters! Incorrect data can harm a reputation, misdirect resources, and lead to false insights and missed opportunities 🤦‍♀️. Learn how teams today test data validity and accuracy to ensure #DataQuality. lakefs.io/data-quality-t…
Treeverse retweeted
Startup Stash (@startupstash)
With more than 45 #unicorns, Israel is one of the world's leading startup hubs. Here are some of the best Israeli startups to watch in 2021! startupstash.com/israeli-startu…
Treeverse retweeted
lakeFS (@lakeFS)
📣 We’re thrilled to announce a new integration between @Minio & lakeFS. MinIO users can now power their storage environment with Git-like operations to easily version data at scale. Check out our blog to see how easy it is: bit.ly/2LqkFP1
Treeverse retweeted
lakeFS (@lakeFS)
Season’s greetings from the entire crew at Treeverse. Wishing you the best this coming year. Happy Holidays 🤍
Treeverse retweeted
Einat Orr (@EinatOrr)
Why Data Versioning as an Infrastructure Matters? This is what I think: lakefs.io/data-versionin…
Treeverse retweeted
lakeFS (@lakeFS)
Want to level-up your data lake? Join us tomorrow (Wednesday, Nov 25) at @BigDataConfEU to learn best practices and principles in data versioning for big data sets. Grab your seat: bit.ly/3q1YTkP
Treeverse retweeted
lakeFS (@lakeFS)
Finding creative solutions to big data problems is our thing! We’re looking for passionate data enthusiasts who love all things #opensource to join our team. Open positions:
- Solution Architect
- Developer Advocate
To learn more and apply: bit.ly/3kh8osf
Treeverse retweeted
lakeFS (@lakeFS)
A data development environment contains everything required to build and deploy data-intensive applications. Learn how easy it is to set up: bit.ly/2HE7Xue
Treeverse retweeted
lakeFS (@lakeFS)
It's open! Introducing lakeFS: a powerful open source platform that delivers resilience and manageability to object-storage-based data lakes. Check out our new blog, and get started today: lakefs.io/2020/08/03/int…