Data Eng Weekly

121 posts

Data Eng Weekly

Data Eng Weekly

@DataEngWeekly

Data Eng Weekly covers the week's top news in data engineering-related open source and cloud products. Curated by @joecrobak

加入时间 Ağustos 2015
29 关注998 粉丝
Data Eng Weekly 已转推
Adam Kawa
Adam Kawa@adam_kawa·
To all my dear #BigData colleagues! Feel invited to submit an abstract to #BigDataTechWarsaw 2020, so that we can meet in Warsaw, talk about data and have a beer 🍺 This is the conference that I co-organise :) The CfP is open until Sep 30th. medium.com/getindata-blog…
English
0
3
8
0
Data Eng Weekly
Data Eng Weekly@DataEngWeekly·
@dacort Sorry about that! Did it end up in your spam folder by any chance?
English
1
0
0
0
Data Eng Weekly 已转推
Fabian Hueske
Fabian Hueske@fhueske·
Aaand another chapter is done! 🎉 The Early Release of "Stream Processing with @ApacheFlink" was updated with a new chapter about connectors and end-to-end consistency. Only two chapters ("Setup & Configuration", "Operations") are left. I should start looking for another hobby 🤔
Fabian Hueske tweet media
English
3
39
114
0
Data Eng Weekly
Data Eng Weekly@DataEngWeekly·
Data Eng Weekly 279 ▪︎ Benchmarking Hive on MR3 ▪︎ Event Sourcing ▪︎ Efficiently writing to a db ▪︎ Jepsen for Dgraph ▪︎ PulsarIO ▪︎ Scheduling of notebooks at Netflix ... and more! dataengweekly.com/Data-Eng-Weekl…
English
0
1
5
0
Data Eng Weekly 已转推
Apache Airflow
Apache Airflow@ApacheAirflow·
Apache Airflow 1.10.0 is out ❤️🎉 !! Highlights: - New RBAC web interface in beta - First class kubernetes operator - Experimental kubernetes executor - Timezone support - Performance optimizations for large DAGs - Many GCP and S3 integration improvements - Tons of Bug Fixes
English
3
82
154
0
Data Eng Weekly 已转推
ApacheArrow
ApacheArrow@ApacheArrow·
We've just released Apache Arrow 0.10.0, the biggest release yet with 4 months of work and nearly 500 issues closed. We've added 3 new languages to the project: Go, Ruby, and Rust. Read more arrow.apache.org/blog/2018/08/0…
English
3
47
82
0
Data Eng Weekly
Data Eng Weekly@DataEngWeekly·
Data Eng Weekly Issue #274 Lots of stream processing coverage this week—Apache Kafka, Wallaroo, Apache Samza, WSO2, and Amazon SQS + a couple of posts on Kubernetes, db monitoring + 2 new books + a proposed data ethics checklist. dataengweekly.com/Data-Eng-Weekl…
English
0
1
8
0
Data Eng Weekly
Data Eng Weekly@DataEngWeekly·
ERA5 atmospheric data is now available on S3 as a public data set. Currently available from 2008 on-wards, all 9 petabytes dating back to 1950 will be released incrementally. medium.com/planet-os/era5…
English
0
0
0
0
Data Eng Weekly
Data Eng Weekly@DataEngWeekly·
Speeding up ETL in PySpark by parallelizing db access. @joaopedro.pinheiro88/how-to-increase-data-load-speed-from-database-with-pyspark-3741cccdf928" target="_blank" rel="nofollow noopener">medium.com/@joaopedro.pin…
English
0
0
5
0
Data Eng Weekly 已转推
Cloudera Community
Cloudera Community@cldrcommunity·
HDP 3.0 delivers new capabilities for the enterprise to enable agile application deployment, new #machinelearning /deep learning workloads, real-time database, & security and governance. Learn more about the enhancements, here: bit.ly/2JZms8t #BigData
English
0
6
13
0
Data Eng Weekly
Data Eng Weekly@DataEngWeekly·
Data Eng Weekly #273 is out. It was a tough one—so much great content to choose from. Coverage includes Scio, make at Propublica, Paypal's NameNode analytics, MySQL on Kubernetes, Kinesis+Lambda, data replication at Hotels.com, & much more. dataengweekly.com/Data-Eng-Weekl…
English
0
0
3
0