

Bartosz Konieczny
860 posts

@waitingforcode
Freelance Data Engineer and instructor, enjoy solving data problems with #ApacheSpark #AWS #GCP #Azure 👨🏭 | [email protected]














Ten things to understand about your database: 1) High level Architecture 2) How writes work? (Replication, data distribution, internal organisation etc) 3) How reads work? (Consistency guarantees, tuning options, etc) 4) CAP theorem, ex. CP or AP 5) Transactions and Concurrency models 6) How does it scale? 7) How are failures handled? 8) Best practices on Querying data 9) How is geo-distribution supported, so you can plan ahead in time? 10) How to optimise cost? Episodes to watch to understand the above for different databases: DynamoDB: youtu.be/ifSckJlatWE Cassandra: youtu.be/V1EO_0i3RNA CockroachDB: youtu.be/1NuvxQEoVHU General database internals: Part-1 youtu.be/DiLA0Ri6RfY and Part-2 youtu.be/IW4cpnpVg7E Realtime Analytics with Apache Pinot: youtu.be/cGTffWg2EFs Geo Distribution of databases: youtu.be/JQfnMp0OeTA CDC and Debezium: youtu.be/VGH6TlhEJpM Twisp - A ledger database: youtu.be/VGb54yNQrHM Kafka internals youtu.be/d89W_GzWnRw YugaByteDB Internals: youtu.be/cXIPIA7e220 Write ahead logging: youtu.be/yV_Zp0Mi3xs and youtu.be/2MqY_mT1vw8 B-Trees on Disk: youtu.be/dTfR0S_rBGg Graph Database Internals: youtu.be/iihJXKAQZkA ScyllaDb internals: youtu.be/AqY13RjWwJg Duckdb Internals: youtu.be/f9QlkXW4H9A RisingWave Streaming Database: youtu.be/nckuW02gI3Y Clickhouse Internals: youtu.be/sh5EBqrrwEU











