

databricksdaily
380 posts

@databricksdaily
Your daily Databricks guide | Tips, tricks & real-world use cases | Data Engineering enthusiast | Exploring Databricks opportunities



Ever increased workers to speed up a Databricks job… and nothing changed? That’s usually a vertical scaling problem. If executors are running out of memory (big joins, caching, heavy shuffle), adding more machines won’t help. You need stronger machines → bigger node type. Scale up when the bottleneck is inside each node, not cluster size. #databricks


Before working with Databricks clusters, it helps to know that the compute underneath isn’t custom hardware it’s cloud virtual machines. Node types like Standard_DS3_v2 or Standard_E8ds_v5 are standard Azure VM sizes. Databricks simply uses them as cluster nodes, while organizations may restrict or template which ones teams can use for cost control and governance. #databricks















Analytics teams want to spend more time answering questions and less time tuning systems or tracking down costs. Recent improvements have made Databricks SQL 5x faster on average, with performance delivered automatically and no index or parameter management required. Learn how Databricks SQL delivers faster analytics with no tuning. databricks.com/blog/sql-datab…

