Wrapping on with #dezoomcamp by @DataTalksClub . This week I completed the streaming project. This was a tricky one, definitely I need to review it again.
Next week I'll start with the capstone project
This week in #dezoomcamp BY @DataTalksClub was about DW.
I was able to review differences and best practices in BigQuery, such as using partitioning and clustering.
Also, created a data pipeline that download the gz files, convert them into parquet and load them in GCS
This week in #dezoomcamp I've been learning how to use Prefect for workflow orchestration
✅ Load data into a GSC
✅ Run a deployment manually and scheduling
✅ Load data to BigQuery
✅ Run Prefect in a Docker container
✅ Enable notifications
Thanks @DataTalksClub
I started again #dezoomcamp by @DataTalksClub and this week I was able to learn about how to
✅ Set up a VM instance
✅ Install Docker
✅ Run Postgres and PgAdmin using docker compose
✅ Ingest data to Postgres
github.com/aliescont/de-z…
I found an old collection of my tweets put together in a notion doc
Topics:
🔸 Processes
🔸 ML Engineering
🔸 MLOps
🔸 Data science
🔸 Data engineering
🔸 Tools
🔸 Math
🔸 Learning ML
And other things
100+ tweets in total
Retweet and I'll DM it to you
Day 100 of #100DaysOfDeFI
Can't believe I completed the challenge! 🎉
I have summarized all I learned in a 100+ page doc so you can start with Solidity, blockchain and DeFI too!
🔖 Resources
🐍 Tutorials
👩🏻💻 Code
😊 Fun
I'll send it to you via DM in exchange for an RT+follow.
Day 59-66 R2 #66daysofdata this last week I was reviewing some old code to refactor, reviewing OOP in Python and continued reading SQL for Data Scientist.
This round was kind of different, it wasn't as expected, but I learned and had fun and finally got a job as Data Analyst
Day 56-58 R2 #66daysofdata this weekend was about updating a little bit, I played around a bit with a colab notebook for art generation and watched some videos about analog technology for ML. I found this video really interesting youtu.be/GVsUOuSjvcg
Day 55 R2 #66daysofdata I took a break, I was tired between work and interviews. Today back to SQL, reading basic concepts in this book focus on SQL for EDA and building datasets for ML
Day 54 R2 #66daysofdata today I was playing around with MongoDB using Python
✅ Find documents based on criteria using operators
✅ Count documents
✅ Used dot notation
Day 51-53 R2 #66daysofdata I was busy in the last few days but managed to do a little each day
✅ Started to study AWS fundamentals
✅ Watched videos about DBT
✅ Reviewed ETL vs ELT
Day 50 R2 #66daysofdata today I was reviewing some old code for an ETL pipeline to load the data in a Data Lake in AWS and reviewing some basics of Pyspark
Day 49 R2 #66daysofdata today I went to a in person workshop about Airflow, the first one in a long time. Some things didn't work as expected, but it was fun!
Day 48 R2 #66daysofdata reviewing some basic concepts of working in shell I found this book that seems really interesting Data Science at the Command Line
Day 41-47 R2 #66daysofdata Last week was working on a classification mini project using Logistic Regression and Adaboost algorithm and keep practicing SQL and Python basics
Day 40 R2 #66daysofdata tweaking a little bit my model, playing around with feature selection to improve my score. I'm dealing with an unbalanced dataset, so I'll need to do more tests