Embarking on a 100-Day Challenge, starting 30th December.
This time, it's fundamentally different. We (@TensorThrottleX, @BinaryBlaze16, @CodeAyushD) are committing to unflinching transparency: no curated highlights, no polished outcomes. Only the raw work itself.
Day 311 : DataScience Journey
Random Forest is essentially an improved version of Decision Trees, designed to solve their biggest weakness: overfitting. A single decision tree tends to memorize the training data, which means it has low bias but high variance and performs poorly on unseen data.
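A minimal sketch of that variance gap, assuming scikit-learn and a synthetic dataset (the data and parameters here are illustrative, not from the original posts):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic, slightly noisy data so a lone tree is tempted to overfit
X, y = make_classification(n_samples=500, n_features=20, n_informative=5,
                           flip_y=0.1, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

tree = DecisionTreeClassifier(random_state=42).fit(X_train, y_train)
forest = RandomForestClassifier(n_estimators=200, random_state=42).fit(X_train, y_train)

# Train-vs-test gap: a fully grown tree memorizes the training set,
# so its gap reflects the high-variance behavior described above
tree_gap = tree.score(X_train, y_train) - tree.score(X_test, y_test)
forest_gap = forest.score(X_train, y_train) - forest.score(X_test, y_test)
```

Comparing `tree_gap` and `forest_gap` typically shows the forest generalizing better on the same split.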
Started from 0.
No audience. No support. Just consistency.
I’m building my coding YouTube channel from scratch 💻
If you believe in growth, support me here👇
youtube.com/@technicallauncher2192
One day this will be BIG 🚀
#buildinpublic #coding #startup
Day 310 : DataScience Journey
Worked through a full Titanic pipeline, from messy real-world data to a functioning Random Forest model. The interesting part wasn't just the model but the small practical issues along the way, starting with handling missing values.
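A compressed sketch of that kind of pipeline, using a tiny hand-made stand-in for the Titanic data (the column names `Age`, `Fare`, `Sex`, `Survived` are assumed from the usual Kaggle CSV; the values here are invented):

```python
import pandas as pd
from sklearn.ensemble import RandomForestClassifier

# Toy stand-in for the real Titanic CSV, with the same kinds of gaps
df = pd.DataFrame({
    "Age": [22.0, None, 26.0, 35.0, None, 54.0],
    "Fare": [7.25, 71.28, 7.92, 53.10, 8.05, 51.86],
    "Sex": ["male", "female", "female", None, "male", "male"],
    "Survived": [0, 1, 1, 1, 0, 0],
})

# Practical issue 1: missing values -> median for numbers, mode for categories
df["Age"] = df["Age"].fillna(df["Age"].median())
df["Sex"] = df["Sex"].fillna(df["Sex"].mode()[0])

# Practical issue 2: the model needs numbers, so encode the category
df["Sex"] = df["Sex"].map({"male": 0, "female": 1})

X, y = df[["Age", "Fare", "Sex"]], df["Survived"]
model = RandomForestClassifier(n_estimators=50, random_state=0).fit(X, y)
```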
Day 309 : DataScience Journey
After loading the data, the first pass focused on understanding its structure: identifying missing values, checking feature types, and observing how the features relate to the target. The main effort then went into preprocessing: removing irrelevant features and handling missing values using the median (numeric) and mode (categorical).
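That explore-then-preprocess order can be sketched with pandas alone; the columns below are hypothetical placeholders:

```python
import pandas as pd

df = pd.DataFrame({
    "PassengerId": [1, 2, 3, 4],        # irrelevant identifier
    "Age": [22.0, None, 26.0, None],    # numeric, with gaps
    "Embarked": ["S", "C", None, "S"],  # categorical, with gaps
})

# Step 1: understand the structure - feature types and missing counts
dtypes = df.dtypes
missing = df.isna().sum()

# Step 2: preprocess - drop irrelevant features, impute median / mode
df = df.drop(columns=["PassengerId"])
df["Age"] = df["Age"].fillna(df["Age"].median())
df["Embarked"] = df["Embarked"].fillna(df["Embarked"].mode()[0])
```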
Day 308 : DataScience Journey
While training models using bagging (like Random Forest), not every data point is used in building each tree.
On average, about one-third of the data is left out of a given tree's bootstrap sample; these are called out-of-bag (OOB) samples. (The chance a point is never drawn in n draws with replacement is (1 - 1/n)^n, which approaches 1/e ≈ 0.37.)
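Scikit-learn exposes this directly: with `oob_score=True`, each tree is evaluated on the samples its bootstrap draw left out, giving a free validation estimate. A minimal sketch on synthetic data:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

X, y = make_classification(n_samples=400, n_features=10, random_state=0)

# oob_score=True scores each tree on its own out-of-bag samples
forest = RandomForestClassifier(n_estimators=100, oob_score=True,
                                random_state=0).fit(X, y)
oob = forest.oob_score_  # validation estimate without a held-out set
```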
Day 307 : DataScience Journey
So a single decision tree… it tends to overfit pretty easily and can give unstable results.
Bagging basically fixes this by training a bunch of trees on slightly different parts of the data and then combining their outputs. Since each tree sees a slightly different sample, their individual errors tend to cancel out when the predictions are combined.
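A quick way to see the stabilizing effect, assuming scikit-learn and synthetic data (illustrative, not from the original post):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=20,
                           flip_y=0.1, random_state=1)

# One unstable tree vs. 100 trees on bootstrapped copies of the data
single = cross_val_score(DecisionTreeClassifier(random_state=1), X, y, cv=5).mean()
bagged = cross_val_score(
    BaggingClassifier(DecisionTreeClassifier(random_state=1),
                      n_estimators=100, random_state=1),
    X, y, cv=5).mean()
```

Comparing `single` and `bagged` on noisy data like this usually shows the averaged ensemble coming out ahead.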
Day 306 : DataScience Journey
Gradient Boosting works by building models step by step, where each new tree tries to fix the mistakes made by the previous ones.
Initially, a simple model is trained on the data. Then we calculate the residuals (errors) and train the next tree to predict those residuals, so each addition nudges the ensemble closer to the target.
Day 305 : DataScience Journey
Instead of depending on a single algorithm, we combine multiple models like Logistic Regression, Random Forest, and SVM. Each one has its own way of making mistakes, so when we combine them using a Voting Classifier, the final result becomes more reliable than any single model.
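A minimal scikit-learn sketch of exactly that combination (synthetic data; `voting="soft"` is one reasonable choice, averaging predicted probabilities rather than counting votes):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC

X, y = make_classification(n_samples=300, n_features=10, random_state=0)

# Three different algorithms, three different kinds of mistakes
voting = VotingClassifier(estimators=[
    ("lr", LogisticRegression(max_iter=1000)),
    ("rf", RandomForestClassifier(n_estimators=100, random_state=0)),
    ("svm", SVC(probability=True, random_state=0)),  # probability=True enables soft voting
], voting="soft")

voting.fit(X, y)
acc = voting.score(X, y)
```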
Day 304 : DataScience Journey
Bagging and Pasting in Scikit-Learn are powerful ensemble techniques that improve model performance by reducing variance and enhancing generalization. Instead of relying on a single model, they train multiple models (typically Decision Trees) on different random subsets of the training data: bagging samples with replacement, pasting samples without replacement.
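In scikit-learn the two differ by a single flag on `BaggingClassifier`. A minimal sketch (synthetic data, illustrative parameters):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=300, random_state=0)

# bootstrap=True -> bagging: each tree's subset is drawn WITH replacement
bagging = BaggingClassifier(DecisionTreeClassifier(), n_estimators=50,
                            max_samples=0.8, bootstrap=True, random_state=0)

# bootstrap=False -> pasting: subsets are drawn WITHOUT replacement
pasting = BaggingClassifier(DecisionTreeClassifier(), n_estimators=50,
                            max_samples=0.8, bootstrap=False, random_state=0)

bagging.fit(X, y)
pasting.fit(X, y)
```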
Day 303 : DataScience Journey
Random Patches and Random Subspaces are techniques used to increase diversity in ensemble models. Instead of training each model on the full dataset, we randomly sample either data points, features, or both. When both samples and features are randomly sampled, the method is called Random Patches; sampling only the features is called Random Subspaces.
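Both are configured through `BaggingClassifier`'s feature-sampling options. A minimal sketch, with illustrative ratios:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=300, n_features=30, random_state=0)

# Random Subspaces: every tree sees all rows but only a random half of the features
subspaces = BaggingClassifier(DecisionTreeClassifier(), n_estimators=50,
                              bootstrap=False, max_samples=1.0,
                              bootstrap_features=True, max_features=0.5,
                              random_state=0).fit(X, y)

# Random Patches: every tree sees random rows AND random features
patches = BaggingClassifier(DecisionTreeClassifier(), n_estimators=50,
                            bootstrap=True, max_samples=0.7,
                            bootstrap_features=True, max_features=0.5,
                            random_state=0).fit(X, y)
```

The fitted ensemble records which features each tree received in `estimators_features_`, which is a handy way to confirm the sampling actually happened.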