Similar Posts
2. Undersampling Techniques
This method works with the majority class. It reduces the number of observations from the majority class to make the data set balanced. When there exists a class that is in abundance, undersampling aims to reduce the size of the abundant class to balance the dataset. It uses all of the rare events but reduces…
Collaborative Filtering
Collaborative filtering is a system that predicts user behavior based on historical user data. From this, we can understand that this is used as a recommendation system. For example, Amazon recommends products or gives discounts based on historical user data or YouTube recommends videos based on your history. Why do we need this though? We…
3. Embedded Methods
The main goal of feature selection’s embedded method is learning which features are the best in contributing to the accuracy of the machine learning model. They have built-in penalization functions to reduce overfitting: These encompass the benefits of both the wrapper and filter methods, by evaluating interactions of features but also maintaining reasonable computational cost….
C. Recursive Feature Elimination
It is a greedy optimization algorithm that aims to find the best-performing feature subset. It repeatedly creates models and keeps aside the best or the worst performing feature at each iteration. It constructs the next model with the left features until all the features are exhausted. It then ranks the features based on the order…

Why data normalization is important for non-linear classifiers
The term “normalization” usually refers to the terms standardization and scaling. While standardization typically aims to rescale the data to have a mean of 0 and a standard deviation of 1, scaling focuses on changing the range of the values of the dataset. As mentioned in [1] and in many other articles, data normalization…
Decoding Indian Journalism on Twitter with Data
I believe that if used the right way, data can become the fifth pillar to maintain and improve our democracies. It is up to us to stand up and present concrete facts before society to keep the powerful accountable to the last person standing in the queue. With this post, we debut our upcoming series…