Our Latest Articles

  • 10 datasets for beginners
    As a beginner, learning Machine Learning and Data Science can be a mountain of a task. Thankfully, there are a few datasets that help you build confidence and hone your skills! Here are 10 datasets that I think are suited for beginners – 1. Beginner’s Classification Dataset It’s as…
  • 10 awesome ML datasets that deserve your attention!
    A list of not-so-popular machine learning datasets that are highly useful yet underrated
  • Mean, Median, and Mode (now with Python!)
    Mean, median, and mode are the most commonly used measures of central tendency in descriptive statistics. Everyone learns them in school… so I hope you paid attention back in your school days, because they’ll be very relevant in this article. But even if you didn’t, don’t worry, that’s what I am…
  • Analysis of Covid-19 in India and its reasons
    The COVID-19 pandemic was one of the worst things ever to happen to India. It caused over 3 million deaths throughout the nation and left a mark on the economy that will take a few years to heal. COVID-19 The COVID-19 pandemic, also known as the coronavirus pandemic, is an…
  • Ridge Regression (now with interactive graphs!!!)
    So… Ridge Regression is a modified version of Linear Regression and a classic example of regularization using an L2 penalty. So to learn about Ridge Regression, you have to make sure you understand Linear Regression. If you don’t, then click here. If you don’t know what Gradient Descent is, then click here. It…
  • Gradient Descent (now with a little bit of scary maths)
    Buckle up, Buckaroo, because Gradient Descent is gonna be a long one (and a tricky one too). The whole article will be a lot more “mathy” than most articles, as it tries to cover the concepts behind a Machine Learning algorithm called Linear Regression. If you don’t know what Linear Regression is, go…
  • A simple review of Term Frequency – Inverse Document Frequency
    TF-IDF is short for Term Frequency-Inverse Document Frequency. It is a vectorization technique used in the field of Natural Language Processing. Yes, I know, it is a daunting-looking phrase, but trust me, it’s a lot simpler than it sounds. Uses of TF-IDF Natural Language Processing or NLP is the…
  • A review of MNIST Dataset and its variations
    MNIST, short for Modified National Institute of Standards and Technology, is a dataset consisting of images showing handwritten digits from 0 to 9 (both inclusive). You can find the link to the official dataset here – MNIST You can find the link to the dataset on Kaggle here – Kaggle…
  • Everything you need to know about Reinforcement Learning
    The phrase “Reinforcement Learning” could sound a little intimidating at first, but when we break it down, it’s actually quite simple. Let’s start with the phrase itself. What does Reinforce mean? No, don’t get googling already! I’ll tell you. It simply means to strengthen or support something. So Reinforcement Learning…
  • The statistical analysis t-test explained for beginners and experts
    Over the last few months, I’ve probably run the t-test dozens of times, but recently I realized that I did not fully understand some concepts, such as why it is not possible to accept the null hypothesis or where the numbers in the t-tables come from. After doing some research, I…