Posts

FEATURE ENGINEERING : SOME FAMOUS TECHNIQUES TO HANDLE MISSING VALUES

Image
SOME FAMOUS TECHNIQUES TO HANDLE MISSING VALUES Author: Bhaskar Kumar Das Introduction:                                  Humans are prone to commit errors and more importantly in many cases, errors occur not due to human negligence and faults.They occur due to variety of reasons that are beyond of human imagination, and one such frequent type of error that we encounter in Data Science is due to the presence of missing values. Missing values are generally caused when who takes/prepares data set fails to include a value or a person/system becomes unwilling to share information.( E.g it is observed that men are not likely to share info  about their salary and women are sometimes reluctant to share their ages).. So, being a data scientist, it becomes our duty to handle those missing values. We have two choices: 1) Drop those rows containing the NAN values. 2) Replace those NAN values by some value. Her...

EDA on Haberman’s Survival Data Set

Image
  Exploratory Data Analysis on Haberman's Survival Dataset Introduction: The dataset contains cases from a study that was conducted between 1958 and 1970 at the University of Chicago's Billings Hospital on the survival of patients who had undergone surgeryfor breast cancer. Dataset Attribute Information: 1) Age of patient at time of operation (numerical) 2) Patient's year of operation (year - 1900, numerical) 3) Number of positive axillary nodes detected (numerical) 4) Survival status (class attribute)         1 = the patient survived 5 years or longer         2 = the patient died within 5 year 5) Number of Instances: 306 Data Source:  https://www.kaggle.com/gilsousa/habermans-survival-data-set    This blog is in continuation with previous blog,where we have discussed several EDA techniques on Iris Dataset .If you haven't check out that blog,i would highly recommend you to visit this link  https://bhaskar47899.blogspot.com...

Exploratory Data Analysis (EDA)

Image
Exploratory Data Analysis  On Iris Dataset Author: Bhaskar Kumar Das Github:  https://github.com/BKD471   Linkedin:  https://www.linkedin.com/in/bhaskar-kumar-das-64019a168/                      Introduction:                                     Have you ever wondered ,what makes Google to offer their world class service at 0 cost ? . Google, literally offers  search, youtube ,gmail, gdrive ,playstore,G maps, Classroom,blogger and many many more without taking a single penny from customers.Have you ever wondered what makes cab providers like Uber gives you cab facility at such a low cost ? ....have you ever thought of it?. Are they mad?.The answer is no and the only reason that allows them to not only offer valuable services but also generate a huge profit of unimaginable proportion. The  weapon they use is no...