Baselight

Breast Cancer Dataset

Binary Classification Prediction for type of Breast Cancer

@kaggle.yasserh_breast_cancer_dataset

About this Dataset

Description:

Breast cancer is the most common cancer among women worldwide. It accounts for 25% of all cancer cases and affected over 2.1 million people in 2015 alone. It starts when cells in the breast begin to grow out of control. These cells usually form tumors that can be seen on an X-ray or felt as lumps in the breast area.

The key challenge in its detection is classifying tumors as malignant (cancerous) or benign (non-cancerous). We ask you to complete the analysis of classifying these tumors using machine learning (with SVMs) and the Breast Cancer Wisconsin (Diagnostic) Dataset.
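As a rough starting point for the SVM classification described above, here is a minimal sketch. It assumes scikit-learn is installed and uses the copy of the Breast Cancer Wisconsin (Diagnostic) data that ships with scikit-learn rather than the Kaggle CSV, whose column names are not shown on this page. The split ratio, kernel, and `C` value are illustrative choices, not recommendations from the dataset author.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# scikit-learn's bundled Breast Cancer Wisconsin (Diagnostic) data:
# 569 samples, 30 numeric features; target 0 = malignant, 1 = benign.
X, y = load_breast_cancer(return_X_y=True)

# Hold out a stratified test set so both classes keep their proportions.
# (Assumption: a 75/25 split with a fixed seed, chosen for reproducibility.)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# SVMs are sensitive to feature scale, so standardize before fitting.
# The scaler is fit on the training data only, inside the pipeline.
model = make_pipeline(StandardScaler(), SVC(kernel="rbf", C=1.0, gamma="scale"))
model.fit(X_train, y_train)

accuracy = accuracy_score(y_test, model.predict(X_test))
print(f"Test accuracy: {accuracy:.3f}")
```

Wrapping the scaler and the classifier in one pipeline keeps test data from leaking into the scaling statistics, which matters when the same pipeline is later passed to cross-validation.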

Acknowledgements:

This dataset was sourced from Kaggle.

Objective:

  • Understand the dataset and clean it up (if required).
  • Build classification models to predict whether the cancer type is malignant or benign.
  • Fine-tune the hyperparameters and compare the evaluation metrics of various classification algorithms.
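To illustrate the last objective, comparing evaluation metrics across classifiers, the following sketch computes accuracy, precision, recall, and F1 in plain Python. The function names, the `"M"`/`"B"` label encoding, and the two prediction lists are hypothetical and made up for illustration; they are not results from this dataset.

```python
def confusion_counts(y_true, y_pred, positive="M"):
    """Return (tp, fp, fn, tn), treating `positive` as the positive class."""
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive:
            if truth == positive:
                tp += 1
            else:
                fp += 1
        elif truth == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def evaluate(y_true, y_pred, positive="M"):
    """Compute accuracy, precision, recall, and F1 for one set of predictions."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred, positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Toy labels, invented for illustration only (M = malignant, B = benign).
y_true = ["M", "M", "M", "B", "B", "B", "B", "B"]
predictions = {
    "model_a": ["M", "M", "B", "B", "B", "B", "B", "B"],  # misses one malignant case
    "model_b": ["M", "M", "M", "M", "B", "B", "B", "B"],  # raises one false alarm
}

results = {name: evaluate(y_true, preds) for name, preds in predictions.items()}
for name, scores in results.items():
    print(name, {metric: round(value, 3) for metric, value in scores.items()})
```

In this toy case both models reach the same accuracy, yet only one of them finds every malignant tumor. That is why recall on the malignant class is usually worth reporting alongside accuracy for a diagnostic task.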
