April 10, 2020
Goal We aim to explore the options when we have highly skewed dataset to prepare the model and best performance meteric to be used for evaluation and prepare a model comparison report.
Work Plan:
What performance metric we should choose and Why? How to deal with highly skewed dataset - Undersampling , Oversampling , SMOTE Perform Precision-Recall trade off Go to Github source code