End-to-End Machine Learning Project Part-3

I did not understand the whole program

  1. What is stratified?

  2. what is shuffle split?

  3. Stratified sampling using sciket-learn’s stratified shuffle split

Please tell me the steps.

please tell me the program at number 3

please reply.

I think this is explain in part-2. Please see at this time (2:06:36)

Not sure if that is a standard term but probably is being used for splitting the data after shuffling.

First you have create a strata (basically a column) on the basis of which it stratified sampling would happen. In the session, We have created a column “income_cat” and then did the split. Please see the code below: