3.3 Setting up the validation framework

Slides

Notes

Splitting the dataset with Scikit-Learn.

Classes, functions, and methods:

  • train_test_split - Scikit-Learn function for splitting a dataset into two parts. The test_size argument sets the share of rows placed in the second part, and the random_state argument sets a random seed for reproducibility.
  • df.reset_index(drop=True) - reset the index of a dataframe to 0..n-1 and discard the previous index instead of keeping it as a column.
  • df.x.values - extract the values of the x series as a NumPy array.
  • del df['x'] - delete the x column from a dataframe.
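
The calls listed above can be combined into a short sketch of a train/validation/test split. The toy dataframe, its column names (including the `churn` target), and the 60/20/20 proportions are assumptions made for illustration only; they are not taken from the project notebook.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split

# Hypothetical toy data: column names are placeholders, not the course dataset.
df = pd.DataFrame({
    "tenure": np.arange(100),
    "monthly_charges": np.linspace(20.0, 120.0, 100),
    "churn": np.arange(100) % 2,
})

# Step 1: hold out 20% of the rows as the test set.
df_full_train, df_test = train_test_split(df, test_size=0.2, random_state=1)

# Step 2: take 25% of the remaining 80% as validation (0.25 * 0.8 = 0.2 of the total).
df_train, df_val = train_test_split(df_full_train, test_size=0.25, random_state=1)

# Shuffling keeps the original row labels, so replace them with 0..n-1.
df_train = df_train.reset_index(drop=True)
df_val = df_val.reset_index(drop=True)
df_test = df_test.reset_index(drop=True)

# Extract the target as NumPy arrays.
y_train = df_train.churn.values
y_val = df_val.churn.values
y_test = df_test.churn.values

# Remove the target from the feature dataframes so it cannot leak into training.
del df_train["churn"]
del df_val["churn"]
del df_test["churn"]

print(len(df_train), len(df_val), len(df_test))
```

Splitting in two steps is needed because train_test_split only produces two parts per call; the second test_size is expressed relative to what is left after the first split.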

The entire code of this project is available in this Jupyter notebook.

⚠️ The notes are written by the community.
If you see an error here, please create a PR with a fix.
