Skip to content

Latest commit

 

History

History
59 lines (49 loc) · 2.21 KB

README.md

File metadata and controls

59 lines (49 loc) · 2.21 KB

cross_validation_custom

Two custom ways of calculating cross_val_score. Please note that X and y should be pandas.DataFrame and pandas.Series respectively.

cross_validation_score_statement(estimator,X,y,scoring,n_splits=5,statement=None,random_state=0):

Evaluate a score by cross-validation. 
The fit method will be performed on the entire train subset at each iteration,
the predict method and scoring will be performed only for objects from test subset where statement is True

Parameters
----------
estimator : estimator object implementing 'fit' and 'predict'
    The object to use to fit the data.
X : pandas.DataFrame
    The data to fit.
y : pandas.Series
    The target variable to try to predict.
scoring : callable 
    The scoring function of signature scoring(y_true,y_pred).
statement : boolean numpy.array of shape equal to y.shape
    The mask showing the objects we want to evaluate estimator on.
n_splits : int
    Number of folds for cross-validation
random_state : int
    Random_state for KFold and StratifiedKFold    

Returns
-----------
scores : array of float, shape=(n_splits,)

cross_validation_score_fit_subset(estimator,X,y,scoring,n_splits=5,statement=None,random_state=0):

Evaluate a score by cross-validation. 
The fit method will be performed on the subset of train subset at each iteration where statement is True.
The predict method and scoring will be performed on the entire test subset.

Parameters
----------
estimator : estimator object implementing 'fit' and 'predict'
    The object to use to fit the data.
X : pandas.DataFrame
    The data to fit.
y : pandas.Series
    The target variable to try to predict.
scoring : callable 
    The scoring function of signature scoring(y_true,y_pred).
statement : boolean numpy.array of shape equal to y.shape
    The mask showing the objects we want to fit estimator on.
n_splits : int
    Number of folds for cross-validation
random_state : int
    Random_state for KFold and StratifiedKFold    

Returns
-----------
scores : array of float, shape=(n_splits,)