About

R program that implements K-Nearest Neighbour with K-Fold Cross Validation. Although the program analyzes "Appointment1000" dataset but can easily be tweeked for other datasets.

Discussion

The data set is divided into 10 different parts and the KNN model is fitted with the k-1 parts (training set) and predicted using 1 part (test set). Following this, the test mean square error is calculated and the above procedure is repeated 10 times for each part to be treated as a validation set. In the end, the test error rate is calculated as the average of all 10 errors. Finally for all the values of K (0-200 in this case), 10 folds are fited and error rates are determined therfore giving the optimal K with minimum error.

Plot

KNN Cross-Validation error rates against all values of K and folds

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
README.md		README.md
knn-cv.R		knn-cv.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Discussion

Plot

About

Releases

Packages

Languages

umangU/KNN-KFold

Folders and files

Latest commit

History

Repository files navigation

About

Discussion

Plot

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages