Classification of Data Using Decision Tree and Random Forest Based on Three Different Criteria

C++ implementation of Decision Trees and Random Forests for classification of Insurance Dataset

We build decision trees and random forests for a insurance dataset, evaluating it for various experiments . Dataset taken from : https://archive.ics.uci.edu/ml/datasets/Insurance+Company+Benchmark+%28COIL+2000%29

HOW TO RUN :

Go to the folder :

   cd Final

Compile the program by entering the following command :

   g++  -o ID3 ID3.cpp

Run the executable by entering the following command :

   ./ID3  ticdata2000.txt  experiment_no

ticdata2000.txt contains the dataset for creating the tree.

Press enter to print the output.
Please refer to the Results and Conclusion file to see the final results of all the experiments.

Experiments :

We vary the "stopping criteria" that prevents further splitting of node. Changes in accuracy and complexity of model are observed.

Add noise to the dataset and evaluate the accuracy of the model along with the change in its complexity (number of nodes)

Perform "Reduced Error Pruning" on the tree and measure the change in accuracy of the tree.

Create a random forest using "Feature Bagging" approach where we select a subset of features, make multiple trees, and take majority vote for the result.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
Final		Final
Main file		Main file
60IJAERS-04202057-Iterative.pdf		60IJAERS-04202057-Iterative.pdf
BSSE1431 mid presentation of SPL-1.pptx		BSSE1431 mid presentation of SPL-1.pptx
BST_assignment2.c		BST_assignment2.c
Binarytree_assignment1.c		Binarytree_assignment1.c
Classification+with+Tree+based+Models.pdf		Classification+with+Tree+based+Models.pdf
Cross validation and decision tree.pdf		Cross validation and decision tree.pdf
Cross validation.txt		Cross validation.txt
CrossValidation.c		CrossValidation.c
Decision_tree_using_Entropy.c		Decision_tree_using_Entropy.c
Entropy&InformationGain.c		Entropy&InformationGain.c
Final Report.pdf		Final Report.pdf
Gini_index.c		Gini_index.c
Hellinger_distance&InformationGain.c		Hellinger_distance&InformationGain.c
Hellinger_distance.c		Hellinger_distance.c
Hellinger_distance.exe		Hellinger_distance.exe
Information_gain_using_Gini_index.c		Information_gain_using_Gini_index.c
Main copy.cpp		Main copy.cpp
Main.cpp		Main.cpp
README.md		README.md
SPL1 Project Proposal Form.docx		SPL1 Project Proposal Form.docx
SPL1 Project Proposal Form.pdf		SPL1 Project Proposal Form.pdf
a.exe		a.exe
bsse1431 Software Project Lab-01 final presentation.pptx		bsse1431 Software Project Lab-01 final presentation.pptx
gini.txt		gini.txt
hellinger_distance.pdf		hellinger_distance.pdf
iris.data		iris.data
resources.txt		resources.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Classification of Data Using Decision Tree and Random Forest Based on Three Different Criteria

HOW TO RUN :

Experiments :

About

Releases

Packages

Languages

pronobkarmoker/SPL-1

Folders and files

Latest commit

History

Repository files navigation

Classification of Data Using Decision Tree and Random Forest Based on Three Different Criteria

HOW TO RUN :

Experiments :

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages