Simple Linear Regression Library

This project is a lightweight C library designed for simple linear regression. It provides an efficient and flexible way to perform data analysis, enabling users to model relationships between two variables with ease. The library includes functions for data normalization, dataset splitting, and model training, ensuring optimal performance even on systems with limited resources.

Key Features:

Efficiency: Optimized for high performance, delivering accurate results with minimal computational overhead.
Flexibility: Easily integrates into larger C projects, supporting various dataset formats and workflows.

Perfect for developers seeking a foundational yet robust tool for regression tasks in C.

Project Structure

Project Root Directory
|-- build
| `-- test
|-- EDA
|   |-- DataAnalysis.c
|   |-- DataAnalysis.h
|-- Regression
| |-- Linear.c
| |-- Linear.h
|-- Test
| |-- test.c
|-- compile_commands.json
|-- License
|-- makefile
|-- README.md
|-- winequality.names
|-- winequality-red.csv
`-- winequality-white.csv

How to use?

1. Include necessary Header files

Below is the example of how to use this library.

const char *filename = "your_file_name";
int main(){
   
    /* Read the Dataset */
    getFile *read_data = Read_Dataset(filename, "speicfy the Independent_var col", "specify the Dependent_var col");
    
    /* Apply Normalization */
    NormVar *normalize = Normalize(read_data->x, read_data->y, read_data->num_rows);
    
    float train_ration = 0.8, lr = 0.01, lambda1 = 0.05, lambda2 = 0.05;

    /* split the dataset */
    SplitData *split_data = Split_Dataset(normalize->X, normalize->Y, size_x, train_ratio);

    /* fitting model */
    Beta *model = Fit_Model(split_data->X_Train, split_data->Y_Train, split_data->train_size, split_data->train_size, epochs, lr, lambda1, lambda2);
    
    /* Make predictions */
    float *prediction = Prediction_Model(split_data->X_Test, split_data->test_size, *model);
    
    ...

}

The above values are normazlied so its scale in b/n 0 to 1. To Denormalize check below example,

{
    .....
    
    float *denormalize = Denormalize(prediction, normalize->y_min, normalize->y_max, split_data->test_size);
    
    .....
}

To find the models accuracy, use the following methods,

{
    ....
    
    metricResult rmse = Root_Mean_Square_Error(split_data->Y_Test, predictions, size_y);
    metricResult r_squre = R_Square(split_data->Y_Test, predictions, size_y);
    metricResult mse = Mean_Square_Error(split_data->Y_Test, predictions, size_y);
    metricResult mae = Mean_Absolute_Error(split_data->Y_Test, predictions, size_y);

    ....

}

How to Debug or Trace Memory Allocation Errors?

Use GDB for debugging and Valgrind to check for memory issues.

Debugging with GDB:

gdb ./build/test

Refer to the official GDB documentation for more details.

Memory Leak Detection with Valgrind:

valgrind --leak-check=full --track-origins=yes -s ./build/test

Refer to the official Valgrind documentation for further information.

File Descriptions

build/test: The compiled binary file generated after running the make command.
EDA/DataAnalysis.c and EDA/DataAnalysis.h: Source and header files for exploratory data analysis (EDA) utilities.
Regression/Linear.c and Regression/Linear.h: Source and header files for simple linear regression implementation.
Test/test.c: Test file to validate and demonstrate library functionality.
compile_commands.json: Compilation database for debugging tools like clangd or VSCode.
winequality-red.csv and winequality-white.csv: Example datasets for testing and demonstrating the library.

License

The project is licensed under the MIT License. Check the License file in the root directory for more details.

Name		Name	Last commit message	Last commit date
Latest commit History 74 Commits
Datasets		Datasets
EDA		EDA
Regression		Regression
Test		Test
images		images
.clang-format		.clang-format
.gitignore		.gitignore
License		License
README.md		README.md
compile_commands.json		compile_commands.json
makefile		makefile
report.txt		report.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Simple Linear Regression Library

Key Features:

Table of Contents

Project Structure

How to use?

1. Include necessary Header files

How to Debug or Trace Memory Allocation Errors?

Debugging with GDB:

Memory Leak Detection with Valgrind:

File Descriptions

License

About

Uh oh!

Releases

Packages

Languages

License

Hemanthsp999/Simple-Linear-Regression-C-Library

Folders and files

Latest commit

History

Repository files navigation

Simple Linear Regression Library

Key Features:

Table of Contents

Project Structure

How to use?

1. Include necessary Header files

How to Debug or Trace Memory Allocation Errors?

Debugging with GDB:

Memory Leak Detection with Valgrind:

File Descriptions

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages