Movie Sorter

by Leo Scarano and Jarisha Olanday

Summary

It is very useful in many cases to sort large sets of data. One good example of this is sorting a large set of movie data. Movies have several attributes involved, allowing for lots of data be analyzed, mined, sorted and distributed throughout wide arrays of applications. This program, in short, will take in movie data from a .csv file (most typically associated with Microsoft Office Excel) and sort the file based on a certain type of data.

Usage

Compiling and linking necessary files is made simple using the "make" command in terminal:

$make

Running the project is accomplished by outputting the data into standard input:

$cat file.csv | ./sorter -c col_to_sort

Implementation

The following list contains each file and what it is used for in the project:

Preprocessor - Header Files

sorter.h
mergesort.h
intCompare.h
stringCompare.h
floatCompare.h

C files

sorter.c

get_sort_col_info()
get_col_type()

Movie* mov

fgets()

fprintf()

mergesort.c

n*log(n)

n = # of elements to be sorted

log(n)

merge()

n

intCompare.c

mergeSort()

stringCompare.c

mergeSort()

floatCompare.c

mergeSort()

Abstraction

What is most beneficial about Movie Sorter is the capacity at which data can be sorted. The method signature for the merge-sort implementation is as follows:

void mergeSort(void* Arr, int arrSize, int structSize,
int Compare(void*, void*, int, int),
void Assign(void*, void*, int, int));

Since the arrays are passed into the merge-sort function are void pointers, it allows for a list of any type to be sorted. Since dereferencing void pointers in C is not allowed, the data must be down-casted to a type that can be referenced in order to perform the comparisons needed. This is done with the intCompare.c, stringCompare.c, and floatCompare.c files, which contain methods that cast the void pointers appropriately so they can be properly referenced. With this implementation, one can hypothetically create compare functions for any type of structure, thus adding a layer of "abstraction." Parameterized types are more commonly known as generics in languages such as Java, C# and others.

Imlementing merge-sort for various types of data is loosely expressed as:

mergeSort(mov, size-1, typetSize,  typeCompare, typeAssign);

Data is Beautiful

It is also very useful to analyze data we are processing (in this case, sorting). We have implemented simple checks throughout the document to determine if any of the data is "outlier" data. For an example, if any of the data being sorted is close to a small number, or exceeds an arbitrarily large number, it is saved in array for further analysis. If a user wanted to adjust the values of the outlier data, they can simply change the values of the checks in the compare functions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Movie Sorter

Summary

Usage

Implementation

Preprocessor - Header Files

C files

Abstraction

Data is Beautiful

Files

README.md

Latest commit

History

README.md

File metadata and controls

Movie Sorter

Summary

Usage

Implementation

Preprocessor - Header Files

C files

Abstraction

Data is Beautiful