COMP66090 PredictingPsA:

Code developed for MSc Project: Predicting the development of Psoriatic Arthritis using Feature Selection and Locally Linear Embedding

The GeneticLLEToolbox directory contains code to: apply LLE, create out-of-sample extension for new samples, adaptively select the neighbourhood, and code to efficiently calculate the genetic distance in parallel. Additionally, it contains our implementation of the MLE intrinsic dimensionality estimator introduced by [1]. All LLE code developed, is based on the Matlab Toolbox for Dimensionality Reduction (https://lvdmaaten.github.io/drtoolbox/) developed by Laurens van der Maaten [2].

The Smote directoy contains our implementation of the SMOTE [3] and BorderlineSMOTE [4] algorithms

Lastly, the ModelTraining directoy contains our code for feature selection and for training our models in cross validation & external validation, as well as the code for evaluation.

The libs directory contains third party software:

FEAST [5]: http://www.cs.man.ac.uk/~gbrown/fstoolbox/
Noguera's Fleiss Kappa Stability [6]: https://github.com/nogueirs/JMLR2018

References:

[1] Levina, Elizaveta, and Peter J. Bickel. "Maximum likelihood estimation of intrinsic dimension." Advances in neural information processing systems. 2005.

[2] Van Der Maaten, Laurens, Eric Postma, and Jaap Van den Herik. "Dimensionality reduction: a comparative." J Mach Learn Res 10.66-71 (2009): 13.

[3] Chawla, Nitesh V., et al. "SMOTE: synthetic minority over-sampling technique." Journal of artificial intelligence research 16 (2002): 321-357.

[4] Han, Hui, Wen-Yuan Wang, and Bing-Huan Mao. "Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning." International conference on intelligent computing. Springer, Berlin, Heidelberg, 2005.

[5] Brown, Gavin, et al. "Conditional likelihood maximisation: a unifying framework for information theoretic feature selection." Journal of machine learning research 13.Jan (2012): 27-66.

[6] Nogueira, Sarah, Konstantinos Sechidis, and Gavin Brown. "On the Stability of Feature Selection Algorithms." Journal of Machine Learning Research 18 (2017): 174-1.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
GeneticLLEToolbox		GeneticLLEToolbox
ModelTraining		ModelTraining
Smote		Smote
Utils		Utils
libs		libs
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

COMP66090 PredictingPsA:

References:

About

Releases

Packages

Languages

License

MHStadler/COMP66090_PredictingPsA

Folders and files

Latest commit

History

Repository files navigation

COMP66090 PredictingPsA:

References:

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages