Skip to content
View andres-dfc's full-sized avatar

Block or report andres-dfc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
andres-dfc/README.md

About me

Hey there! My name is Andrés, Data Scientist and Economist. I have experience working on data analytics in the fields of Sustainable Development Goals and Infrastructure sector. As I became passionate about problem solving through data, I decided to become a Data Scientist. Through Machine Learning models, different statistical tools and data visualization, I seek valuable information through pattern recognition in specific topics.

📚 Projects

  1. Predictive Factors of Powerlifting Competition Performance (Master´s Thesis): A Logistic Regression model on R to predict athletes reaching the podium with up to 79% accuracy and 85% AUC. Optimized through threshold adjustment and feature engineering.
  2. Flight Delays project: Using a Random Forest classifier and oversampling techniques with Python, determined the variables that predicted flight delays with a 89% accuracy.
  3. Lego Sets: A visual analysis project on R to identify patterns of Lego sets from 2018-2020. Star Wars is the most important theme, as it has an important impact in most variables measured.
  4. Oncologic cases: Detecting patients with tumors with a KNN model with an 85% accuracy rate using R.
  5. Telecom company study: Through statistical inference, determined that customer leakage can be explained up to 67% by customer seniority.
  6. Data analysis of small businesses: With SQL Snowflake, extracted meaningful data of small businesses in relation to sellings, economical values, returns and countries of origin.

🛠️ Tools

  • Languages: R, Python, SQL
  • ML libraries: Caret, glmnet, scikitlearn
  • Data preprocessing: tidyr, dplyr, numpy, pandas
  • Data visualization libraries: ggplot2, matplotlib
  • Other tools: Tableau, Snowflake

👋🏻 Connect with me

Popular repositories Loading

  1. Flight-Delays-A-Random-Forest-Model Flight-Delays-A-Random-Forest-Model Public

    Jupyter Notebook

  2. Lego-Sets-An-Analysis-Project Lego-Sets-An-Analysis-Project Public

    HTML

  3. andres-dfc andres-dfc Public

  4. Predictive-Factors-of-Powerlifting-Competition-Performance Predictive-Factors-of-Powerlifting-Competition-Performance Public

    Models of proportions and machine learning to predict athletes who reach the podiums in Powerlifting competitions.

    HTML