DataStalker/README.md

Hello, I'm Islam 👋

I'm a Bioinformatics Data Scientist passionate about translating complex biological data into clinical insights. My expertise lies at the intersection of biostatistics, data analysis, and genomics, with a focus on cancer research, immunotherapy, and public health.

I leverage a unique background in clinical healthcare to build robust analytical pipelines in R, uncovering biomarkers from large-scale datasets like TCGA and GEO to help advance personalized medicine.


🛠️ Core Competencies & Skills

This section lists my key technical skills. You can see them applied in my pinned repositories below.

  • Languages: R (tidyverse, ggplot2, survminer), SQL
  • Bioinformatics & Genomics:
    • TCGA & GEO Data Acquisition and Processing (TCGAbiolinks, GEOquery)
    • RNA-Seq and Gene Expression Analysis (TPM/FPKM Normalization, GSVA)
    • Biomarker Discovery and Validation
  • Statistics & Machine Learning:
    • Survival Analysis (Kaplan-Meier Curves, Log-rank Test, Cox Proportional Hazards Models)
    • Predictive Modeling (Logistic Regression)
    • Hypothesis Testing (Chi-squared Test, etc.)
  • Data Visualization & Reporting:
    • Publication-Quality Graphics (ggplot2, pheatmap)
    • Interactive Dashboards (Power BI)
    • Data Reporting (Excel, Google Sheets)

📈 My Work & Professional Focus

I specialize in conducting end-to-end data analysis projects that answer critical questions in biomedical research. My work typically involves:

  • Developing and implementing robust, reproducible analysis pipelines in R.
  • Analyzing large-scale genomic and clinical data to identify potential prognostic or predictive biomarkers.
  • Building and interpreting statistical models to assess the significance of clinical and biological variables.
  • Communicating complex findings through clear data visualizations, reports, and interactive dashboards.

👋🏻 Connect with Me


Pinned

  1. Melanoma-Metabolic-Conflict-Analysis (Public)

    End-to-end bioinformatics analysis of a biomarker in melanoma.

    R

  2. Mpox-Analysis (Public)

    This repository features visualizations of monkeypox cases and deaths from Jan 2022 to Aug 2024. It includes a choropleth map and scatter plots analyzing the distribution and trends of the disease …

    R 1

  3. Survival-Cox (Public)

    This repository contains an R script for performing survival analysis on breast cancer surgery data from the University of Chicago's Billings Hospital. The analysis includes Kaplan-Meier estimation…

    R

  4. Cholera-Spatial-Analysis (Public)

    Analysis of global cholera data (1949-2016), including trends, visualizations, and insights into case distributions and fatality rates across countries.

    R

  5. Hospital-Mortality (Public)

    A logistic regression model to predict in-hospital mortality using patient demographics, medical history, and vital signs. Built with tidymodels for efficient data preprocessing and model tra…

    R

  6. Fetal-Health-Class (Public)

    This project classifies fetal health states using cardiotocogram (CTG) data. The dataset provides various CTG metrics, which are used to build a model that categorizes fetal health into Normal, Suspect, and Patho…

    R