Skip to content
View scriptdruid's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report scriptdruid

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
scriptdruid/README.md

Vipul Rai

Senior Data Engineer | Distributed Computing
πŸ“ Amsterdam, Netherlands πŸ”— LinkedIn


Summary

Senior Data Engineer with 14+ years of experience in distributed computing, big data architecture, and DevOps automation.
Currently at Knab, I architect and deliver scalable data solutions leveraging AWS, Airflow, dbt, and Spark, optimizing ETL workflows to drive data-driven insights.
Expertise in cloud platforms, data pipeline development, and automation to ensure efficiency, security, and high performance in financial data operations.


Technical Skills

  • Big Data & Cloud: Apache Spark, Databricks, Airflow, Kafka, Azure, AWS, dbt
  • Programming & Frameworks: Python, PySpark, SQL, Django
  • DevOps & Automation: CI/CD, Terraform, Docker, Kubernetes, Pytest
  • Machine Learning & Analytics: MLOps, Feature Engineering, Predictive Analytics
  • Streaming & IoT: IoT Hub, Event Hubs, Stream Processing
  • Data Storage & Modeling: Redshift, MongoDB
  • AI & LLM Integrations: Hugging Face, OpenAI API, AI Agents

Professional Experience

Knab N.V. – Amsterdam, Netherlands

Senior Data Engineer | June 2024 – Present

  • Architecting scalable data platforms using AWS, EMR Spark, dbt, and Airflow.
  • Optimizing ELT pipelines and data ingestion workflows for enhanced system performance.
  • Collaborating with cross-functional teams to deliver high-impact financial analytics solutions.

Metyis – Amsterdam, Netherlands / Bengaluru, India

Principal - Data Engineering / Architect | Feb 2021 – May 2024

  • Led the development of a scalable IoT platform for shrimp farming operations.
  • Designed a low-latency, secure data-sharing system with IAM access controls.
  • Automated Data Science model deployments and data workflows using Airflow, Spark, and MLOps.

Data Engineering Manager | Aug 2020 – Jan 2021

  • Established DevOps and automation pipelines using Spark and Pytest, boosting deployment efficiency.
  • Provided technical leadership and mentorship, fostering a culture of innovation.

SmartNomad – Bengaluru, India

Senior Analytics Engineer | Jun 2017 – Apr 2020

  • Developed Python-based ranking algorithms for optimized selections (flights, hotels, restaurants).
  • Designed AWS-based microservices for personalized itinerary generation.
  • Built a Django backend that reduced itinerary planning time by 30%.

Activision Blizzard – Bengaluru, India

Big Data Consultant | Sep 2016 – May 2017

  • Saved $10M+ by implementing data-driven gaming analytics strategies.
  • Upgraded the tech stack to Apache Spark, boosting processing speed by 60%.
  • Developed automated QA and fraud detection models for the Call of Duty franchise.

Expedia Group – Bengaluru, India

Senior Consultant | Nov 2015 – Aug 2016

  • Optimized retail revenue strategies and pricing decisions, increasing profitability.
  • Improved search and ranking processes, enhancing customer engagement.

AIG – Bengaluru, India

Big Data Developer | Feb 2015 – Nov 2015

  • Developed predictive maintenance models using PySpark and Kafka, reducing downtime by 35%.
  • Built real-time data parsers using Apache Storm for equipment reliability analysis.

Blue Star Infotech – Bengaluru, India

Software Engineer | Aug 2013 – Feb 2015

  • Established a Hadoop cluster for large-scale data migration and processing.
  • Built a smart search recommendation engine for Amex, improving travel counselor productivity.

Certifications

  • Deep Learning Specialization
  • Academy Accreditation - Databricks Lakehouse Fundamentals
  • Microsoft Certified: Azure Data Engineer Associate (DP-203)

Education

  • Jawaharlal Nehru Technological University, Kakinada – B.Tech, Computer Science & Engineering (2012)
  • Kendriya Vidyalaya – Senior School Certificate (Class 12) (2008)
  • Navy Children School – Secondary School Examination (Class 10) (2006)

Languages

  • English – Native/Bilingual
  • Hindi – Native/Bilingual
  • Dutch – A2

Pinned Loading

  1. ml-projects ml-projects Public

    ML projects from various MOOCs

    Python 1

  2. project-on-opencv project-on-opencv Public

    All the important functionality provided by opencv

    Python 1

  3. tensorflow-projects tensorflow-projects Public

    Projects implemented using Tensorflow and Keras

    Jupyter Notebook 1

  4. hacker_rank_solutions hacker_rank_solutions Public

    Solution to hacker rank problems

    Python

  5. web-classifier-cats-dogs web-classifier-cats-dogs Public

    NN hosted as service using Flask and JS

    HTML

  6. python-snippets python-snippets Public

    This repository contains commonly used code snippets and tutorials.

    Python 4