Skip to content

Forecasting Smart Meter Energy Usage using Distributed Systems and Machine Learning

License

Notifications You must be signed in to change notification settings

dongchris/Smart-Meter

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Distributed Energy Usage Forecasting

Author: Chris Dong, Lingzhi Du, Kyna Ji, Amber Song, Vanessa Zheng

"Forecasting Smart Meter Energy Usage using Distributed Systems and Machine Learning". The 16th IEEE International Conference on Smart City

Goal

The EU aims to replace at least 80% of electricity meters with smart meters by 2020. We aimed to leverage the vast amount of smart meter data and build scalable machine learning model to forecast future energy usage. This will make people more informed on energy consumption patterns and benefit energy companies management.

Project Overview

Following is our model building process:

  • Produced automated data pipeline
    • Store data into Amazon S3
    • Import from Amazon S3 to MongoDB running on AWS EC2 instance
    • Import from MongoDB to Amazon EMR cluster (YARN)
  • Preprocessed data and implemented feature engineering using Pandas and Spark SQL
  • Forecasted bi-hourly London smart meter usage one day ahead with a scalable random forest model (SparkML)
  • Implemented the model on Amazon EMR clusters
  • Optimized computational performance by tuning configurations for Yarn cluster
    • Level of parallelism
    • Caching
    • Memory settings

About

Forecasting Smart Meter Energy Usage using Distributed Systems and Machine Learning

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 66.2%
  • Shell 33.8%