Electric Load and Weather variable dataset for training ML model.

romilandc/load-weather-dataset

This data was generated using forks of https://github.com/Karlheinzniebuhr/the-weather-scraper and https://github.com/jikaczmarski/caiso-data-downloader. The forks were adjusted to aggregate 5-minute interval data to hourly averages, and the result was then scrubbed using the attached preprocess.py script.
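
The hourly aggregation described above can be sketched as follows. This is a minimal standard-library illustration, not the code used in the forks; the function name `hourly_average` and the sample readings are hypothetical. With pandas, a resample to an hourly frequency followed by a mean would be the more usual route.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean


def hourly_average(readings):
    """Average (timestamp, value) readings into hourly buckets.

    `readings` is an iterable of (datetime, float) pairs at any sub-hourly
    interval. The result maps each hour start to the mean of its readings.
    """
    buckets = defaultdict(list)
    for ts, value in readings:
        # Truncate the timestamp to the start of its hour.
        hour_start = ts.replace(minute=0, second=0, microsecond=0)
        buckets[hour_start].append(value)
    return {hour: mean(values) for hour, values in sorted(buckets.items())}


# Hypothetical 5-minute readings for illustration only (not real CAISO data).
sample = [
    (datetime(2022, 1, 1, 0, 0), 100.0),
    (datetime(2022, 1, 1, 0, 5), 110.0),
    (datetime(2022, 1, 1, 0, 10), 120.0),
    (datetime(2022, 1, 1, 1, 0), 200.0),
    (datetime(2022, 1, 1, 1, 5), 220.0),
]
hourly = hourly_average(sample)
```

Hours with no readings at all simply do not appear in the result, so gaps in the source stay visible for the later cleaning step.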

Assumptions

  • Weather stations KCASANFR698, KCASANJO17, KCABAKER271, KCAELSEG23, and KCARIVER117 serve as representatives of the major load-serving areas in CAISO
  • Missing data is swapped between the San Francisco (SF) and San Jose (SJ) stations, because the two were found to have a correlation of 0.9155
  • All other missing data is of two types: daylight saving time ('2021-03-14', '2022-03-13', '2023-03-12') or missing from the source, www.wunderground.com; these values are filled in using interpolation (see preprocess.py)
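
The two gap-filling assumptions above can be sketched as follows. This is an illustration only: the function names and sample temperatures are hypothetical, and linear interpolation is an assumption, since the method actually used in preprocess.py is not stated here.

```python
def fill_from_partner(primary, partner):
    """Fill None entries in `primary` with the partner station's value for the same hour."""
    return [p if p is not None else q for p, q in zip(primary, partner)]


def interpolate_linear(values):
    """Linearly interpolate interior runs of None.

    Leading and trailing gaps have no neighbor on one side and are left as None.
    """
    filled = list(values)
    known = [i for i, v in enumerate(filled) if v is not None]
    for left, right in zip(known, known[1:]):
        step = (filled[right] - filled[left]) / (right - left)
        for i in range(left + 1, right):
            filled[i] = filled[left] + step * (i - left)
    return filled


# Hypothetical hourly temperatures; None marks a missing hour.
sf = [55.0, None, 57.0, None, None, 63.0]
sj = [56.0, 58.0, 59.0, None, None, 64.0]

# Step 1: borrow from the correlated station. Step 2: interpolate what is still missing.
sf_filled = interpolate_linear(fill_from_partner(sf, sj))
```

In this sketch the partner swap runs first, so interpolation only handles hours that are missing at both stations.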
