All 3W Dataset data files (files in the subdirectories of the dataset directory) are licensed under the Creative Commons Attribution 4.0 International License.
Each subsection below contains release notes for a specific 3W Dataset version. Differences from the immediately previous version are highlighted.
Release: July 1, 2019.
This was the first published version, which is fully described in this paper.
Release: December 30, 2022.
Highlights:
- New instances were added as follows:
- 1 instance of event type 7.
- Instances were removed due to issues identified as follows:
- 3 instances of event type 0;
- 1 instance of event type 5;
- 3 instances of event type 8, when compared to what is described in the paper A realistic and public dataset with rare undesirable real events in oil wells published in the Journal of Petroleum Science and Engineering (link here).
- Normal periods of certain instances with anomalies were increased as possible. We tried to have instances with minimum normal periods of 1 hour;
- Names of certain files with instances have changed due to increased normal periods;
- Labels in some real instances were adjusted by experts;
- All values of some variables in some real instances were corrected due to corrections in historian systems' tag configurations;
- Certain variable values have undergone minimal change due to different rounding;
- The 3W Dataset's main configuration file (dataset.ini) was updated.
Release: April 09, 2023.
Highlights:
- Issue #60 was resolved;
- Issue #65 was resolved;
- Certain variable values have undergone minimal change due to different rounding;
- The 3W Dataset's main configuration file (dataset.ini) was updated.
Release: July 25, 2024.
Highlights:
- All instances are now saved in Parquet files (created with the
pyarrow
engine andbrotli
compression); - Reduction in disk space occupied by the 3W Dataset of 3.15 GB (from 4.89 GB to 1.74 GB);
- Real and simulated instances of type 9 were added;
- Several instances of types 0, 3, 4, 5, 6 and 8 were added;
- Another 24 real wells were covered with new real instances (now 42 real wells are covered);
- Some real instances, mainly of type 1, were removed;
- 1 variable was removed (
T-JUS-CKGL
); - Another 20 variables were added (there are now 27 variables);
- Another label referring to well operational status was added;
- Normal periods in several real instances with unwanted events were extended;
- All labeling gaps in real instances were eliminated (all observations were labeled);
- Conversions between measurement units in several instances were corrected;
- Labels in several real instances were adjusted by experts;
- All values of some variables in some real instances were corrected due to corrections in historian systems' tag configurations;
- Certain variable values have undergone minimal change due to different rounding;
- The 3W Dataset's main configuration file (dataset.ini) was updated.