Sales_Transactions_Dataset_Weekly
52 columns for 52 weeks; normalised values of provided too.
- | - |
---|---|
Data Set Characteristics | Multivariate, Time-Series |
Attribute Characteristics | Integer, Real |
Number of Attributes | 53 |
Number of Instances | 811 |
Associated Tasks | Clustering |
James Tan, jamestansc '@' suss.edu.sg, Singapore University of Social Sciences
Paper - Finding Similar Time Series in Sales Transaction Data
PS. Best k identify by the maximum of Calinski and Harabaz score
Model | Best K | Calinski and Harabaz score | Mean Silhouette Coefficient | Build in score |
---|---|---|---|---|
K-Means Scikit Learn (original) | 3 | 2704.5317 | 0.7503 | -686537.2554 |
K-Means Scikit Learn (normalized) | 2 | 295.0516 | 0.2464 | -2402.6697 |
K-Means From Scratch (all data) | 3 | 3191.7559 | 0.6147 | - |