Data

The original data was downloaded from Kaggle, which obtained the data set from WhiskyClassified.com.

The original data set contains 12 columns of characters or flavors, including body, sweetness, and smoky. Besides those features, there are columns for the distillery name, the postcode, and the UTM latitude and UTM longitude of each distillery. The columns look like this:

  • Distillery — Name of Distillery (String)
  • Body (Integer, range from 0–4)
  • Sweetness (Integer, range from 0–4)
  • Smoky (Integer, range from 0–4)
  • Medicinal (Integer, range from 0–4)
  • Tobacco (Integer, range from 0–4)
  • Honey (Integer, range from 0–4)
  • Spicy (Integer, range from 0–4)
  • Winey (Integer, range from 0–4)
  • Nutty (Integer, range from 0–4)
  • Malty (Integer, range from 0–4)
  • Fruity (Integer, range from 0–4)
  • Floral (Integer, range from 0–4)
  • Postcode — Zipcode of the distillery (String)
  • LatitudeUTM — Latitude in UTM format (Integer)
  • LongitudeUTM — Longitude in UTM format (Integer)


The columns Body, Sweetness, Smoky, Medicinal, Tobacco, Honey, Spicy, Winey, Nutty, Malty, Fruity, and Floral are quantified characters/flavors on a scale from 0 to 4, where 0 means none and 4 is the heaviest character/flavor. In the application, I labeled 1="Light", 2="Medium", 3="Strong", 4="Very Strong". Postcode is the UK postcode, stored as a string.
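The score-to-label mapping described above can be sketched in a few lines of Python. This is only an illustration, not the code used in the application: the names `FLAVOR_LABELS` and `label_score` and the example scores are hypothetical, and labeling 0 as "None" is taken from the description of the scale.

```python
# Labels for the 0-4 flavor scores; 0 means the flavor is absent.
FLAVOR_LABELS = {
    0: "None",
    1: "Light",
    2: "Medium",
    3: "Strong",
    4: "Very Strong",
}


def label_score(score: int) -> str:
    """Return the text label for a flavor score, rejecting out-of-range values."""
    if score not in FLAVOR_LABELS:
        raise ValueError(f"flavor score must be an integer from 0 to 4, got {score!r}")
    return FLAVOR_LABELS[score]


# Made-up scores for illustration only; they are not taken from the data set.
example_scores = {"Body": 2, "Sweetness": 3, "Medicinal": 0}
labeled = {flavor: label_score(score) for flavor, score in example_scores.items()}
print(labeled)
```

Raising an error on values outside 0 to 4 makes bad rows visible early instead of silently producing a missing label.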

In addition to the original 86-row by 12-column data set, I added three more columns:
  • Latitude in degrees
  • Longitude in degrees
  • Region (Region Classification of Whisky Distillery)


Note: The original data set contains latitude and longitude in UTM, which I found useful to convert to degrees. Latitude and longitude in degrees are very handy for visualization in Tableau.
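For readers who want to reproduce this kind of conversion, here is a minimal sketch using only the Python standard library. It is not necessarily how the columns in this data set were produced. It assumes the values are UTM easting/northing in metres on the WGS84 ellipsoid, and the default zone (30, northern hemisphere) is an assumption that the data set does not state. The function name `utm_to_degrees` is hypothetical, and the formulas are the truncated Krüger series for the inverse transverse Mercator projection.

```python
import math

# WGS84 ellipsoid and standard UTM constants.
A_WGS84 = 6378137.0            # semi-major axis in metres
F_WGS84 = 1 / 298.257223563    # flattening
K0 = 0.9996                    # UTM scale factor on the central meridian
FALSE_EASTING = 500000.0       # metres


def utm_to_degrees(easting, northing, zone=30, northern=True):
    """Convert UTM easting/northing (metres) to (latitude, longitude) in degrees.

    The zone and hemisphere defaults are assumptions, not facts about the data set.
    """
    n = F_WGS84 / (2 - F_WGS84)
    radius = A_WGS84 / (1 + n) * (1 + n**2 / 4 + n**4 / 64)
    beta = (n / 2 - 2 * n**2 / 3 + 37 * n**3 / 96,
            n**2 / 48 + n**3 / 15,
            17 * n**3 / 480)
    delta = (2 * n - 2 * n**2 / 3 - 2 * n**3,
             7 * n**2 / 3 - 8 * n**3 / 5,
             56 * n**3 / 15)

    false_northing = 0.0 if northern else 10000000.0
    xi = (northing - false_northing) / (K0 * radius)
    eta = (easting - FALSE_EASTING) / (K0 * radius)

    # Remove the series corrections to recover the conformal coordinates.
    xi_p = xi - sum(b * math.sin(2 * j * xi) * math.cosh(2 * j * eta)
                    for j, b in enumerate(beta, start=1))
    eta_p = eta - sum(b * math.cos(2 * j * xi) * math.sinh(2 * j * eta)
                      for j, b in enumerate(beta, start=1))

    # Conformal latitude, then geodetic latitude.
    chi = math.asin(math.sin(xi_p) / math.cosh(eta_p))
    lat = chi + sum(d * math.sin(2 * j * chi)
                    for j, d in enumerate(delta, start=1))

    # Longitude is measured from the central meridian of the zone.
    central_meridian = math.radians(-183 + 6 * zone)
    lon = central_meridian + math.atan2(math.sinh(eta_p), math.cos(xi_p))
    return math.degrees(lat), math.degrees(lon)


# A point on the central meridian of zone 30 (longitude 3 degrees west).
print(utm_to_degrees(500000.0, 6000000.0))
```

If the coordinates turn out to use a different grid or zone, the constants and the central meridian would need to change accordingly. A projection library would be the more robust choice for production use.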
The whisky distilleries are labeled like this:



You may find the data set here.