-
Notifications
You must be signed in to change notification settings - Fork 0
/
data.html
160 lines (157 loc) · 7.81 KB
/
data.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
<!DOCTYPE html>
<html>
<head>
<title>Master Project Visualizations</title>
<meta name="viewport" content="width=device-width, initial-scale=1">
<!-- CSS Lib -->
<link rel="stylesheet" href="https://maxcdn.bootstrapcdn.com/bootstrap/3.2.0/css/bootstrap.min.css">
<link rel="stylesheet" href="./lib/css/site.css">
<link rel="stylesheet" href="./lib/css/uikit.min.css">
<link rel="stylesheet" href="./lib/css/components/accordion.gradient.css">
<!-- JS Lib -->
<script src="https://code.jquery.com/jquery-2.1.0.min.js"></script>
<script src="https://maxcdn.bootstrapcdn.com/bootstrap/3.2.0/js/bootstrap.min.js"></script>
<script src="./lib/js/uikit.js"></script>
<script src="./lib/js/core/modal.min.js"></script>
<script src="./lib/js/components/accordion.min.js"></script>
</head>
<body>
<nav class="navbar navbar-default" role="navigation">
<div class="container-fluid">
<!-- Brand and toggle get grouped for better mobile display -->
<div class="navbar-header">
<button type="button" class="navbar-toggle collapsed" data-toggle="collapse" data-target="#bs-example-navbar-collapse-1">
<span class="sr-only">Toggle navigation</span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
</button>
<a class="navbar-brand" href="index.html">Master Project Visualizations</a>
</div>
<!-- Collect the nav links, forms, and other content for toggling -->
<div class="collapse navbar-collapse" id="bs-example-navbar-collapse-1">
<ul class="nav navbar-nav">
<li><a href="index.html">Home</a></li>
<li><a href="visualization.html">Visualization</a></li>
<li class="active"><a href="data.html">Data</a></li>
<li><a href="tools.html">Tools</a></li>
<li><a href="team.html">Team</a></li>
</ul>
<ul class="nav navbar-nav navbar-right"></ul>
</div><!-- /.navbar-collapse -->
</div><!-- /.container-fluid -->
</nav>
<div class="container">
<div class="row">
<div class="col-md-12">
<h1>About YouTube Data</h1>
<p style="font-size:18px;">
Datasets that we use for this project are about <a href="http://netsg.cs.sfu.ca/youtubedata/">YouTube Videos</a>. The data is collected by a webcrawler using YouTube APIs and is publically available on a web page at Simon Fraser University's website. The raw data contains the features described below.
</p>
<br />
</div>
<div class="col-md-12">
<table class="table table-bordered">
<thead>
<tr>
<th>Video ID</th>
<th>Uploader</th>
<th>Age</th>
<th>Category</th>
<th>Length</th>
<th>Views</th>
<th>Rate</th>
<th>Ratings</th>
<th>Comments</th>
<th>Related IDs</th>
</tr>
</thead>
<tr>
<th>bRPeEVpHiI8</th>
<th>ufc</th>
<th>744</th>
<th>Sports</th>
<th>154</th>
<th>331333</th>
<th>3.32</th>
<th>1284</th>
<th>103</th>
<th>"bRPeEVpHiI8,D7-oJH4TR88,etc"</th>
</tr>
</table>
</div>
<div class="col col-md-12">
<p style="font-size:18px;">
Some of them (e.g. Video ID) will be self-explanatory, but several will not be. Age refers to how old the video (not the uploader) is. The difference between Rate and Ratings is that Ratings represents how many times the video has been rated while Rate is the actual rating value (number of stars); similarly, Comments represents how many comments have been posted rather than what people have commented.
</p>
</div>
<div class="col col-md-12">
<p style="font-size:18px;">
More than 100 datasets are available on the original site. Since the duration time of the webcrawler is different, each dataset has a different size. A small one is, for example, less than 100KB, but a large one can be more than 10MB. The oldest dataset was created on February 2007, and the newest on September 2008. We are using a dataset (0.txt) created on March 2, 2007.
In order to visualize it, we preprocessed the data: Views and Rate (aka Stars) changed numerical values to categorical values. Some values of Category are also changed. The details are as follows:
</p>
<div class="row">
<div class="col col-md-4">
<h3>Views</h3>
<p><b>< 10k</b>: All videos views of which are less than 10,000 </p>
<p><b>< 50k</b>: Less than 50,000</p>
<p><b>< 100k</b>: Less than 100,000</p>
<p><b>≥ 100k</b>: Equal or more than 100,000</p>
</div>
<div class="col col-md-4">
<h3>Stars</h3>
<p>1: Rates between 0 and 0.99..</p>
<p>2: between 1 and 1.99..</p>
<p>3: between 2 and 2.99..</p>
<p>4: between 3 and 3.99..</p>
<p>5: between 4 and 5</p>
</div>
<div class="col col-md-4">
<h3>Category</h3>
<h5>'Others' contains the following categories:</h5>
<p>1. Howto & DIY</p>
<p>2. Pets & Animals</p>
<p>3. Autos & Vehicles</p>
<p>4. Gadgets & Games</p>
<p>5. Travel & Places</p>
</div>
</div>
</div>
<div class="col-md-12" style="font-size:18px;">
<h1>Findings</h1>
What we learned from the dataset using our visualization and consider to be of interest are the following three points:<br><br>
<ul>
<li>First, people rate videos more often as the number of video views increases. This is not surprising, but if the number of views exceeds 100,000, the number of ratings suddenly increases significantly.</li>
<br>
<div class="uk-accordion" data-uk-accordion="{showfirst: false}">
<button type="button" class="uk-accordion-title uk-button uk-icon-angle-down"> See Image</button>
<div class="uk-accordion-content"><img src="./lib/img/findings1.jpg"></div>
</div>
<br>
<li>Second, by contrast, this is not true for the number of comments. The number of both comments and ratings increases monotonically as the number of views increases; yet, the number of comments does not increase as much as the ratings when the views exceed 100,000. This may imply that people tend to rate only, rather than making a comment, when they watch popular videos.</li>
<br>
<div class="uk-accordion" data-uk-accordion="{showfirst: false}">
<button type="button" class="uk-accordion-title uk-button uk-icon-angle-down"> See Image</button>
<div class="uk-accordion-content"><img src="./lib/img/findings2.jpg"></div>
</div>
<br>
<li>Finally, ratings tend to be higher as the length of the video increases. In other words, people seem to prefer longer videos over shorter ones in general. Meanwhile, the length tends to decrease as videos are viewed repeatedly. Our thinking about this is because shorter videos are easier to view and to share.</li>
<br>
<div class="uk-accordion" data-uk-accordion="{showfirst: false}">
<button type="button" class="uk-accordion-title uk-button uk-icon-angle-down"> See Image</button>
<div class="uk-accordion-content"><img src="./lib/img/findings3.jpg"></div>
</div>
<br>
</ul>
</div>
</div>
</div>
<br /><br />
<footer class="well">
CS690 Master Project, Spring 2015.
<a href="http://www.cs.usfca.edu/">Computer Science Department</a>
of
<a href="http://www.usfca.edu/">University of San Francisco</a>.
</footer>
</body>
</html>