Dashboard fixes #100

jbednar · 2016-03-04T17:00:40Z

This PR adds:

Support for Castra-format files (via dask imported dynamically, thus not adding a required dependency)
Support for plotting pure counts (rows in the data, e.g. taxi trips, not associated with any particular field)
Support for plotting counts for the census data

It's also close to adding support for colorization by census race categories, but I'm not sure how to add that in a general way. Below are some trivial diffs that are sufficient to get racial color categories shown, in a hardcoded way that removes support for non-categorical data. With this PR and those diffs applied, there will initially be an error on startup because the default Field is counts in census.yml, but if the Field is then changed to Race to match the new Count Categories aggregate declared below, it should work.

So, @brendancol, can you take it from here? Can you make dashboard.py support categorical information where appropriate? I've added the race colors to the census.yml file already, but it would take me a while to figure out how to look that information up when needed, based on the "cat_colors" pointer I added to census.yml (but which could be changed if needed). There will also of course need to be some logic to switch between tf.interpolate and tf.colorize.

If we want to avoid errors for nonsensical combinations, just as for the earliest client.py file from ages ago, we'll presumably need to start declaring datatypes for these objects so that the buttons generally work rather than generally fail...

0045-jbednar:~/datashader/examples> git diff
diff --git a/examples/dashboard/dashboard.py b/examples/dashboard/dashboard.py
index 5d304de..0c32239 100644
--- a/examples/dashboard/dashboard.py
+++ b/examples/dashboard/dashboard.py
@@ -55,8 +55,8 @@ class GetDataset(RequestHandler):
                          self.model.active_axes[1],
                          self.model.active_axes[2],
                          self.model.aggregate_function(self.model.field))
-        pix = tf.interpolate(agg, (255, 204, 204), 'red',
-                             how=self.model.transfer_function)
+        color_field = 'race_colors'
+        pix = tf.colorize(agg, self.model.config[color_field])

         # serialize to image
         img_io = pix.to_bytesio()
@@ -72,6 +72,7 @@ class AppState(object):
         self.load_config_file(config_file)

         self.aggregate_functions = OrderedDict()
+        self.aggregate_functions['Count Categories'] = ds.count_cat
         self.aggregate_functions['Count'] = ds.count
         self.aggregate_functions['Mean'] = ds.mean
         self.aggregate_functions['Sum'] = ds.sum

…d refs to taxi

jbednar · 2016-03-08T03:36:39Z

It would also be good if it wouldn't re-run the entire pipeline if only the transfer fn changes. That way people can quickly try out different transfer_fns.

jbednar · 2016-03-10T22:41:30Z

@brendancol, while this is all fresh in your mind, there are some other small dashboard-related fixes in the issues list that could be addressed before you move on to other things...

jbednar · 2016-03-17T21:01:46Z

examples/dashboard/dashboard.py

@@ -28,7 +31,6 @@
 from webargs import fields


How should we handle the dashboard's dependence on webargs? Should we just state that in the README? I don't think we'd necessarily want to make webargs a dependency of datashader, at least not until conda supports optional dependencies.

It's just an example, so my opinion is that the dependencies don't matter as long as they're listed explicitly somewhere.

We do need to make it easy for people to run the example, though. After getting an error message about it, I briefly looked for webargs on conda, tried some bogus versions from non-main channels that didn't work, and eventually pip-installed webargs (which worked). Most people probably aren't that dedicated.

jbednar · 2016-03-17T21:16:26Z

Is the area highlighted by the hover tool correct? It doesn't seem to be. E.g. there's a hotspot just to the left of the blue area:

but if I hover directly over that, seemingly enclosing the hotspot in a blue box, the counts aren't particularly high:

Yet if I move the mouse down to the cell below and to the left, I get high counts indicative of a hotspot:

Does the displayed blue box need to be moved to accurately reflect the area in the hover information?

jbednar · 2016-03-17T21:18:49Z

The behavior in the corners with a large hover-box size also looks suspicious -- shouldn't the box be the same size throughout the array (with at most a pixel of rounding more or less), not cropped to a quarter the size in the corners?

jbednar · 2016-03-17T21:28:02Z

I guess this may be addressed by the proposed switch to averaging pixel values for reaggregation, but I can't quite make sense of the values for some combinations of Field and Aggregate. E.g.:

What does counting the Fare field mean? Counting how many non-empty Fare values there are? If so, presumably it's not in $; not sure what to do about that.

jbednar · 2016-03-17T21:48:22Z

Maybe report "Avg Fare ($) Count: xxxx", once it's averaging the values instead of re-aggregation? I.e., show the aggregate explicitly, not just the field?

jbednar · 2016-03-19T04:43:26Z

I added some commits for some useful bits, including out-of-core operation based on code from jcrist, but it's not quite ready to merge because the link to the census.castra file is blank (because I haven't yet heard back about options for hosting that file). It's also strange that castra has to be downloaded from a special channel; is there any way to get castra from a more public place (@jcrist?)

…egend to left-hand controls menu

jbednar · 2016-03-25T20:25:47Z

examples/dashboard/dashboard.py

+                                                      x_end=max_val,
+                                                      y_range=(0,18))
+
+                self.model.legend_vbox.children = [legend_fig]


Once it settles down a bit, it would be good to move the legend support into a function or a class, with appropriate parameters, with an eye to eventually moving it out of the dashboard.py file and into a datashader library file.

jbednar added 4 commits March 3, 2016 16:41

Added support for pure counts (no corresponding field). Fixed outdate…

f078a28

…d refs to taxi

Added support for multiple filetypes

749d600

Added data file path to startup message

5cea597

Initial version of census.yml; works for counts

4a39b97

jbednar added the in progress label Mar 4, 2016

jbednar assigned brendancol Mar 4, 2016

bcollins added 3 commits March 10, 2016 15:16

moved location of colormap to nest under summary_field

f24e96e

added checks for categorical fields and refactored aggregate type menu

26bbfaf

removed pdb

5b5db6f

added hover support

5b24f06

jbednar reviewed Mar 17, 2016
View reviewed changes

bcollins and others added 4 commits March 17, 2016 17:30

added mean aggregate downsample for hover layer

ee1bc4b

moved slider start value back to 4

a0f3cbc

Added description of census data

75cbd86

Added outofcore option and port option

70c8260

bcollins added 8 commits March 21, 2016 11:58

fixed merge conflicts

a7a77f3

added cmap back after merge

740edd6

added colorbar for numeric data

3d1f9a0

added eq_hist, spread slider, and colorbars

d82a076

resized plot to better fit 15" monitor

3a9bc99

added log x_axis_type for non-linear transfer_functions

c72e2a0

adjusted padding on ordinal colorbar, added older-style categorical l…

7ccdab3

…egend to left-hand controls menu

fixed logscale colorbar

ec16992

jbednar reviewed Mar 25, 2016
View reviewed changes

bcollins added 3 commits March 25, 2016 16:00

switched out diverging colorramps for sequential

92a6841

removed pdb

602b10e

small import cleanup

0ef5ec3

brendancol merged commit 2bcda51 into master Mar 25, 2016

brendancol deleted the dashboard-fixes branch March 25, 2016 21:09

jbednar assigned brendancol Mar 25, 2016

jbednar removed the in progress label Mar 25, 2016

jbednar mentioned this pull request Apr 6, 2016

Legend/color key support #90

Closed

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dashboard fixes #100

Dashboard fixes #100

jbednar commented Mar 4, 2016

jbednar commented Mar 8, 2016

jbednar commented Mar 10, 2016

jbednar Mar 17, 2016

jcrist Mar 17, 2016

jbednar Mar 17, 2016

jbednar commented Mar 17, 2016

jbednar commented Mar 17, 2016

jbednar commented Mar 17, 2016

jbednar commented Mar 17, 2016

jbednar commented Mar 19, 2016

jbednar Mar 25, 2016

Dashboard fixes #100

Dashboard fixes #100

Conversation

jbednar commented Mar 4, 2016

jbednar commented Mar 8, 2016

jbednar commented Mar 10, 2016

jbednar Mar 17, 2016

Choose a reason for hiding this comment

jcrist Mar 17, 2016

Choose a reason for hiding this comment

jbednar Mar 17, 2016

Choose a reason for hiding this comment

jbednar commented Mar 17, 2016

jbednar commented Mar 17, 2016

jbednar commented Mar 17, 2016

jbednar commented Mar 17, 2016

jbednar commented Mar 19, 2016

jbednar Mar 25, 2016

Choose a reason for hiding this comment