
Instrument JupyterHub to record events with jupyter_telemetry [Part II] #2698

Merged: 16 commits, Sep 24, 2019

Conversation

Zsailer (Member) commented Aug 22, 2019

This adds documentation and tests to @yuvipanda's PR in #2542.

The original instruments JupyterHub to record events about user server starts & stops using the jupyter_telemetry library.

yuvipanda and others added 10 commits, August 22, 2019 11:05:

- Introduce the EventLog class from BinderHub for emitting structured event data
- Instrument server starts and stops to emit events
- Defaults to not saving any events anywhere
- They're pure python, and should be ok (jupyterhub still supports Python 3.5)
- Full circle, since the code in jupyter_telemetry came from here: jupyter/telemetry#6
- Move it to YAML, since jupyter_telemetry supports these natively.
consideRatio (Member) commented Aug 22, 2019

@Zsailer and @yuvipanda! I'm just dropping in to say that I think this is very exciting work! I'll be very happy to see this in place! ❤️ 🎉

Zsailer (Member, Author) commented Aug 23, 2019

I'm actually puzzled by these failing tests. They pass locally, so I'm struggling to identify why they fail on Travis.

I'm investigating further, but if anyone has immediate insights I'd really appreciate it! 😃

consideRatio (Member) commented Aug 23, 2019

@Zsailer, I'm fairly confident of the following:

  • the output variable is a blank string.
  • json.loads(output) raises an error when given a blank string.

    @pytest.mark.parametrize('schema, version, event', valid_events)
    def test_valid_events(get_hub_and_sink, schema, version, event):
        hub, sink = get_hub_and_sink(schema)
        # Record event.
        hub.eventlog.record_event(schema, version, event)
        # Inspect consumed event.
        output = sink.getvalue()
        data = json.loads(output)

The question then becomes, why is sink.getvalue() returning a blank string?
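For reference, this is exactly how json.loads behaves on an empty string, which is presumably the traceback showing up on Travis:

```python
import json

# json.loads fails immediately on an empty string -- the same error the
# test hits when sink.getvalue() comes back blank.
try:
    json.loads("")
except json.JSONDecodeError as e:
    print(e)  # Expecting value: line 1 column 1 (char 0)
```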

Verification of my hypothesis

The error seen on Travis matches exactly what I get if I invoke json.loads with a blank string.

Reproducing the error locally:

(screenshot)

The Travis logs:

(screenshot)

Zsailer (Member, Author) commented Aug 23, 2019

Thanks for investigating, @consideRatio 😃

Yes, I know that json.loads returns an error when the string is blank. The weird thing is that I don't get a blank string when I run the tests locally. I get the event data, just as you'd expect/hope. On Travis, I get a blank string. I need to track down why I'm not getting the event data string on Travis 😕

Zsailer (Member, Author) commented Sep 11, 2019

Alright, I'm really stumped here. This is really weird.

The tests I've written pass locally but fail on CI. This makes it extremely difficult to debug.

I think it's an issue with MockHub instances (good ol' traitlets troubles).

The test:

  • creates a StringIO instance for collecting telemetry data
  • creates a logging handler to route event data to the StringIO
  • creates a MockHub instance with the logging handler attached
  • records an event
  • asserts the event data was written to the StringIO stream

Locally, the StringIO buffer contains the event data. On CI, the StringIO buffer is empty and fails the test.
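The shape of that test, sketched with stand-in names (this is illustrative plumbing only, not the actual MockHub/eventlog fixtures from jupyterhub's test suite):

```python
import io
import json
import logging

# A StringIO sink with a logging handler routed into it, as in the test.
sink = io.StringIO()
handler = logging.StreamHandler(sink)

logger = logging.getLogger("sketch-eventlog")
logger.setLevel(logging.INFO)
logger.addHandler(handler)

# Stand-in for record_event(): emit one JSON line through the logger.
logger.info(json.dumps({"action": "start", "username": "test-user"}))

# The assertion that fails on CI: the sink should contain the event data.
output = sink.getvalue()
data = json.loads(output)
assert data["action"] == "start"
```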

What's crazy is that I could remove other tests in other files and this test would pass.

So my hunch is that MockHub instances from other tests were not cleared and the logging handlers + StringIO were never properly configured in the MockHub. I started addressing each case where the MockHub instances were not cleared, adding other tests back in one at a time. It worked for a while but eventually failed again.

Does anyone have insight on this issue? Has anyone else run into issues where jupyterhub tests affect each other?

Zsailer requested a review from minrk, September 11, 2019 17:32
Commit: re-run init_eventlog to ensure event logging is hooked up
minrk (Member) commented Sep 19, 2019

Sorry for leaving you hanging. I'm digging into this one today. First, instead of using .instance(), you can leave that alone and use the existing app fixture to get a handle on the configured JupyterHub Application object. To modify the configuration after-the-fact, you can try taking the instantiated object and calling init_events again, e.g.

import io
import logging

from pytest import fixture

@fixture
def event_sink(app):
    sink = io.StringIO()
    handler = logging.StreamHandler(sink)
    cfg = app.config
    cfg.EventLog.handlers = [handler]
    cfg.EventLog.allowed_schemas = [schema]  # `schema` defined in the test module
    # re-initialize EventLog object to register events
    app.init_events()
    yield sink

The reason it's failing on CI and not for you is likely that CI is running all the tests and you are only testing the one file. I can reproduce the failure by running all the tests, which means it's presumably an un-cleared instance or some other state from one test to the next. I see you tried to make sure this didn't happen with .clear_instance(), but I'm guessing there was just one that was missed, or perhaps another bit of state leaking that's not the global instance.

Commit: causes some weird behavior, such as event log not working
minrk (Member) commented Sep 19, 2019

Found the state pollution, and it was weird! alembic, the database migration tool we use, configures logging when it runs (perhaps it shouldn't?). This has disable_existing_loggers=True by default, which disables all loggers when it runs. Since EventLogger is a logger, this gets disabled, but that's the wrong thing to do.
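The disabling behavior is easy to demonstrate with the stdlib alone; alembic's logging setup boils down to a call like this (shown here with dictConfig; alembic uses fileConfig, which behaves the same way):

```python
import logging
import logging.config

# A logger created *before* configuration runs -- analogous to the
# EventLogger that jupyter_telemetry builds on.
early_logger = logging.getLogger("demo-eventlog")
assert not early_logger.disabled

# dictConfig/fileConfig default to disable_existing_loggers=True: any
# pre-existing logger not named in the config gets switched off.
logging.config.dictConfig({"version": 1})
assert early_logger.disabled  # silently disabled -- the state pollution
```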

It was in test_app.py that the relevant alembic code was being triggered, btw, so the failure could be reproduced with pytest jupyterhub/tests/test_app.py jupyterhub/tests/test_eventlog.py. I found it by running pairs of test files one by one, e.g.

pytest -vx jupyterhub/tests/test_api.py jupyterhub/tests/test_eventlog.py  # success
pytest -vx jupyterhub/tests/test_app.py jupyterhub/tests/test_eventlog.py # failure, aha! One of these tests
pytest -vx jupyterhub/tests/test_app.py::test_resume_spawners jupyterhub/tests/test_eventlog.py # failure, this is the one!

I've pushed a couple commits simplifying the retrieval of the eventlog and reverting the .instance() bookkeeping that was unneeded.

yuvipanda (Contributor) commented:

AWESOME! <3

Zsailer (Member, Author) commented Sep 19, 2019

@minrk you are my hero.

But seriously, thank you so much!

It was the test_app.py and test_eventlog.py combination that broke tests for me. I just cleared all MockHub instances and eventually got all tests to pass locally, but they still failed on CI.

Just for laughs, check out the various attempts I took at solving this issue on my other branch 🤣

yuvipanda (Contributor) commented:

Does this mean this is ready to be merged?

Zsailer (Member, Author) commented Sep 19, 2019

I've got it in the JupyterHub/Binder team meeting agenda for today :)

To begin recording events, you'll need to set two configurations:

1. ``handlers``: tells the EventLog *where* to route your events. This trait is a list of Python logging handlers that route events to their destinations.
2. ``allowed_schemas``: tells the EventLog *which* events should be recorded. No events are emitted by default; all recorded events must be listed here.
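A minimal jupyterhub_config.py sketch of these two settings (the handler choice and schema name here are illustrative, not prescribed by this PR):

```python
# jupyterhub_config.py -- minimal sketch; any logging.Handler works here,
# and the schema name must match one registered with the EventLog.
import logging

c.EventLog.handlers = [
    logging.FileHandler('jupyterhub-events.log'),  # *where* events go
]
c.EventLog.allowed_schemas = [
    'hub.jupyter.org/server-action',               # *which* events to record
]
```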
Review comment (Member):

allowed_schemas

Is it possible to easily generate a list of all current schemas?

Reply (Zsailer, Member Author):

Hm... we don't have functionality for this in telemetry, but that's an interesting idea. We could add an option to get all registered schemas in jupyterhub.

Our general approach has been to force admins to whitelist all events they'd like to record, so that there's no data collected without operator knowledge.

@yuvipanda, thoughts?

On docs/source/events/index.rst:
@@ -0,0 +1 @@
.. jsonschema:: ../../../jupyterhub/event-schemas/server-actions/v1.yaml
Review comment (Member):

Can files like this be generated instead of manually maintained? Not needed immediately, but for future ideas.

Reply (Zsailer, Member Author):

Not right now. That was actually my intention in adding pydantic to telemetry. Check out my example PR on Yuvi's JupyterHub branch.

With pydantic, we get doc generation, validation, and testing for free. The events themselves are python objects.
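A rough stdlib illustration of the "events as Python objects" idea (pydantic would add validation, doc generation, and schema export on top of this; the field names are hypothetical, not the actual server-actions schema):

```python
import json
from dataclasses import asdict, dataclass

# Hypothetical event shape -- the real schema lives in
# jupyterhub/event-schemas/server-actions/v1.yaml.
@dataclass
class ServerActionEvent:
    action: str
    username: str

event = ServerActionEvent(action="start", username="test-user")
print(json.dumps(asdict(event)))  # {"action": "start", "username": "test-user"}
```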

minrk (Member) commented Sep 19, 2019

Left a couple comments on typos in docs, but 👍 to merge after that.

Zsailer (Member, Author) commented Sep 19, 2019

I think the new test failures are for reasons outside this PR.

minrk merged commit ca00c0e into jupyterhub:master on Sep 24, 2019
yuvipanda added a commit to yuvipanda/datahub-old-fork that referenced this pull request Oct 28, 2019
We want to get access to the eventlogging feature
(jupyterhub/jupyterhub#2698) and the
user-redirect customization
(jupyterhub/jupyterhub#2790?)

This is a little scary, but there is enough testing that I feel
this should be fine.