feat: OL Crowding Heuristics Implementation #1879

cmaddox5 · 2023-09-29T14:06:32Z

Asana task: Add logging to triptych app - pt 2

Many of the changes here are refactoring so that credo doesn't complain about functions with too many parameters or too many nested conditionals. The meat of this PR is in get_instance_using_dwell_time. The gist is that we wait for the train to be stopped at the previous station. Once it is stopped, we get the prediction for that part of the trip in order to get the predicted dwell time (departure_time - arrival_time). When now is >= that time (minus 10 seconds, to give ourselves a little time before the train is supposed to leave), we want to show the widget. The original logic of the train being in transit to the current station will always take precedence. This is only meant as an extra effort to show the widget a little sooner.

Tests added?

hannahpurcell · 2023-10-11T16:23:18Z

Once it is stopped, we get the prediction for that particular part of the trip in order to get the predicted dwell time (departure_time - arrival_time). When now is >= to that time (minus 10 seconds just to give ourselves a little time before the train is supposed to leave), we want to show the widget.

Before reviewing, I have questions on the PR description: do we need to calculate the dwell time? It's just the departure time minus 10 seconds.
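The simplified check suggested here can be sketched as follows. This is a minimal illustration, not the PR's code: the module name, function name, and the `cushion` parameter are hypothetical, and only `DateTime.add/3` and `DateTime.compare/2` come from the standard library.

```elixir
defmodule TimeBasedHeuristicSketch do
  @moduledoc "Hypothetical sketch; names are not taken from the PR."

  # Seconds before the predicted departure at which the heuristic counts as met.
  @default_cushion_seconds 10

  @doc "Returns true once `now` is at or past `departure_time` minus the cushion."
  def met?(%DateTime{} = now, %DateTime{} = departure_time, cushion \\ @default_cushion_seconds) do
    threshold = DateTime.add(departure_time, -cushion, :second)
    DateTime.compare(now, threshold) in [:eq, :gt]
  end
end
```

With a departure predicted for 16:00:00, the check would be false at 15:59:49 and true from 15:59:50 onward. No arrival time is needed, which is the point of the question above.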

Also, we don't want to show the widget in this case. We want to log to Splunk to say that "heuristic #1 was met" and include the crowding level at that time. Then we'll make a Splunk dashboard to answer these questions:

...for what % of trips does that heuristic log appear BEFORE we'd normally show the widget?
...what % of the time does the crowding level change between that heuristic log and when the widget actually shows up?
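For the dashboard questions above to be answerable, each log line would need to carry at least the heuristic name, the crowding level, and a timestamp. A rough sketch of such a line, assuming a key=value format; the module, the `[ol_crowding_heuristic]` tag, and the field names are all made up for illustration and are not what the PR's logger emits:

```elixir
defmodule HeuristicLogSketch do
  @moduledoc "Hypothetical sketch; the tag and field names are assumptions."
  require Logger

  # Builds a key=value line so each field can be searched on its own.
  def message(heuristic, crowding_level, %DateTime{} = now) do
    "[ol_crowding_heuristic] heuristic=#{heuristic} " <>
      "crowding_level=#{crowding_level} logged_at=#{DateTime.to_iso8601(now)}"
  end

  # Emits the line through the standard Logger.
  def log(heuristic, crowding_level, now \\ DateTime.utc_now()) do
    Logger.info(message(heuristic, crowding_level, now))
  end
end
```

Comparing the timestamp of this line with the time the widget is normally shown for the same trip would give the "how much earlier" figure, and comparing the two crowding levels would give the "did it change" figure.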

I remember there being 2 heuristics to evaluate, and it seems like this PR only captures one. The second one is: two back-to-back readings of crowding are identical.

This might be TMI at this point, but rewriting the problem this way helped it click for me. We're trying to figure out if we can have high-confidence crowding predictions earlier than just "once the train has left a station". Can we have high-confidence predictions when:

The train is scheduled to depart a station in 10s? (first heuristic)
Two back-to-back readings of crowding were identical? (second heuristic)

Edit: Wanted to add, the task should have been written with more and clearer detail!! The task description absolutely did not capture this work well.

cmaddox5 · 2023-10-11T16:52:16Z

Before reviewing, I have questions on the PR description: do we need to calculate the dwell time? It's just the departure time minus 10 seconds.

This is definitely a case of me overthinking what we needed... That does seem exactly right so unless I run into problems with that, I will make that change.

Also, we don't want to show the widget in this case.

Guess I got too excited trying to add the widget to the screen more often 😅 . Easy to fix though.

I remember there being 2 heuristics to evaluate, and it seems like this PR only captures one. The second one is: two back-to-back readings of crowding are identical.

This reminds me that I had a concern about this heuristic at the time and completely forgot to communicate it. Are these two consecutive screen refreshes, or readings taken at some other interval in the background? I feel the two pose different challenges. With two consecutive screen refreshes, it seems like most screens will never have a chance to catch that (although if we are just logging, that's not really a big issue; it just may not be a heuristic most stations use). A background interval isn't really a problem, but it means I need to spin up a different background process to track it.

hannahpurcell · 2023-10-11T18:43:08Z

Are these for two consecutive screen refreshes or some other interval to be done in the background?

Ah, I don't remember. Maybe ask Paul H? We decided during that heuristics meeting and I cannot at all remember which way we decided

github-actions · 2023-10-13T13:49:57Z

Coverage of commit `6814b0f`

Summary coverage rate:
  lines......: 40.5% (2323 of 5731 lines)
  functions..: 41.9% (1054 of 2518 functions)
  branches...: no data found

Files changed coverage rate:
                                                                           |Lines       |Functions  |Branches    
  Filename                                                                 |Rate     Num|Rate    Num|Rate     Num
  ===============================================================================================================
  lib/screens/application.ex                                               |75.0%      4|50.0%     2|    -      0
  lib/screens/departures/departure.ex                                      | 0.0%    152| 0.0%    30|    -      0
  lib/screens/ol_crowding/agent.ex                                         |60.0%      5|75.0%     4|    -      0
  lib/screens/ol_crowding/dynamic_supervisor.ex                            |50.0%      4|66.7%     3|    -      0
  lib/screens/ol_crowding/logger.ex                                        | 0.0%     20| 0.0%     4|    -      0
  lib/screens/v2/candidate_generator/widgets/train_crowding.ex             |64.5%     76|54.5%    11|    -      0


cmaddox5 · 2023-10-16T17:20:54Z

Added an additional heuristic for consecutive crowding classes. The heuristic logic now lives in its own function, pick_heuristic. The dwell time logic is the same as before (except we look at departure_time instead of calculating it). The new logic simply logs when two consecutive predictions have the same crowding classes.

cmaddox5 · 2023-10-23T13:10:49Z

which one did you end up choosing?

2 consecutive screen refreshes. The time-based heuristic will always get priority just so we aren't forever looking for 2 consecutive classes that are the same. Each refresh that still isn't currently past that time will compare to the previous. If it's the same, log it. Otherwise, replace what is in the cache with the new prediction.
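The compare-or-replace step described above could look roughly like this. It is a sketch only: it uses a plain `Agent` as the cache, the module and function names are hypothetical, and it leaves out the time-based priority and the actual logging.

```elixir
defmodule ConsecutiveCrowdingSketch do
  @moduledoc "Hypothetical sketch; not the Agent module from the PR."

  # The Agent holds the crowding classes seen on the previous refresh (nil at first).
  def start_link(_opts \\ []), do: Agent.start_link(fn -> nil end)

  # Returns :match when this refresh repeats the previous reading. Otherwise it
  # stores the new reading in place of the old one and returns :no_match.
  def check(agent, crowding_classes) do
    Agent.get_and_update(agent, fn
      ^crowding_classes when not is_nil(crowding_classes) -> {:match, crowding_classes}
      _previous -> {:no_match, crowding_classes}
    end)
  end
end
```

A caller would log the heuristic whenever `check/2` returns `:match`. Doing the read and the write inside one `Agent.get_and_update/2` call keeps the comparison and the replacement atomic if two refreshes arrive close together.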

github-actions · 2023-10-23T13:17:44Z

Coverage of commit `f97ff2c`

Summary coverage rate:
  lines......: 39.5% (2163 of 5471 lines)
  functions..: 38.5% (873 of 2268 functions)
  branches...: no data found

Files changed coverage rate:
                                                                   |Lines       |Functions  |Branches    
  Filename                                                         |Rate     Num|Rate    Num|Rate     Num
  =======================================================================================================
  lib/screens/application.ex                                       |75.0%      4|50.0%     2|    -      0
  lib/screens/departures/departure.ex                              | 0.0%    152| 0.0%    30|    -      0
  lib/screens/ol_crowding/agent.ex                                 |60.0%      5|75.0%     4|    -      0
  lib/screens/ol_crowding/dynamic_supervisor.ex                    |50.0%      4|66.7%     3|    -      0
  lib/screens/ol_crowding/logger.ex                                | 0.0%     19| 0.0%     4|    -      0
  lib/screens/v2/candidate_generator/widgets/train_crowding.ex     |53.9%     89|46.2%    13|    -      0


cmaddox5 · 2023-10-23T14:16:12Z

After our chat in Slack, I agree that these heuristics should be independent. That has been changed.
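One way to picture "independent" here: each heuristic is evaluated on every refresh regardless of the other's result, and each one that is met gets its own log. The sketch below is an assumption about the shape of that logic, with hypothetical names and a hypothetical 10-second default, and is not the refactored code from this PR.

```elixir
defmodule IndependentHeuristicsSketch do
  @moduledoc "Hypothetical sketch; names and defaults are assumptions."

  # Evaluates both heuristics on their own and returns the names of those met.
  def met_heuristics(now, departure_time, previous_classes, current_classes, cushion \\ 10) do
    results = [
      time_based: time_based?(now, departure_time, cushion),
      consecutive: consecutive?(previous_classes, current_classes)
    ]

    # Keep only the heuristics whose check returned true.
    for {name, true} <- results, do: name
  end

  defp time_based?(now, departure_time, cushion) do
    DateTime.compare(now, DateTime.add(departure_time, -cushion, :second)) != :lt
  end

  # With no previous reading there is nothing to compare against.
  defp consecutive?(nil, _current), do: false
  defp consecutive?(previous, current), do: previous == current
end
```

Because neither check short-circuits the other, a single refresh can report both heuristics, one, or none.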

github-actions · 2023-10-23T14:21:52Z

Coverage of commit `f91f869`

Summary coverage rate:
  lines......: 39.5% (2163 of 5472 lines)
  functions..: 38.5% (873 of 2269 functions)
  branches...: no data found

Files changed coverage rate:
                                                                   |Lines       |Functions  |Branches    
  Filename                                                         |Rate     Num|Rate    Num|Rate     Num
  =======================================================================================================
  lib/screens/application.ex                                       |75.0%      4|50.0%     2|    -      0
  lib/screens/departures/departure.ex                              | 0.0%    152| 0.0%    30|    -      0
  lib/screens/ol_crowding/agent.ex                                 |60.0%      5|75.0%     4|    -      0
  lib/screens/ol_crowding/dynamic_supervisor.ex                    |50.0%      4|66.7%     3|    -      0
  lib/screens/ol_crowding/logger.ex                                | 0.0%     19| 0.0%     4|    -      0
  lib/screens/v2/candidate_generator/widgets/train_crowding.ex     |53.3%     90|42.9%    14|    -      0


hannahpurcell

Some clean-up thoughts, but I think the logic is sound 👍

lib/screens/ol_crowding/dynamic_supervisor.ex

hannahpurcell · 2023-10-23T18:13:30Z

lib/screens/v2/candidate_generator/widgets/train_crowding.ex

+      is_nil(next_train_prediction) or
+        alert_makes_this_a_terminal or
+          next_train_prediction.vehicle.carriages == [] ->
+        []


since get_instance is its own function, this can all be done with pattern matching in the function def. Optional suggestion
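As an illustration of that suggestion, the three conditions in the hunk could become separate function heads. This is a sketch under assumptions: the real get_instance takes more arguments than shown, the two-argument signature and the placeholder return value in the last clause are invented for the example.

```elixir
defmodule GetInstanceSketch do
  @moduledoc "Hypothetical sketch; the real function has a different signature."

  # No prediction for the next train: nothing to show.
  def get_instance(nil, _alert_makes_this_a_terminal), do: []

  # An alert makes this station a terminal: nothing to show.
  def get_instance(_prediction, true), do: []

  # The vehicle reports no carriages: nothing to show.
  def get_instance(%{vehicle: %{carriages: []}}, _alert_makes_this_a_terminal), do: []

  # Every other case would build the widget; a placeholder tuple stands in for it.
  def get_instance(prediction, _alert_makes_this_a_terminal) do
    [{:train_crowding, prediction}]
  end
end
```

Clause order matters: the `nil` head has to come before any head that destructures the prediction, and the catch-all goes last.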

lib/screens/v2/candidate_generator/widgets/train_crowding.ex

hannahpurcell · 2023-10-26T21:07:56Z

lib/screens/v2/candidate_generator/widgets/train_crowding.ex

+         cached_prediction,
+         common_params
+       ) do
+    show_widget_after_dt = DateTime.add(cached_prediction.departure_time, -10)


Should maybe call this var show_widget_before_dt

cmaddox5 · 2023-10-30

Well, now needs to be >= this dt. I added a comment to this var that should help. Let me know if it's still unclear.

hannahpurcell · 2023-10-31

I do think this value should be called something else, because it suggests we're logging after departure time, but really we're logging before departure time; that is, 10s before departure time. now needs to be later than that time, which is still before the vehicle has even departed.

TL;DR the value represented by this variable is pre-departure, whether by 10s, 15s, or more.

lib/screens/v2/candidate_generator/widgets/train_crowding.ex

hannahpurcell · 2023-10-26T21:13:07Z

lib/screens/v2/candidate_generator/widgets/train_crowding.ex

+         :gt
+       ] do
+      log_crowding_info(
+        :dwell,


Maybe adjust this to match the "rebrand" of the time-based heuristic. Perhaps :some_time_before_departure

cmaddox5 · 2023-10-30

Good catch. I went with :time_based because that's what I call the heuristic in code. Is that name clear or is it too general?

hannahpurcell · 2023-10-31

I'm on the fence, but I suppose we can leave it for now (until we add another time-based heuristic). But there is another comment referring to dwell time on line 284 that could use clarification.

github-actions · 2023-10-30T19:46:37Z

Coverage of commit `6a469db`

Summary coverage rate:
  lines......: 39.5% (2159 of 5471 lines)
  functions..: 38.5% (873 of 2269 functions)
  branches...: no data found

Files changed coverage rate:
                                                                   |Lines       |Functions  |Branches    
  Filename                                                         |Rate     Num|Rate    Num|Rate     Num
  =======================================================================================================
  lib/screens/application.ex                                       |75.0%      4|50.0%     2|    -      0
  lib/screens/departures/departure.ex                              | 0.0%    152| 0.0%    30|    -      0
  lib/screens/ol_crowding/agent.ex                                 |60.0%      5|75.0%     4|    -      0
  lib/screens/ol_crowding/dynamic_supervisor.ex                    |50.0%      4|66.7%     3|    -      0
  lib/screens/ol_crowding/logger.ex                                | 0.0%     19| 0.0%     4|    -      0
  lib/screens/v2/candidate_generator/widgets/train_crowding.ex     |49.4%     89|42.9%    14|    -      0


hannahpurcell · 2023-10-31T17:48:46Z

lib/screens/v2/candidate_generator/widgets/train_crowding.ex

       ) do
-    show_widget_after_dt = DateTime.add(cached_prediction.departure_time, -10)
+    # cached_prediction.departure_time minus previous_departure_time_cushion is when we expect crowding to be reliable.
+    # When now >= this time, show the widget.


Still refers to showing the widget. Perhaps instead:

Suggested change

-    # When now >= this time, show the widget.
+    # When now >= this time, log it.

hannahpurcell

Pending two other tweaks, you should be good to go

github-actions · 2023-10-31T18:23:29Z

Coverage of commit `38d213c`

Summary coverage rate:
  lines......: 39.5% (2159 of 5471 lines)
  functions..: 38.5% (873 of 2269 functions)
  branches...: no data found

Files changed coverage rate:
                                                                   |Lines       |Functions  |Branches    
  Filename                                                         |Rate     Num|Rate    Num|Rate     Num
  =======================================================================================================
  lib/screens/application.ex                                       |75.0%      4|50.0%     2|    -      0
  lib/screens/departures/departure.ex                              | 0.0%    152| 0.0%    30|    -      0
  lib/screens/ol_crowding/agent.ex                                 |60.0%      5|75.0%     4|    -      0
  lib/screens/ol_crowding/dynamic_supervisor.ex                    |50.0%      4|66.7%     3|    -      0
  lib/screens/ol_crowding/logger.ex                                | 0.0%     19| 0.0%     4|    -      0
  lib/screens/v2/candidate_generator/widgets/train_crowding.ex     |49.4%     89|42.9%    14|    -      0


* Added departure_time in epoch seconds to response. (#1911)
* feature: New widget endpoint for einks (#1909)
* Pull last deploy timestamp from config cache, add to data response
* Removed unneeded ResponseMapper
* Added frontend for bus einks
* feat: OL Crowding Heuristics Implementation (#1879)
* Added translation to logger.
* Started on heuristics work.
* Changed what value is saved in Agent.
* Fixed logic for NB triptychs.
* Tweaked shutdown logic for accuracy logger.
* Removed inspect.
* Dialyzer.
* Changed heuristic to only log, not show widget.
* Changed time we use to determine heuristic.
* Changed variable name.
* Added an additional heuristic for consecutive crowding classes.
* Added nil check.
* Refactored heuristics so they run independently.
* Improved conditional.
* Added a log.
* Improved comments.
* Made hardcoded value a parameter.
* Changed log scenario name.
* Improved var name.
* Credo.
* Addressed comments.
* feat: Mercury GL E-Ink audio (#1913)
* Added audio SSML to screen data response on gl einks with audio configured.
* Updated screens_config.
* Tweaked data so we always include the key even if there is no audio.
* Added audio column to GL & PreFare in admin table (#1916)
* Add filter to make sure ID is a number. (#1915)
* fix: Skip CSRF protection on /widget POSTs since they really fetch a new HTML page (#1918)
* fix: Skip CSRF protection on /widget POSTs since they really fetch a new HTML page
* Build browser pipeline atop browser_no_csrf pipeline
* Added suppressions for GL Surge. (#1919)
* Added suppressions for GL Surge.
* Fixed Govt Ctr alert ids.
* Added a GL surge suppression at Kenmore.
* Alert ID fix.

---------

Co-authored-by: Hannah Purcell <69368883+hannahpurcell@users.noreply.github.com>
Co-authored-by: Jon Zimbel <63608771+jzimbel-mbta@users.noreply.github.com>

cmaddox5 added 6 commits September 18, 2023 10:48

Added translation to logger.

f4106ec

Started on heuristics work.

b26f55d

Changed what value is saved in Agent.

be8b099

Fixed logic for NB triptychs.

8952396

Merge branch 'master' into cm/ol-heuristics

06de45c

Tweaked shutdown logic for accuracy logger.

fe85f4e

cmaddox5 requested review from a team and hannahpurcell and removed request for a team September 29, 2023 14:06

cmaddox5 assigned hannahpurcell Sep 29, 2023

cmaddox5 added 2 commits September 29, 2023 14:50

Removed inspect.

4d66458

Dialyzer.

8e42d35

hannahpurcell assigned cmaddox5 and unassigned hannahpurcell Oct 11, 2023

cmaddox5 assigned hannahpurcell and unassigned cmaddox5 Oct 11, 2023

cmaddox5 added 3 commits October 11, 2023 13:23

Merge branch 'master' into cm/ol-heuristics

f83d612

Changed heuristic to only log, not show widget.

08bfb09

Changed time we use to determine heuristic.

9b023ea

hannahpurcell assigned cmaddox5 and unassigned hannahpurcell Oct 11, 2023

Changed variable name.

6814b0f

cmaddox5 added 2 commits October 16, 2023 13:12

Added an additional heuristic for consecutive crowding classes.

ec08e30

Added nil check.

d060405

cmaddox5 assigned hannahpurcell Oct 16, 2023

mbta deleted a comment from github-actions bot Oct 23, 2023

Merge branch 'master' into cm/ol-heuristics

f97ff2c

Refactored heuristics so they run independently.

f91f869

hannahpurcell reviewed Oct 26, 2023


hannahpurcell assigned cmaddox5 and unassigned hannahpurcell Oct 27, 2023

cmaddox5 added 6 commits October 30, 2023 15:13

Improved conditional.

22f23d7

Added a log.

7e11be5

Improved comments.

d8ee356

Made hardcoded value a parameter.

6b68bd7

Changed log scenario name.

15db09c

Improved var name.

bf2e1f2

cmaddox5 assigned hannahpurcell and unassigned cmaddox5 Oct 30, 2023

cmaddox5 added 2 commits October 30, 2023 15:37

Merge branch 'master' into cm/ol-heuristics

bc73fc0

Credo.

6a469db

hannahpurcell reviewed Oct 31, 2023


hannahpurcell approved these changes Oct 31, 2023


hannahpurcell assigned cmaddox5 and unassigned hannahpurcell Oct 31, 2023

Addressed comments.

38d213c

cmaddox5 merged commit 0f03c50 into master Nov 7, 2023
2 checks passed

cmaddox5 deleted the cm/ol-heuristics branch November 7, 2023 14:47
