Fix broken CheckLiveBillRunService #841

Cruikshanks · 2024-03-20T12:30:22Z

https://eaflood.atlassian.net/browse/WATER-4379

We recently updated the CheckLiveBillRunService to support this project handling annual billing (see Handle live bill run check for annual bill runs).

We thought all was well until we spotted some strange behaviour when testing. This app wouldn't create new SROC supplementary bill runs because CheckLiveBillRunService was claiming one was in progress. But a check of the DB found that wasn't the case.

We added some extra debug logging, restarted the app and tried again ... and it worked fine! Ok, chalk it up to 'puters and their weird ways.

Till it happened again. We looked again at the source and immediately spotted a flaw. But we thought it was unrelated.

PR #761 updated the code in the service to this

  const statuses = LIVE_STATUSES

  // Only one annual bill run per region and financial year is allowed. So, we include sent and sending in the statues
  // to check for
  if (batchType === 'annual') {
    statuses.push('sent', 'sending')
  }

  const numberOfLiveBillRuns = await BillRunModel.query()
    .select(1)
    .where({
      regionId,
      toFinancialYearEnding,
      batchType,
      scheme: 'sroc'
    })
    .whereIn('status', LIVE_STATUSES)
    .resultSize()

  return numberOfLiveBillRuns !== 0

The flaw we spotted was .whereIn('status', LIVE_STATUSES). Even though we'd created a new statuses object that we were updating if the batch type is 'annual', we weren't using it in our query. That line should have been .whereIn('status', statuses).

But hang on, our unit tests were saying everything is okay and working for both batch types. Huh!? 🫤

And then we had our 'a-ha' moment. 😁

This line const statuses = LIVE_STATUSES is not copying LIVE_STATUSES to statuses. It is setting statuses and LIVE_STATUSES to be the same thing. It is essentially the same thing as Passing by reference vs passing by value.

Say I want to share a web page with you. If I tell you the URL, I'm passing by reference. You can use that URL to see the same web page I can see. If that page is changed, we both see the changes. If you delete the URL, all you're doing is destroying your reference to that page - you're not deleting the actual page itself.

If I print out the page and give you the printout, I'm passing by value. Your page is a disconnected copy of the original. You won't see any subsequent changes, and any changes that you make (e.g. scribbling on your printout) will not show up on the original page. If you destroy the printout, you have destroyed your copy of the object - but the original web page remains intact.

Our unit tests worked because the supplementary test runs before the annual. When you run the annual it causes the LIVE_STATUSES to get updated hence the query works and the test passes.

Now, you can apply this same behaviour to an environment. When the app first starts LIVE_STATUSES won't include 'sending' and 'sent'. So, supplementary bill runs will go through. Then someone creates an annual bill run. Now LIVE_STATUSES does include those values and because it is declared outside of the scope of the function, it will stay that way for subsequent calls. This means the next supplementary bill run will fail because the query will return a result.

Don't you love programming?! 🤦😧😬

This change fixes the issue.

https://eaflood.atlassian.net/browse/WATER-4379 We recently updated the `CheckLiveBillRunService` to support this project handling annual billing (see [Handle live bill run check for annual bill runs](#761)). We thought all was well until we spotted some strange behaviour when testing. This app wouldn't create new SROC supplementary bill runs because `CheckLiveBillRunService` was claiming one was in progress. But a check of the DB found that wasn't the case. We added some extra debug logging, restarted the app and tried again ... and it worked fine! Ok, chalk it up to 'puters and their weird ways. Till it happened again. We looked again at the source and immediately spotted a flaw. But we thought it was unrelated. PR #761 updated the code in the service to this ```javascript const statuses = LIVE_STATUSES // Only one annual bill run per region and financial year is allowed. So, we include sent and sending in the statues // to check for if (batchType === 'annual') { statuses.push('sent', 'sending') } const numberOfLiveBillRuns = await BillRunModel.query() .select(1) .where({ regionId, toFinancialYearEnding, batchType, scheme: 'sroc' }) .whereIn('status', LIVE_STATUSES) .resultSize() return numberOfLiveBillRuns !== 0 ``` The flaw we spotted was `.whereIn('status', LIVE_STATUSES)`. Even though we'd created a new `statuses` object that we were updating if the batch type is 'annual', we weren't using it in our query. That line should have been `.whereIn('status', statuses)`. But hang on, our unit tests were saying everything is okay and working for both batch types. Huh!? 🫤 And then we had our 'a-ha' moment. 😁 This line `const statuses = LIVE_STATUSES` is not _copying_ `LIVE_STATUSES` to `statuses`. It is _setting_ `statuses` and `LIVE_STATUSES` to be the same thing. It is essentially the same thing as [Passing by reference vs passing by value](https://stackoverflow.com/a/430958/6117745). > Say I want to share a web page with you. If I tell you the URL, I'm passing by reference. You can use that URL to see the same web page I can see. If that page is changed, we both see the changes. If you delete the URL, all you're doing is destroying your reference to that page - you're not deleting the actual page itself. > > If I print out the page and give you the printout, I'm passing by value. Your page is a disconnected copy of the original. You won't see any subsequent changes, and any changes that you make (e.g. scribbling on your printout) will not show up on the original page. If you destroy the printout, you have destroyed your copy of the object - but the original web page remains intact. Our unit tests worked because the supplementary test runs _before_ the annual. When you run the annual it is causing the `LIVE_STATUSES` to get updated hence the query works and the test passes. Now, you can apply this same behaviour to an environment. When the app first starts `LIVE_STATUSES` won't include `'sending'` and `'sent'`. So, supplementary bill runs will go through. Then someone creates an annual bill run. Now `LIVE_STATUSES` does include those values and because it is declared outside of the scope of the function, it will stay that way for subsequent calls. This means the next supplementary bill run will fail because the query will return a result. Don't you love programming! 🤦😧😬 This change fixes the issue.

Doing this our test will now fail. Whoopee!

Jozzey

https://eaflood.atlassian.net/browse/WATER-4416 https://eaflood.atlassian.net/browse/WATER-4379 We spotted an issue with removing a bill from a bill run not setting the supplementary flags on a the licences involved correctly. We've fixed that, but its led to our QA team rigorously re-creating bill runs over and over. Doing this they spotted we did have an issue with the `CheckLiveBillRunService` that we fixed in [Fix broken CheckLiveBillRunService](#841). Now they've spotted something else. `CheckLiveBillRunService` includes batch type as a filter. This means it _will_ allow you to create a supplementary bill run, for example, even if another type of bill run is 'in progress' (queued, processing, or ready). AT the time we thought that was ok but now we know better. We had actually spotted this in our work to migrate the setup bill run journey (see [Handle bill run setup matches an existing bill run](#810)). We hoped we'd be using our version of the journey by now and we could quietly retire `CheckLiveBillRunService` as part of ongoing maintenance and no one would be any the wiser (it has been live for 7 months now!) But our QA team would rather clear the issues currently found before bringing in the new setup journey for testing. So, rather than fix the service, we'll replace it with `DetermineBlockingBillRunService` which matches what the legacy service does and deals with this scenario.

https://eaflood.atlassian.net/browse/WATER-4416 https://eaflood.atlassian.net/browse/WATER-4379 We spotted an issue with removing a bill from a bill run and it not setting the supplementary flags on the licences involved correctly. We've fixed that, but it's led to our QA team rigorously re-creating bill runs over and over. Doing this they spotted we did have an issue with the `CheckLiveBillRunService` that we fixed in [Fix broken CheckLiveBillRunService](#841). Now they've spotted something else. `CheckLiveBillRunService` includes batch type as a filter. This means it _will_ allow you to create a supplementary bill run, for example, even if another type of bill run is 'in progress' (queued, processing, or ready). At the time we thought that was ok but now we know better. We spotted this in our work to migrate the setup bill run journey (see [Handle bill run setup matches an existing bill run](#810)). We hoped we'd be using our version of the journey by now and we could quietly retire `CheckLiveBillRunService` as part of ongoing maintenance and no one would be any the wiser (it has been live for 7 months now!) However, our QA team would rather clear the issues currently found before bringing in the new setup journey for testing. So, rather than fix the service, we'll replace it with `DetermineBlockingBillRunService` which matches what the legacy service does and deals with this scenario.

Cruikshanks added the bug Something isn't working label Mar 20, 2024

Cruikshanks self-assigned this Mar 20, 2024

Cruikshanks added 3 commits March 20, 2024 12:31

Copy LIVE_STATUSES

87817e9

Doing this our test will now fail. Whoopee!

And now they pass again

4463115

Housekeeping - correct and update comments

902c4c6

Cruikshanks marked this pull request as ready for review March 20, 2024 12:42

Cruikshanks requested review from Demwunz, robertparkinson, Jozzey, Beckyrose200 and rvsiyad March 20, 2024 12:42

Jozzey approved these changes Mar 20, 2024

View reviewed changes

Merge branch 'main' into fix-broken-live-bill-run-check

a1ea4c0

Cruikshanks merged commit 259be13 into main Mar 20, 2024
6 checks passed

Cruikshanks deleted the fix-broken-live-bill-run-check branch March 20, 2024 13:11

Cruikshanks mentioned this pull request Mar 20, 2024

Replace live bill run checking in engine #843

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix broken CheckLiveBillRunService #841

Fix broken CheckLiveBillRunService #841

Cruikshanks commented Mar 20, 2024

Jozzey left a comment

Fix broken CheckLiveBillRunService #841

Fix broken CheckLiveBillRunService #841

Conversation

Cruikshanks commented Mar 20, 2024

Jozzey left a comment

Choose a reason for hiding this comment