Don't request all the blocks for liveblog main page #13566

johnduffell · 2016-07-12T11:24:26Z

This PR changes the CAPI query for liveblog page 1 (the live page) and the ajax "get updates" call to only retrieve the necessary blocks, rather than the whole liveblog.

This should save a lot of bandwidth as liveblogs get bigger, and also reduce the CPU usage both in CAPI and frontend. We noticed things were particularly bad when there were many big liveblogs running after brexit. This is a problem particularly for those two endpoints because they only have a 5 second cache time. Archive pages of a live blog cache for a few minutes.

This is part 2 of the series - part 1 was #13458

@TBonnin @cb372 @JustinPinner any comments or +1s please!

TBonnin · 2016-07-13T08:57:54Z

article/app/controllers/ArticleController.scala

+  def withParser: Parser[Unit] = "with:" ^^ { _ => () }
+  def block: Parser[Unit] = "block-" ^^ { _ => () }
+  def id: Parser[String] = "[a-zA-Z0-9]+".r
+  def blockId = block ~> id


Would it make sense to make those private if they are not supposed to be exposed outside of this object?

TBonnin · 2016-07-13T10:16:48Z

Thanks for doing this @johnduffell
A lot of going on in this PR. I would be happy if more people are reviewing it.

johnduffell · 2016-07-13T11:09:10Z

thanks for picking through that one @TBonnin it can't have been fun, and more reviews are better. I did a lot of things I wasn't 100% happy with the result, I feel like the code ended up slightly more complicated in the end. I've done the updates you suggested, thanks

cb372 · 2016-07-13T11:54:08Z

I'm going to take a look at it now.

cb372 · 2016-07-13T12:03:29Z

article/app/controllers/ArticleController.scala

@@ -167,7 +194,7 @@ class ArticleController extends Controller with RendersItemResponse with Logging
      .showReferences("all")
      .showAtoms("all")

-    val capiItemWithBlocks = if (blocks) capiItem.showBlocks("body") else capiItem
+    val capiItemWithBlocks = range.map(r => capiItem.showBlocks(r.query.map(_.mkString(",")).getOrElse("body"))).getOrElse(capiItem)


Maybe split this line up for readability?

cb372 · 2016-07-13T12:33:15Z

Good stuff! I hope all this work pays off in bandwidth savings!

I'm going to work this afternoon on adding the support for "blocks created/modified since" that we discussed.

johnduffell · 2016-07-15T13:47:06Z

ok I've managed to get the tests passing, can anyone take a look and give me a +1 please? Next stage after this is to increase the cache time for "updates since block X" requests where block X isn't the latest block. That should leave only one html page and one json page with a 5 second cache time

johnduffell · 2016-07-15T14:12:30Z

after that change all the perf updates should be complete, then I can start making it use published-since to get edits as well as new blocks and also fix the timeline to update again.

TBonnin · 2016-07-15T15:41:13Z

👍 thank you very much @johnduffell

# Conflicts: # article/app/controllers/ArticleController.scala

`LiveBlogCurrentPage.findPageWithBlock()` is responsible for making sure that when a user clicks on a live-blog permalink to a **block**, eg: https://www.theguardian.com/politics/live/2022/aug/01/tory-leadership-race-rishi-sunak-lizz-truss-vote-keir-starmer-uk-politics-live?page=with:block-62e7e89d8f08730a1f7f5579#block-62e7e89d8f08730a1f7f5579 ...the user is taken to the correct 'page' of the live blog - page 3 of 5 in the example above, for instance. The code was was first introduced in January 2016 with #11700, subsequently updated with #13566, etc. The existing code compiles without warnings under Scala 2.12, but the Scala 2.13 compiler gives a warning that not _all_ conceivable cases in the pattern-match are handled: ``` frontend/common/app/model/LiveBlogCurrentPage.scala:190:12: match may not be exhaustive. [warn] It would fail on the following inputs: List(_), Nil [warn] .map { [warn] ^ ``` In this particular case, the warning's unnecessary - the preceding `.sliding(3)`: https://github.com/guardian/frontend/blob/f90c8a58e6f0941c96392d14fe27e61a4dbaf25b/common/app/model/LiveBlogCurrentPage.scala#L188 ...means that the List supplied to the case match expression will _always_ have 3 elements, and that's precisely the case that's handled - but there's no way for the compiler to know that - so we as devs need to do _something_ to make the warning go away! See also: * The Scala 2.13 upgrade PR: #25190

@unchecked

`LiveBlogCurrentPage.findPageWithBlock()` is responsible for making sure that when a user clicks on a live-blog permalink to a **block**, eg: https://www.theguardian.com/politics/live/2022/aug/01/tory-leadership-race-rishi-sunak-lizz-truss-vote-keir-starmer-uk-politics-live?page=with:block-62e7e89d8f08730a1f7f5579#block-62e7e89d8f08730a1f7f5579 ...the user is taken to the correct 'page' of the live blog - page 3 of 5 in the example above, for instance. The code was was first introduced in January 2016 with #11700, subsequently updated with #13566, etc. The existing code compiles without warnings under Scala 2.12, but the Scala 2.13 compiler gives a warning that not _all_ conceivable cases in the pattern-match are handled: ``` frontend/common/app/model/LiveBlogCurrentPage.scala:190:12: match may not be exhaustive. [warn] It would fail on the following inputs: List(_), Nil [warn] .map { [warn] ^ ``` In this particular case, the warning's unnecessary - the preceding `.sliding(3)`: https://github.com/guardian/frontend/blob/f90c8a58e6f0941c96392d14fe27e61a4dbaf25b/common/app/model/LiveBlogCurrentPage.scala#L188 ...means that the `List` supplied to the case match expression will _always_ have 3 elements, and that's precisely the case that is handled - but there's no way for the compiler to know that - so we as devs need to do _something_ to make the warning go away! Some options: * Use the @unchecked annotation (see https://www.scala-lang.org/api/2.12.7/scala/unchecked.html) This is an option of last resort - turning off compiler checks leaves us exposed to runtime errors and is generally a bad idea! * Use `collect()` rather than `map()` - this accepts a partial function, rather than a total one, so the compiler error will go away. https://www.scala-lang.org/api/2.12.7/scala/collection/immutable/Seq.html#collectFirst[B](pf:PartialFunction[A,B]):Option[B] * Refactor so that the code no longer needs to assume that a `List` type (which can have any length) has length `3`. In the end I decided to go with the refactor, because there were a few things about the existing code that could be tweaked: * Unnecessary work: The method was creating a `LiveBlogCurrentPage` for _every_ page in the liveblog, even though it only ever needed one (the single page that the block actually exists on!), and would eventually throw away the rest. * The logic around padding the front & end of the `pages` List with two `None` entries, to allow extracting the `newer` & `older` pages, totally worked but added an extra step to the code (and therefore a bit of complexity for humans to understand: "what are endedPages?") and could be replaced by just incrementing/decrementing the page index for the one `LiveBlogCurrentPage` we're now creating. * Various variables could be inlined to the point of creation on the `LiveBlogCurrentPage` case class. By using named parameters in constructing the case class, no clarity is lost, and the code is more concise. See also: * The Scala 2.13 upgrade PR: #25190

johnduffell added 6 commits July 11, 2016 16:38

change liveblogs to use capi blocks for the first page and updates

ec09e5e

tidy up liveblog code

3575f4b

don't 404 if we don't ask for the right range for articles

001bbde

fix compile error

579f290

Merge branch 'master' into liveblog-blocks

c0dbdfb

make tests compile

38a2b70

TBonnin reviewed Jul 13, 2016
View reviewed changes

tidy up unclear code

76361ef

cb372 reviewed Jul 13, 2016
View reviewed changes

johnduffell added 5 commits July 14, 2016 09:49

fix tests and minor tidyup

11bd08b

Merge branch 'master' into liveblog-blocks

837b405

Merge branch 'master' into liveblog-blocks

f1e73b1

make sure blocks are sorted

58ab38d

update pressed data for tests

195bee9

sort on first publshed not last published and test fix

a0ecfd0

johnduffell added 3 commits July 15, 2016 16:57

Merge branch 'master' into liveblog-blocks

f6b69ba

# Conflicts: # article/app/controllers/ArticleController.scala

fix up imports

70e0e08

fix up imports

cc4b620

johnduffell merged commit a8b26fa into master Jul 18, 2016

johnduffell deleted the liveblog-blocks branch July 18, 2016 08:47

DavidLawes mentioned this pull request Jan 20, 2022

AR pagination logic guardian/dotcom-rendering#3754

Merged

rtyley mentioned this pull request Aug 3, 2022

Scala 2.13: Refactor LiveBlogCurrentPage.findPageWithBlock() #25338

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Don't request all the blocks for liveblog main page #13566

Don't request all the blocks for liveblog main page #13566

johnduffell commented Jul 12, 2016 •

edited

Loading

TBonnin Jul 13, 2016

TBonnin commented Jul 13, 2016

johnduffell commented Jul 13, 2016 •

edited

Loading

cb372 commented Jul 13, 2016

cb372 Jul 13, 2016

cb372 commented Jul 13, 2016

johnduffell commented Jul 15, 2016

johnduffell commented Jul 15, 2016

TBonnin commented Jul 15, 2016

Don't request all the blocks for liveblog main page #13566

Don't request all the blocks for liveblog main page #13566

Conversation

johnduffell commented Jul 12, 2016 • edited Loading

TBonnin Jul 13, 2016

Choose a reason for hiding this comment

TBonnin commented Jul 13, 2016

johnduffell commented Jul 13, 2016 • edited Loading

cb372 commented Jul 13, 2016

cb372 Jul 13, 2016

Choose a reason for hiding this comment

cb372 commented Jul 13, 2016

johnduffell commented Jul 15, 2016

johnduffell commented Jul 15, 2016

TBonnin commented Jul 15, 2016

johnduffell commented Jul 12, 2016 •

edited

Loading

johnduffell commented Jul 13, 2016 •

edited

Loading