Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AMP Cache Pages are staying stale for a long time #19988

Closed
nickmoline opened this issue Dec 19, 2018 · 5 comments
Closed

AMP Cache Pages are staying stale for a long time #19988

nickmoline opened this issue Dec 19, 2018 · 5 comments

Comments

@nickmoline
Copy link

nickmoline commented Dec 19, 2018

What's the issue?

Google's AMP Cache (cdn.ampproject.org) is not refreshing cache reliably on our pages on https://www.justia.com/lawyers. Pages are cached well beyond (in some cases for weeks) even with multiple attempts to use the update-cache mechanism and multiple requests to the page.

We noticed this issue yesterday and when doing investigation (including discussing on slack with @jpettitt) some urls will eventually get recached, but other urls do not. So far of dozens of pages that we noticed were stale, we’ve only seen 4 of them eventually get recached.

When viewing the amp cache urls for the pages, the Last-Modified: header shows a clear indication of when the page was cached. Our pages are updated frequently and it is important that our changes can actually be seen by visitors.

How do we reproduce the issue?

Here is a small sampling of urls at the time of this posting have outdated content on the AMP Cache (we have tons):

What we've tried to resolve (to no avail)

  • Deprecated update-ping method
  • Newer update-cache method
  • Enforcing a Cache-Control: max-age=30 header to indicate that our pages need to expire quickly
  • Attempting to use Google Search Console's Fetch as Google feature on both the source html page and the amp url in hopes to force the page getting recached

Other Notes:

  • AMP Pages are properly validating
  • @jpettitt on slack suggested that we may be hitting a rate limit based on our crawl rate at google. We checked our crawl rate was 2.7/sec in GSC which Googlebot + Google-AMPHTML was not exceeding that rate. In hopes that it might help, we've increased the crawl rate in GSC to the maximum available (10/sec)

What browsers are affected?

All browsers

@nickmoline
Copy link
Author

Hello,

Just an update here. At approximately 10:30 PST (06:30 UTC) last night, we discovered that several of the pages (including the ones here) suddenly updated. Upon further investigation it seems that the behavior has changed and we are now getting the expected behavior of AMP. We visit a page, and check a couple minutes later and see that the timestamp has updated to sometime between the time we visited and the current time). At the moment, things appear to be working normally again, but this problem did exist and the dates we gave above were accurate as of the time we posted.

We are not certain at this time what fixed our problem (but we are certainly glad it is fixed).

As I mentioned in our original ticket, we submitted a request in Google Search Console to increase our crawl rate and it said that request would take a couple days to take effect. If that is what solved our issue, hopefully the issue won't come back when our crawl rate change expires.

@jgluk
Copy link

jgluk commented Dec 20, 2018

I'm glad to see the issue resolved itself. I am also internally tracking this for follow up investigation.
However, maybe we can close this specific issue in the meantime? (Since we have the internal tracking)

@nickmoline
Copy link
Author

Hi @jgluk

As the issue is not actively ongoing at this moment, I think that it is fine to close the github issue and rely on your internal tracking, however we would greatly appreciate it if you could let us know if your follow up investigation finds anything, especially if it finds something that we need to be proactively aware of to make sure this doesn't happen again in the future

@jgluk
Copy link

jgluk commented Dec 20, 2018

Yes, we will let you know if there is an action you can take on your side as a result of our investigation.

@nickmoline
Copy link
Author

Closing in interim per your suggestion, we look forward to finding out what caused this though.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants