
Switch to cache_resource for Document Index #54

Merged
merged 1 commit into from
May 20, 2024

Conversation

JoepdeJong
Contributor

This PR fixes the issue of a missing _model attribute on the Index after it is loaded from cache.

It seems that a different caching decorator is better suited here.

As described in the Streamlit caching docs (https://docs.streamlit.io/develop/concepts/architecture/caching):

> st.cache_resource is the recommended way to cache global resources like ML models or database connections – unserializable objects that you don't want to load multiple times. Using it, you can share these resources across all reruns and sessions of an app without copying or duplication. Note that any mutations to the cached return value directly mutate the object in the cache (more details below).
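The difference the docs describe can be sketched in plain Python. The two decorators below are simplified stand-ins for Streamlit's behavior, not its actual implementation: `cache_data` hands back a deserialized copy on every rerun, while `cache_resource` hands back the same object every time.

```python
import pickle

def cache_data(fn):
    """Stand-in for st.cache_data: store a pickled snapshot, return a fresh copy per call."""
    store = {}
    def wrapper(*args):
        if args not in store:
            store[args] = pickle.dumps(fn(*args))
        return pickle.loads(store[args])  # deserialization produces a new object
    return wrapper

def cache_resource(fn):
    """Stand-in for st.cache_resource: store the object itself, return the same one every call."""
    store = {}
    def wrapper(*args):
        if args not in store:
            store[args] = fn(*args)
        return store[args]
    return wrapper

@cache_data
def load_copy(name):
    return {"name": name}

@cache_resource
def load_shared(name):
    return {"name": name}

assert load_copy("idx") is not load_copy("idx")  # a new copy on each rerun
assert load_shared("idx") is load_shared("idx")  # one shared instance across reruns
```

Under the copy-based semantics, anything that does not survive serialization (such as a private handle to a loaded model) is lost in the returned copy, which is the failure mode this PR addresses.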

Closes #53

@JoepdeJong JoepdeJong changed the title Swich to cache_resource for Document Index Switch to cache_resource for Document Index May 20, 2024
@jonfairbanks jonfairbanks self-assigned this May 20, 2024
@jonfairbanks
Owner

Thank you for this PR! Caching has definitely been a headache here.

@jonfairbanks jonfairbanks merged commit a7a6808 into jonfairbanks:develop May 20, 2024
@jonfairbanks
Owner

jonfairbanks commented May 24, 2024

Actually since we are using _documents here, the underscore tells Streamlit to not cache that particular resource. Removing the underscore will result in an error from Streamlit.

I'll merge this up to the main branch but technically nothing is being cached in this function.

@JoepdeJong
Contributor Author

> Actually since we are using _documents here, the underscore tells Streamlit to not cache that particular resource. Removing the underscore will result in an error from Streamlit.
>
> I'll merge this up to the main branch but technically nothing is being cached in this function.

Placing an underscore in front of a parameter to exclude it from caching is, as far as I know, only needed because other parameters must be hashable (https://docs.streamlit.io/develop/concepts/architecture/caching#excluding-input-parameters).

Since cache_resource does not create a copy but returns the same value every time, no hashing of the return value is required for this decorator. From the docs (https://docs.streamlit.io/develop/concepts/architecture/caching#behavior-1):

> Not creating a copy means there's just one global instance of the cached return object, which saves memory, e.g. when using a large ML model. In computer science terms, we create a singleton.

This should also explain why _model is missing when using @st.cache_data.
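The underscore rule itself can also be sketched in plain Python. This is a simplified model of Streamlit's behavior (not its actual implementation): parameters whose names start with `_` are left out of the cache key, so they never need to be hashable. The function name `build_index` and its parameters are hypothetical.

```python
import functools
import inspect

def cache_resource(fn):
    """Simplified model of Streamlit's rule: parameters whose names start
    with "_" are excluded from the cache key, so they may be unhashable."""
    store = {}
    sig = inspect.signature(fn)
    keyed = [name for name in sig.parameters if not name.startswith("_")]

    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        bound = sig.bind(*args, **kwargs)
        # Only non-underscore parameters contribute to the cache key.
        key = tuple(bound.arguments.get(name) for name in keyed)
        if key not in store:
            store[key] = fn(*args, **kwargs)
        return store[key]

    return wrapper

@cache_resource
def build_index(name, _documents):
    # _documents (e.g. a list of unhashable dicts) is ignored for the key
    return {"name": name, "count": len(_documents)}

first = build_index("docs", [{"a": 1}])
second = build_index("docs", [{"a": 1}, {"b": 2}])  # same key ("docs",) -> cached value
assert first is second
```

Passing the unhashable `_documents` list works precisely because it never reaches the hashing step, which matches the documented behavior of underscore-prefixed parameters.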

Successfully merging this pull request may close these issues.

AttributeError: 'HuggingFaceEmbedding' object has no attribute '_model'