Refactor Catalog Indexing #2632

ramonski · 2024-10-23T20:14:29Z

Description of the issue/feature this PR addresses

This PR refactors the catalog indexing by using the IndexQueue for all SENAITE catalogs and avoids multiple UID catalog indexing.

Current behavior before PR

AT object indexing never uses the IndexQueue, and temporary objects are indexed in the UID catalog.

Desired behavior after PR is merged

AT object indexing always uses the IndexQueue, and no temporary objects are indexed in the UID catalog.

--
I confirm I have tested this PR thoroughly and coded it according to PEP8
and Plone's Python styleguide standards.
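
For readers unfamiliar with the idea, the following is a minimal, self-contained sketch of a queue that collects indexing requests and processes each object only once. It is not SENAITE's IndexQueue: `SimpleIndexQueue`, the `is_temporary` callable, and the processor callables are hypothetical names used purely for illustration.

```python
from collections import OrderedDict


class SimpleIndexQueue(object):
    """Hypothetical stand-in for an indexing queue (not the SENAITE class).

    Requests are collected per object UID and processed once, so repeated
    reindex requests for the same object collapse into a single call.
    """

    def __init__(self, processors, is_temporary=lambda obj: False):
        self.processors = list(processors)
        self.is_temporary = is_temporary
        self.pending = OrderedDict()

    def reindex(self, uid, obj):
        # Temporary objects are never queued
        if self.is_temporary(obj):
            return
        # A later request for the same UID replaces the earlier one
        self.pending[uid] = obj

    def process(self):
        # Hand each queued object to every processor exactly once
        count = 0
        for uid, obj in self.pending.items():
            for processor in self.processors:
                processor(uid, obj)
            count += 1
        self.pending.clear()
        return count


indexed = []
queue = SimpleIndexQueue(
    processors=[lambda uid, obj: indexed.append(uid)],
    is_temporary=lambda obj: obj.get("temporary", False),
)
queue.reindex("uid-1", {"title": "Sample 1"})
queue.reindex("uid-1", {"title": "Sample 1 (edited)"})
queue.reindex("uid-2", {"temporary": True})
processed = queue.process()
```

In this sketch the two requests for `uid-1` result in one processor call, and the temporary object is dropped before it reaches any processor.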

xispa

Awesome! Thanks for the doctest and READMEs (really useful and necessary). I have left some comments and questions.

xispa · 2024-10-30T08:40:11Z

src/bika/lims/utils/analysisrequest.py

+    # reindex UID
+    uid_catalog = api.get_tool("uid_catalog")
+    uid_catalog.catalog_object(obj, obj._getURL())
+    # reindex object in all other catalogs


To avoid repeating the same logic in several places, I suggest relying on the api.catalog_object function, which already takes care of uid_catalog: https://github.com/senaite/senaite.core/blob/2.x/src/bika/lims/api/__init__.py#L403-L417

We could even add the recursive param to the api.catalog_object function as well and remove this reindex function from here. What do you think?

Thanks for the hint! I already forgot that we have an API function that does this.
Changes done in c725d9d
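
A rough sketch of what a recursive variant might look like, using plain stand-in objects rather than the real api.catalog_object (whose actual signature is not reproduced here). `Node`, `catalog_recursively`, and the `index` callable are hypothetical names.

```python
class Node(object):
    """Hypothetical content object with children (stand-in for a folderish type)."""

    def __init__(self, path, children=None):
        self.path = path
        self.children = children or []


def catalog_recursively(obj, index, recursive=False):
    """Index obj and, if requested, all of its descendants.

    `index` is any callable that takes an object; in real code this would be
    the single API function responsible for cataloging.
    """
    index(obj)
    if recursive:
        for child in obj.children:
            catalog_recursively(child, index, recursive=True)


sample = Node("sample-1", [Node("sample-1/analysis-a"), Node("sample-1/analysis-b")])

# Recursive call: the sample and both contained objects are indexed
seen = []
catalog_recursively(sample, lambda o: seen.append(o.path), recursive=True)

# Default call: only the sample itself is indexed
seen_flat = []
catalog_recursively(sample, lambda o: seen_flat.append(o.path))
```

Keeping the recursion behind a flag means existing callers keep the single-object behavior while sample creation can opt in.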

xispa · 2024-10-30T08:52:52Z

src/senaite/core/catalog/catalog_multiplex_processor.py

+        """Get a list of catalog IDs for the given object
+        """
+        # get a list of catalog IDs that are mapped to the object
+        catalogs = list(map(lambda x: x.id, api.get_catalogs_for(obj)))


Note that by default api.get_catalogs_for relies on this static mapping to get the catalogs for a given portal type, and the value of the _catalog attribute is only taken into account when the portal type of the given object is not present in that mapping: https://github.com/senaite/senaite.core/blob/2.x/src/bika/lims/api/__init__.py#L1276

In my opinion, this static mapping forces us to monkey patch api.get_catalogs_for if we want an existing baseline type (like AnalysisRequest) to also be catalogued in a brand-new, add-on-specific catalog.

So, I think this change is fine as long as we skip the static mapping check in api.get_catalogs_for. Or maybe we don't skip it, but rely on a new adapter to get additional catalogs for baseline portal_types.

Yes, you are right and this is why we have the TODO in the docstring here:

senaite.core/src/senaite/core/catalog/__init__.py

Lines 107 to 120 in c725d9d

    def get_catalogs_by_type(portal_type):
        """Return the mapped catalogs by type

        TODO: Provide registry setting for this mapping lookup

        :param portal_type: The portal type to look up
        """
        if not isinstance(portal_type, str):
            raise TypeError("Expected string type, got <%s>" % type(portal_type))
        mapping = dict(CATALOG_MAPPINGS)
        catalogs = mapping.get(portal_type)
        if not catalogs:
            return []
        return catalogs

I guess that by providing the mapping as a registry setting, we could easily add, remove, or edit catalogs for both the base types and new types.

For the moment, I would like to postpone that until we have this requirement and provide it in a new PR.
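
To make the postponed idea concrete, here is a hedged sketch of a lookup that extends a static mapping with entries from an editable, registry-like mapping. The content of `STATIC_MAPPINGS`, the catalog IDs, the plain `registry` dict, and `get_catalogs_by_type_with_registry` are all hypothetical; the real registry API is not used here.

```python
# Hypothetical static mapping (the real CATALOG_MAPPINGS content is not shown)
STATIC_MAPPINGS = (
    ("AnalysisRequest", ["catalog_a"]),
    ("Batch", ["catalog_b"]),
)


def get_catalogs_by_type_with_registry(portal_type, registry=None):
    """Return catalog IDs for a portal type.

    Entries in `registry` (a plain dict standing in for a registry setting)
    extend the static mapping, so an add-on could register extra catalogs
    for a baseline type without monkey patching.
    """
    if not isinstance(portal_type, str):
        raise TypeError("Expected string type, got <%s>" % type(portal_type))
    # Start from a copy of the static entry so the mapping is never mutated
    catalogs = list(dict(STATIC_MAPPINGS).get(portal_type, []))
    for catalog_id in (registry or {}).get(portal_type, []):
        if catalog_id not in catalogs:
            catalogs.append(catalog_id)
    return catalogs


registry = {"AnalysisRequest": ["addon_catalog"]}
result = get_catalogs_by_type_with_registry("AnalysisRequest", registry)
unknown = get_catalogs_by_type_with_registry("UnknownType", registry)
```

Whether registry entries should extend or replace the static entry is a design decision this sketch does not settle; it extends, which matches the add-on use case raised above.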

xispa · 2024-10-30T09:21:06Z

src/senaite/core/patches/archetypes/referenceable.py

+        return
+    if not uc:
+        uc = api.get_tool(UID_CATALOG)
+    url = self._getURL()


Note for @xispa: ._getURL() relies on Archetypes.utils.getRelPath, which returns the relative path of the object from the portal root. The result is the same as url = "/".join(obj.getPhysicalPath()[2:])
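
The path slicing can be illustrated with a stand-in object. `FakeContent`, `relative_path`, and the example path are hypothetical, and the `[2:]` slice assumes the portal sits directly below the Zope root, so that the first two path elements are the empty root element and the portal ID.

```python
class FakeContent(object):
    """Hypothetical object exposing getPhysicalPath like a Zope content item."""

    def __init__(self, physical_path):
        self._path = tuple(physical_path)

    def getPhysicalPath(self):
        return self._path


def relative_path(obj):
    # Drop the leading "" (Zope root) and the portal ID, keep the rest
    return "/".join(obj.getPhysicalPath()[2:])


obj = FakeContent(("", "senaite", "clients", "client-1", "W-0001"))
url = relative_path(obj)
```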

xispa · 2024-10-30T09:28:45Z

src/senaite/core/patches/cmfcore/portal_catalog_processor.py

+PORTAL_CATALOG = "portal_catalog"
+
+
+def index_in_portal_catalog(obj):


Shouldn't we also check for api.is_temporary(obj), like what you did in catalog_multiplex?

Everything that ends up in this queue comes from Products.CMFCore.CatalogTool.reindexObject, which already applies filterTemporaryItems, which in turn uses our patched isTemporary for both AT and DX types:

senaite.core/src/senaite/core/patches/archetypes/base_object.py

Lines 29 to 30 in c725d9d

    def isTemporary(self):
        return api.is_temporary(self)

senaite.core/src/senaite/core/patches/dexterity/dexterity_content.py

Lines 29 to 30 in c725d9d

    def isTemporary(self):
        return api.is_temporary(self)

So another api.is_temporary check is not required at this stage.
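
The reasoning can be sketched with simplified stand-ins: if the only entry point to the queue already drops temporary objects, the processor never sees one. `reindex_object`, `process_queue`, and the dict-based objects are hypothetical and do not mirror the CMFCore code.

```python
def is_temporary(obj):
    # Hypothetical check; stands in for the patched isTemporary
    return obj.get("temporary", False)


def reindex_object(obj, queue):
    """Single entry point: temporary objects are filtered before queuing."""
    if is_temporary(obj):
        return False
    queue.append(obj)
    return True


def process_queue(queue):
    """Processor: no second temporary check, the queue is already clean."""
    return [obj["id"] for obj in queue]


queue = []
queued_regular = reindex_object({"id": "sample-1"}, queue)
queued_temporary = reindex_object({"id": "tmp-1", "temporary": True}, queue)
indexed_ids = process_queue(queue)
```

This only holds as long as nothing else writes to the queue directly, which is the assumption stated in the reply above.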

xispa

Excellent, thanks!

ramonski added 10 commits October 23, 2024 09:05

Added test base

ee94a7c

Moved patches to separate packages

238489c

Handle AT catalog multiplexing

3229141

Skip portal catalog indexer

fd94f47

reindex UID after sample creation

bba9cfb

Patch UID indexing

c50e11d

Test fixture

1953fad

Added test for Analysis Indexing

e512184

Workaround for partition IDs

1a329be

Added comment

3d49e31

ramonski added the Improvement 🔧, PR: Not Ready ⛔️, and Cleanup 🧹 (Code cleanup and refactoring) labels Oct 23, 2024

ramonski marked this pull request as draft October 23, 2024 20:14

ramonski mentioned this pull request Oct 23, 2024

Temporary deactivation of Workflow Variables Patch from #2593 #2628

Closed

ramonski added 2 commits October 24, 2024 09:41

Added more tests

6899f23

Implemented Workflow Variables Patch

2e3f85c

ramonski removed the PR: Not Ready ⛔️ label Oct 24, 2024

ramonski marked this pull request as ready for review October 24, 2024 11:15

ramonski added 2 commits October 24, 2024 13:17

Reduce noise in diff

10c107f

Added indexing test for batches

78e4ad0

ramonski requested a review from xispa October 24, 2024 11:37

ramonski and others added 8 commits October 24, 2024 20:47

README added

503c386

Readme added

96c76a6

Readme added

ddf4069

Better comment

0a1e534

Better comment

cddf46c

Merge branch '2.x' into rethink-catalog-indexing

0ffbd77

Changelog updated

841e19b

Merge branch '2.x' into rethink-catalog-indexing

4abe694

xispa requested changes Oct 30, 2024


ramonski added 2 commits October 30, 2024 19:55

Merge branch '2.x' into rethink-catalog-indexing

3489a99

Rely on API method to reindex new Sample objects

c725d9d

xispa approved these changes Oct 31, 2024


xispa merged commit a5fbfbd into 2.x Oct 31, 2024
2 checks passed

xispa deleted the rethink-catalog-indexing branch October 31, 2024 14:55
