[ntuple] Add MT index building #16679

enirolf · 2024-10-14T16:06:33Z

This PR introduces the first steps towards MT support for the RNTupleIndex, by enabling mulithreaded building of the index. To enable this, the index itself now manages multiple index partitions, which are essentially sub-indices for a particular entry range. These entry ranges are currently set according to the cluster boundaries, but further benchmarking and evaluation will be required to determine the optimal partitioning scheme.

Building a separate index for each RNTuple cluster (or any other partitioning scheme) opens up the possibility for multithreaded building and probing. Note that the choice to partition on cluster boundaries might not be the optimal choice performance-wise and might be changed in the future following further evaluation.

github-actions · 2024-10-14T19:57:19Z

Test Results

17 files 17 suites 4d 2h 3m 25s ⏱️
2 713 tests 2 713 ✅ 0 💤 0 ❌
43 511 runs 43 511 ✅ 0 💤 0 ❌

Results for commit 8cce410.

hahnjo

Some comments inline. I think the current implementation with partitions means the built index is less performant (on probe) than before, which is not ideal. Did you consider merging the partial indices at the end to preserve constant complexity?

hahnjo · 2024-10-15T06:41:42Z

tree/ntuple/v7/inc/ROOT/RNTupleIndex.hxx

-   const std::vector<NTupleSize_t> *GetAllEntryNumbers(const std::vector<void *> &valuePtrs) const;
+   const std::vector<NTupleSize_t> GetAllEntryNumbers(const std::vector<void *> &valuePtrs) const;


Note that this interface forces the allocation of a std::vector including its heap-backed data for every probe

Additionally, returning a const value is generally not what you want (as it is my understanding it may prevent move semantics/rvalue reference semantics, and it doesn't really give any benefit to the caller).
To avoid forcing the allocation on the caller while giving them ownership of the data it's probably better to pass the out vector as a reference parameter (so they may reuse it across calls etc).

hahnjo · 2024-10-15T06:44:12Z

tree/ntuple/v7/inc/ROOT/RNTupleIndex.hxx

+      RNTupleIndexPartition(const RClusterDescriptor &descriptor,
+                            const std::vector<std::unique_ptr<RFieldBase>> &indexFields, const RPageSource &pageSource)
+         : fPageSource(pageSource.Clone())


Does this constructor need inlining? If not I think it should be defined in the source file

hahnjo · 2024-10-15T06:45:07Z

tree/ntuple/v7/inc/ROOT/RNTupleIndex.hxx

+            auto clonedField = field->Clone(field->GetFieldName());
+            CallConnectPageSourceOnField(*clonedField, *fPageSource);


Now we need to clone and connect page sources and fields for every index partition / cluster, which means we probably cannot use cluster prefetching...

hahnjo · 2024-10-15T06:47:18Z

tree/ntuple/v7/src/RNTupleIndex.cxx

 ROOT::Experimental::Internal::RNTupleIndex::GetAllEntryNumbers(const std::vector<void *> &valuePtrs) const
 {
   if (valuePtrs.size() != fIndexFields.size())
      throw RException(R__FAIL("Number of value pointers must match number of indexed fields."));

   EnsureBuilt();

-   std::vector<NTupleIndexValue_t> indexValues;
-   indexValues.reserve(fIndexFields.size());
+   std::vector<std::vector<NTupleIndexValue_t>> entryNumbersPerCluster;


This implementation uses two vectors (entryNumbersPerCluster and entryNumbers) which require two heap allocations. Would it be possible to fill entryNumbers directly?

hahnjo · 2024-10-15T06:49:42Z

tree/ntuple/v7/src/RNTupleIndex.cxx

+   for (const auto &indexPartition : fIndexPartitions) {
+      auto clusterEntryNumbers = indexPartition.fIndex.find(indexValue);
+
+      if (clusterEntryNumbers == indexPartition.fIndex.end())
+         continue;
+
+      entryNumbersPerCluster.push_back(clusterEntryNumbers->second);
+   }


Note that this introduces linear complexity in the number of clusters: while fIndex.find() has constant complexity, now there are separate hash maps per cluster. In the end, this means linear complexity in the number of entries (with a small coefficient). Maybe this is fine, but I think this means worse probe performance because of multi-threaded building...

hahnjo · 2024-10-15T06:52:38Z

tree/ntuple/v7/src/RNTupleIndex.cxx

-   return &(entryNumber->second);
+   return entryNumbers;


(side note: it might be possible to get around returning vectors by value. For example, after building the partitions the RNTupleIndex could "link" collisions across partitions. Then we could return a "linked list of vectors", which would behave like a container with a custom iterator.)

hahnjo · 2024-10-15T06:53:52Z

tree/ntuple/v7/inc/ROOT/RNTupleIndex.hxx

+#ifdef R__USE_IMT
+   std::unique_ptr<ROOT::TThreadExecutor> fPool;
+#endif


Is this member required? AFAICT it's only used once during Build(), so it might be constructed and destructed locally

enirolf added 2 commits October 14, 2024 17:53

[ntuple] Add multi-threaded index building

8cce410

enirolf added the in:RNTuple label Oct 14, 2024

enirolf requested review from hahnjo, pcanal, silverweed and vepadulano October 14, 2024 16:06

enirolf self-assigned this Oct 14, 2024

hahnjo reviewed Oct 15, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ntuple] Add MT index building #16679

[ntuple] Add MT index building #16679

enirolf commented Oct 14, 2024

github-actions bot commented Oct 14, 2024

hahnjo left a comment

hahnjo Oct 15, 2024

silverweed Oct 15, 2024

hahnjo Oct 15, 2024

hahnjo Oct 15, 2024

hahnjo Oct 15, 2024

hahnjo Oct 15, 2024

hahnjo Oct 15, 2024

hahnjo Oct 15, 2024

		const std::vector<NTupleSize_t> GetAllEntryNumbers(const std::vector<void > &valuePtrs) const;
		const std::vector<NTupleSize_t> GetAllEntryNumbers(const std::vector<void *> &valuePtrs) const;

		auto clonedField = field->Clone(field->GetFieldName());
		CallConnectPageSourceOnField(clonedField, fPageSource);

[ntuple] Add MT index building #16679

Are you sure you want to change the base?

[ntuple] Add MT index building #16679

Conversation

enirolf commented Oct 14, 2024

github-actions bot commented Oct 14, 2024

Test Results

hahnjo left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment