🚀 Optimized entropy functions #69

serkor1 · 2025-02-01T19:43:01Z

📚 What?

This PR optimizes all entropy functions. The runtime on 200 x 1e6 matrices has decreased significantly (from 2.5 to 0.86 seconds in the benchmark below, without OpenMP):

Before optimization (Without OpenMP)

Iterations	Runtime (sec)	Garbage Collections [gc()]	gc() per second	Memory Allocation (MB)
100	2.5	0	0	0

After optimization (Without OpenMP)

Iterations	Runtime (sec)	Garbage Collections [gc()]	gc() per second	Memory Allocation (MB)
100	0.86	0	0	0

* The values are passed as pointers, and the overall logic has been simplified.

* See commit fd906f0

* The benchmark results for the entropy function have not changed, even though local tests indicated that the new implementation is faster. This will be investigated.
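As a rough illustration of what "passed as pointers" can look like, here is a minimal sketch. It is not the code from fd906f0: the function name `entropy_ptr` is hypothetical, and it assumes the input already holds probabilities that sum to 1. The point is only that the loop reads the caller's buffer through a raw pointer, so no copy of the data is made.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper (an assumption, not the package's implementation):
// Shannon entropy of n probabilities read through a raw pointer.
// Assumes the values in pk are non-negative and sum to 1.
inline double entropy_ptr(const double* pk, std::size_t n) {
    double h = 0.0;
    for (std::size_t i = 0; i < n; ++i) {
        const double p = pk[i];
        // Skip zero entries so that 0 * log(0) is treated as 0.
        if (p > 0.0) {
            h -= p * std::log(p);
        }
    }
    return h;
}
```

A caller holding a `std::vector<double> v` would invoke this as `entropy_ptr(v.data(), v.size())`. The same shape works with any contiguous buffer of doubles.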

ellipsis-dev

👍 Looks good to me! Reviewed everything up to 81d28c7 in 1 minute and 16 seconds

More details

Looked at 1074 lines of code in 27 files
Skipped 1 file when reviewing.
Skipped posting 12 drafted comments based on config settings.

1. src/classification_CrossEntropy.h:49

Draft comment:
Using #pragma omp simd and #pragma omp parallel for together can lead to inefficiencies. Consider using only #pragma omp parallel for for the outer loop parallelization.
Reason this comment was not posted:
Decided after close inspection that this draft comment was likely wrong and/or not actionable:
The combination of parallel for and simd here actually makes sense - they operate at different levels of parallelism. Parallel for splits work across threads, while simd enables vectorization within each thread. The operations in the inner loops (simple additions and multiplications) are perfect candidates for SIMD vectorization. The comment appears to be incorrect about inefficiencies.
I could be wrong about the interaction between these OpenMP directives. There might be specific hardware architectures where this combination is problematic.
While there might be edge cases, the code follows a common and well-established pattern of combining thread-level and SIMD parallelism. The operations are simple enough that vectorization should be beneficial.
The comment should be deleted as it suggests removing SIMD directives that are actually beneficial for performance when properly combined with parallel for as done in this code.

2. src/classification_CrossEntropy.h:62

Draft comment:
Using #pragma omp simd and #pragma omp parallel for together can lead to inefficiencies. Consider using only #pragma omp parallel for for the outer loop parallelization.
Reason this comment was not posted:
Marked as duplicate.

3. src/classification_CrossEntropy.h:123

Draft comment:
Using #pragma omp simd and #pragma omp parallel for together can lead to inefficiencies. Consider using only #pragma omp parallel for for the outer loop parallelization.
Reason this comment was not posted:
Marked as duplicate.

4. src/classification_Entropy.h:44

Draft comment:
Using #pragma omp simd and #pragma omp parallel for together can lead to inefficiencies. Consider using only #pragma omp parallel for for the outer loop parallelization.
Reason this comment was not posted:
Marked as duplicate.

5. src/classification_Entropy.h:57

Draft comment:
Using #pragma omp simd and #pragma omp parallel for together can lead to inefficiencies. Consider using only #pragma omp parallel for for the outer loop parallelization.
Reason this comment was not posted:
Marked as duplicate.

6. src/classification_Entropy.h:117

Draft comment:
Using #pragma omp simd and #pragma omp parallel for together can lead to inefficiencies. Consider using only #pragma omp parallel for for the outer loop parallelization.
Reason this comment was not posted:
Marked as duplicate.

7. src/classification_RelativeEntropy.h:50

Draft comment:
Using #pragma omp simd and #pragma omp parallel for together can lead to inefficiencies. Consider using only #pragma omp parallel for for the outer loop parallelization.
Reason this comment was not posted:
Marked as duplicate.

8. src/classification_RelativeEntropy.h:65

Draft comment:
Using #pragma omp simd and #pragma omp parallel for together can lead to inefficiencies. Consider using only #pragma omp parallel for for the outer loop parallelization.
Reason this comment was not posted:
Marked as duplicate.

9. src/classification_CrossEntropy.h:13

Draft comment:
All Rcpp functions and classes should be namespace qualified. For example, use Rcpp::NumericVector instead of NumericVector. This applies throughout the C++ code.
Reason this comment was not posted:
Comment was on unchanged code.

10. src/classification_Entropy.cpp:14

Draft comment:
All Rcpp functions and classes should be namespace qualified. For example, use Rcpp::NumericVector instead of NumericVector. This applies throughout the C++ code.
Reason this comment was not posted:
Marked as duplicate.

11. src/classification_Entropy.h:11

Draft comment:
All Rcpp functions and classes should be namespace qualified. For example, use Rcpp::NumericVector instead of NumericVector. This applies throughout the C++ code.
Reason this comment was not posted:
Marked as duplicate.

12. src/classification_RelativeEntropy.h:14

Draft comment:
All Rcpp functions and classes should be namespace qualified. For example, use Rcpp::NumericVector instead of NumericVector. This applies throughout the C++ code.
Reason this comment was not posted:
Marked as duplicate.
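The reasoning under draft comment 1 (threads across the outer loop, SIMD within each thread) can be sketched as follows. This is an illustrative example only, not the code in src/classification_CrossEntropy.h: the function name `cross_entropy_rows`, the row-major layout, and the signature are all assumptions. When compiled without OpenMP support the pragmas are ignored and the function runs serially with the same result.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical sketch (an assumption, not the package's implementation):
// cross entropy -sum_j p[j] * log(q[j]) for each row of two row-major
// n_rows x n_cols matrices. Assumes every q entry is strictly positive.
inline std::vector<double> cross_entropy_rows(const double* p, const double* q,
                                              std::size_t n_rows, std::size_t n_cols) {
    std::vector<double> out(n_rows, 0.0);
    const std::ptrdiff_t rows = static_cast<std::ptrdiff_t>(n_rows);

    // Thread level: rows are split across threads. Each row writes only
    // to its own slot in out, so no synchronization is needed.
    #pragma omp parallel for
    for (std::ptrdiff_t r = 0; r < rows; ++r) {
        const double* p_row = p + static_cast<std::size_t>(r) * n_cols;
        const double* q_row = q + static_cast<std::size_t>(r) * n_cols;
        double acc = 0.0;

        // Vector level: the reduction clause allows the compiler to
        // vectorize the accumulation within a single thread.
        #pragma omp simd reduction(+:acc)
        for (std::size_t j = 0; j < n_cols; ++j) {
            acc += p_row[j] * std::log(q_row[j]);
        }
        out[static_cast<std::size_t>(r)] = -acc;
    }
    return out;
}
```

Whether the inner `simd` directive pays off depends on the compiler and on whether a vectorized `log` is available, so measuring both variants is the safer way to settle the question raised in the draft comment.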

Workflow ID: wflow_L0sjpnWlN4E1W2HS


serkor1 added 6 commits February 1, 2025 09:53

🔨 Optimized Cross Entropy

fd906f0

* The values are passed as pointers, and the overall logic has been simplified.

🚀 Optimized Entropy function

e5ebe75

* See fd906f0

🚀 Optimized Relative entropy

e988607

* See commit fd906f0

📖 Updated benchmarks and meta-docs

5a9a140

* The benchmarks have not changed on the entropy-function. The local tests indicated that the new implementation was faster - this will be investigated.

🚀 Performance Benchmarks

45b459f

📚 Updated README and NEWS

81d28c7

serkor1 added the enhancement (New feature or request) and optimze (Various optimizations to source code) labels Feb 1, 2025

serkor1 self-assigned this Feb 1, 2025

serkor1 linked an issue Feb 1, 2025 that may be closed by this pull request

[BUG] Optimized entropy-functions #68

Closed

2 tasks

ellipsis-dev bot reviewed Feb 1, 2025


serkor1 merged commit a9ebafe into development Feb 1, 2025
22 checks passed

serkor1 deleted the optimize-entropy branch February 1, 2025 20:01

serkor1 mentioned this pull request Feb 3, 2025

🚀 {SLmetrics} Version 0.3-2 #71

Merged
