Merge release/2.6 into google/2.6 #15663
Conversation
juszhan1
commented
Dec 23, 2024
- DAOS-16784 build: Tag 2.6.2 tb2 (#15461)
- DAOS-9355 doc: DAOS 2.6.2 release notes (#15560)
- SRE-2525 ci: Fix Trivy scan upload to the Security tab (#15394)
- DAOS-16350 test: decrease pool size for ior_per_rank (#15183) (#15403)
- DAOS-16265 test: Split erasurecode/multiple_failure.py (#15355) (#15369)
- DAOS-16096 test: Add retry loop for comparing free pool space (#15289) (#15411)
- DAOS-16825 test: Support register cleanup for all Test classes (#15530) (#15540)
- DAOS-16709 test: Handle decoding empty json output (#15397) (#15410)
- DAOS-12859 test: use pool and container labels (pass 3) (#13210) (#15253)
- DAOS-16670 test: container/multiple_delete.py - Increase SCM leftover… (#15420) (#15457)
- DAOS-16702 rebuild: restart rebuild for a massive failure case (#15406)
- DAOS-16100 test: Fix stopping daos_test during timeout (#15275) (#15603)
- DAOS-16167 test: update soak test to use internal job scheduler (#14775) (#15595)
- DAOS-16865 cq: update flake8 to 7.1.1 (#15575) (#15597)
- DAOS-16276 doc: Address engine unavailability (#15456) (#15496)
- DAOS-16572 object: refine sc_ec_agg_active flag setting (#15352)
- DAOS-16812 cart: read after free cid 2556737 (#15517) (#15600)
- DAOS-16875 cq: fix flake8 xargs usage (#15608) (#15614)
- SRE-2171 ci: Big refactor of GHA workflows (#15556) (#15588)
- DAOS-16621 build: Fix Go versions in rpm/deb packaging (#15174) (#15255)
- DAOS-16170 cart: do not release completed RPC reference repeatedly - b26 (#15477)
- DAOS-623 client: Address java security scan (#15542) (DAOS-16880, #15615)
- DAOS-16826 build: Fix compiling issues in gcc 14 (#15531) (#15607)
- DAOS-16833 cq: update GHA ubuntu (#15538) (#15563)
- DAOS-16662 test: update some tests to use unique dfuse mount (#15242) (#15598)
- DAOS-16873 cq: remove workflows/version-checks.yml (#15601) (#15627)
- DAOS-16787 utils: Suppress NLT valgrind false positives (#15478) (#15637)
- DAOS-13292 build: Don't need UCX libraries … (#15016) (#15623)
- DAOS-15964 test: verify daos_server_helper on server (#15503) (#15599)
- DAOS-16645 cart: Bump file descriptor limit (#15224) (#15366)
- DAOS-16889 build: fix finding protobuf (#15625) (#15641)
- DAOS-16263 cq: merge yamllint and clang-format into linting (#14803) (#15642)
Tag second test build for 2.6.2. faults-enabled: false Signed-off-by: Phil Henderson <phillip.henderson@intel.com>
2.6.2 release notes document Signed-off-by: Phil Henderson <phillip.henderson@hpe.com>
- Enable write access to the Security section of the GitHub project.
- Use the GHA cache to avoid Trivy scan failures caused by database download errors when the CVE database is overused.
Upgrade `trivy-action` to version 0.28.0, where the caching mechanism is enabled by default. Enable the debug option in Trivy to be prepared for detailed analysis of scan failures. Signed-off-by: Tomasz Gromadzki <tomasz.gromadzki@intel.com>
Split erasurecode/multiple_failure.py into two separate tests to reduce the possibility that a large number of ERR messages in the server log file causes other test variants to fail due to out-of-space errors. Signed-off-by: Phil Henderson <phillip.henderson@intel.com>
#15411) Loop retrying the check for the pool free space after destroying half of the containers. If the check doesn't pass within 60 seconds, then fail the test. Signed-off-by: Phil Henderson <phillip.henderson@intel.com>
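The retry behaviour described above can be sketched roughly as follows. This is a minimal illustration, not the code from the DAOS test: `wait_for`, its parameters, and the sample readings are hypothetical names chosen for the example, and only the 60-second limit comes from the commit message.

```python
import time
from typing import Callable


def wait_for(check: Callable[[], bool], timeout: float = 60.0,
             interval: float = 1.0) -> bool:
    """Call check() repeatedly until it returns True or timeout expires.

    Returns True as soon as the check passes, False if the deadline is
    reached first. The check always runs at least once.
    """
    deadline = time.monotonic() + timeout
    while True:
        if check():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)


# Hypothetical usage: free space is reclaimed gradually after containers
# are destroyed, so the comparison only passes on a later reading.
readings = iter([10, 20, 50])
passed = wait_for(lambda: next(readings) >= 50, timeout=5, interval=0)
print("free space check passed:", passed)
```

A test built on this pattern would fail only when `wait_for` returns False, which gives space reclamation time to finish instead of failing on the first comparison.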
… (#15540) Support calling register cleanup methods for tests based upon the Test and TestWithoutServers classes. Also remove stopping agents as part of calling TestWithServers.stop_servers() since DAOS-6873 is no longer an issue. Signed-off-by: Phil Henderson <phillip.henderson@intel.com>
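As a rough sketch of what registering cleanup methods can look like, the class below collects callables and runs them at teardown. The name `CleanupRegistry`, its methods, and the last-in-first-out ordering are assumptions made for illustration; the actual DAOS test framework may organize this differently.

```python
class CleanupRegistry:
    """Collect cleanup callables and run them at teardown (hypothetical helper)."""

    def __init__(self):
        self._entries = []

    def register(self, method, **kwargs):
        """Remember a cleanup method and the keyword arguments to call it with."""
        self._entries.append((method, kwargs))

    def run_all(self):
        """Run registered methods newest-first and return a list of error strings.

        A failing cleanup does not stop the remaining ones from running.
        """
        errors = []
        while self._entries:
            method, kwargs = self._entries.pop()
            try:
                method(**kwargs)
            except Exception as error:  # keep going so later cleanups still run
                errors.append(f"{method.__name__}: {error}")
        return errors
```

Running the entries newest-first means resources created later (for example a container) are cleaned up before the resources they depend on (for example a pool); that ordering is an assumption of this sketch.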
#15420) (#15457) The object placement algorithm was changed by DAOS-16445. As a result, data are written to targets more uniformly, while the amount of leftover data after container destroy/garbage collection in each target remains the same. That is, data are written to more targets while the cleanup method in each target has not been improved, which results in more aggregate leftover data. To handle the larger amount of leftover data in SCM, increase the threshold to 1.5MB. Signed-off-by: Makito Kano <makito.kano@intel.com>
In a special massive failure case:
1. Some engines went down and triggered a rebuild.
2. One engine that was participating in the rebuild went down again before the rebuild finished; the number of failures then exceeds the pool RF, so the pool map does not change.
3. That engine is restarted by the administrator.
In that case the rebuild task should be recovered on the engine; to keep it simple for now, just abort and retry the global rebuild task. The issue does not occur with the typical recovery approach of restarting the whole system, including the PS leader.
Another backport commit: 947c76d DAOS-16175 container: fix a case for cont_iv_hdl_fetch (#15395)
Signed-off-by: Xuezhao Liu <xuezhao.liu@intel.com>
Fix stopping timed out processes run by a JobManager class by only searching for and killing the command executable being run by clush, orterun, mpirun, etc. Add a new harness/cmocka.py test to verify the stopping of the processes with a test timeout. Signed-off-by: Phil Henderson <phillip.henderson@intel.com>
…) (#15595) Update soak to support using an internal job scheduler. Signed-off-by: Maureen Jean <maureen.jean@intel.com> Co-authored-by: mjean308 <48688872+mjean308@users.noreply.github.com>
Clear the sc_ec_agg_active flag more proactively. Signed-off-by: Xuezhao Liu <xuezhao.liu@intel.com>
Add the use of reusable workflows and actions to reduce the amount of duplicated code in this repository as well as dependency repositories.
Run Bullseye workflow on schedule (#15574)
Saturdays at midnight, UTC.
Accept and propagate a run-gha variable (#15576)
For the case where daos is being used as a downstream test.
Test inputs context before trying to use it.
Fixes: SRE-2570 DAOS-16262
Signed-off-by: Brian J. Murrell <brian.murrell@intel.com>
- Set Go minimum version to 1.21 in rpm and debian packaging spec files.
- Update scons Go version check to use version in go.mod.
- Add a reminder in go.mod file so we remember the packaging files when bumping the minimum Go version in the future.
- Update Ubuntu 22.04 Dockerfile to get an appropriate version of Go.
Signed-off-by: Kris Jacque <kristin.jacque@hpe.com>
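To illustrate the idea of taking the minimum Go version from go.mod instead of hard-coding it, here is a minimal sketch. It is not the scons check from the DAOS build; the function names and the sample go.mod text (including the module path `example.com/demo`) are hypothetical, and pre-release version strings such as `1.21rc1` are not handled.

```python
import re


def go_mod_version(go_mod_text):
    """Return the version from the `go` directive in go.mod text, or None."""
    match = re.search(r"^go[ \t]+(\d+(?:\.\d+)*)[ \t]*(?://.*)?$",
                      go_mod_text, re.MULTILINE)
    return match.group(1) if match else None


def version_tuple(version):
    """Turn '1.21' or '1.21.5' into a comparable 3-part tuple like (1, 21, 0)."""
    parts = [int(part) for part in version.split(".")]
    parts += [0] * (3 - len(parts))
    return tuple(parts)


def meets_minimum(installed, go_mod_text):
    """Check an installed Go version against the minimum declared in go.mod."""
    required = go_mod_version(go_mod_text)
    if required is None:
        raise ValueError("no go directive found in go.mod text")
    return version_tuple(installed) >= version_tuple(required)


# Hypothetical go.mod content with a reminder comment above the directive.
SAMPLE_GO_MOD = """module example.com/demo

// Remember to update the packaging files when bumping this version.
go 1.21
"""

print(go_mod_version(SAMPLE_GO_MOD))           # 1.21
print(meets_minimum("1.20.9", SAMPLE_GO_MOD))  # False
```

Reading the value from go.mod keeps a single source of truth for the build check, while the rpm and debian spec files still need a manual update, which is what the reminder comment is for.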
…b26 (#15477) For a collective RPC, when handling failure cases during crt_req_send(), its reference may already have been released via crt_rpc_complete_and_unlock(), which is triggered by crt_corpc_complete(). In that case, we should check whether the RPC has completed before calling RPC_DECREF(), to avoid releasing the RPC reference repeatedly. The patch also initializes some local variables for CHK RPC to avoid accessing invalid DRAM when handling a failed collective CHK RPC. Some enhancements to the CR test logic. Signed-off-by: Fan Yong <fan.yong@intel.com>
…#15598) Update some tests to use unique dfuse mount directory by letting the framework generate one. Remove mount_dir from run_ior_multiple_variants since it is no longer needed and this level of fine control should be handled per test ideally. Signed-off-by: Dalton Bohning <dalton.bohning@intel.com>
With the tcp provider, using many sockets can cause significant file descriptor usage. Bump the soft limit if possible, and warn if it appears insufficient. Valgrind sets the hard limit to the soft limit, so work around that in NLT. Signed-off-by: Jeff Olivier <jeffolivier@google.com>
…15642) Merge yamllint and clang-format into the linting workflow so all lint checks are grouped together. Make yaml-lint required but clang-format optional until stable. Signed-off-by: Dalton Bohning <dalton.bohning@hpe.com>
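The soft-limit bump can be illustrated with Python's standard `resource` module (Unix only). This is a sketch of the general technique, not the cart implementation; the function name and the `wanted` value in the usage line are hypothetical.

```python
import resource


def bump_nofile_soft_limit(wanted):
    """Try to raise the soft RLIMIT_NOFILE to `wanted`, capped by the hard limit.

    Returns (soft_limit, sufficient), where `sufficient` is False when the
    resulting soft limit is still below `wanted` and a warning is warranted.
    """
    soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    unlimited = resource.RLIM_INFINITY
    target = wanted if hard == unlimited else min(wanted, hard)
    if soft != unlimited and soft < target:
        try:
            # Only the soft limit changes; the hard limit is left as-is.
            resource.setrlimit(resource.RLIMIT_NOFILE, (target, hard))
        except (ValueError, OSError):
            pass  # the platform refused the new value; keep the current limit
        soft, _ = resource.getrlimit(resource.RLIMIT_NOFILE)
    sufficient = soft == unlimited or soft >= wanted
    return soft, sufficient


soft, ok = bump_nofile_soft_limit(4096)  # 4096 is an arbitrary example value
if not ok:
    print(f"warning: file descriptor soft limit {soft} may be insufficient")
```

Because an unprivileged process cannot raise its soft limit above the hard limit, a tool that lowers the hard limit to the soft limit (as the commit message says Valgrind does) leaves no room to bump, which is why a separate workaround is needed there.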
…/2.6 Required-githooks: true Change-Id: I10ffc1413bec864f7e72578a298e71d460258b55
Errors are: component not formatted correctly; ticket number prefix incorrect; PR title is malformatted (see https://daosio.atlassian.net/wiki/spaces/DC/pages/11133911069/Commit+Comments); unable to load ticket data.
Scorecard found more than 20 potential problems in the proposed changes. Check the Files changed tab for more details.
Test stage NLT on EL 8.8 completed with status UNSTABLE. https://build.hpdd.intel.com/job/daos-stack/job/daos//view/change-requests/job/PR-15663/2/testReport/
Test stage Functional on EL 8.8 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15663/2/execution/node/1216/log
Test stage Functional on EL 8.8 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15663/3/execution/node/490/log