
OpenSearch Performance Experiments Results #2461 (Closed)

travisbenedict opened this issue Mar 14, 2022 · 4 comments
Labels: benchmarking, enhancement, Performance

travisbenedict commented Mar 14, 2022

Background

Using OpenSearch 1.2 build 762 (arm64/x64) I ran a set of ~20 performance tests for each of the following single node configurations:

  1. m5.xlarge - Security enabled
  2. m5.xlarge - Security disabled
  3. m6g.xlarge - Security enabled
  4. m6g.xlarge - Security disabled

All tests were run using OpenSearch Benchmark with an i3.8xlarge EC2 instance as the load generation host. The tests used a modified version of the default schedule for the nyc_taxis workload, which runs the original schedule twice with all operations in warmup mode and then three more times as the standard schedule, i.e. two warmup iterations and three test iterations. Additional aggregations were run on the results of each test to average metrics across the different query types, producing a set of query summary metrics.
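For reference, here is a minimal sketch of how per-task warmup and test iterations can be expressed in an OpenSearch Benchmark test_procedure file. The procedure name, operation name, client count, and iteration counts below are illustrative only, not the exact modified schedule used for these tests:

```json
{
  "name": "custom-nyc-taxis-iterations",
  "description": "Illustrative only: two warmup and three test iterations per task",
  "schedule": [
    {
      "operation": "default",
      "warmup-iterations": 2,
      "iterations": 3,
      "clients": 1
    }
  ]
}
```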

A new load generator and new OpenSearch single node cluster were provisioned for each test.

Findings

Some random variation between tests is expected. For indexing throughput, the standard deviation as a percentage of the mean (the coefficient of variation) of any percentile statistic, excluding p100, is about 5% across all configurations. For query latency it is about 10%.

Average latency across all queries in a workload can vary by 20% or more between any two tests. Why this happens will require more research; in the meantime we should avoid outright comparisons of one test to another.

Included below are some approximate statistics for index and query metrics for each configuration: the expected (average) value, the standard deviation as a percentage of the mean, and the percent difference between the minimum and maximum. This table is meant to give people a framework for understanding their performance test results and should not be taken as ground truth.

| Instance Type | Security | Expected Indexing Throughput Avg (req/s) | Indexing StDev % of Mean | Indexing Min-Max % Diff | Expected Query Latency p90 (ms) | Expected Query Latency p99 (ms) | Query StDev % of Mean | Query Min-Max % Diff |
|---------------|----------|------------------------------------------|--------------------------|-------------------------|---------------------------------|---------------------------------|-----------------------|----------------------|
| m5.xlarge     | Enabled  | 30554                                    | ~5%                      | ~12%                    | 431                             | 449                             | ~10%                  | ~23%                 |
| m5.xlarge     | Disabled | 34472                                    | ~5%                      | ~15%                    | 418                             | 444                             | ~10%                  | ~25%                 |
| m6g.xlarge    | Enabled  | 38625                                    | ~3%                      | ~8%                     | 497                             | 512                             | ~8%                   | ~23%                 |
| m6g.xlarge    | Disabled | 45447                                    | ~2%                      | ~3%                     | 470                             | 480                             | ~5%                   | ~15%                 |
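As an illustration of how the summary columns above can be derived from repeated test runs, here is a short Python sketch. It assumes the min-max difference is computed relative to the minimum (the exact denominator isn't stated here), and the sample throughput values are made up:

```python
import statistics

def summarize(values):
    # Summarize one metric (e.g. indexing throughput or a query latency
    # percentile) across repeated tests of the same configuration.
    mean = statistics.mean(values)
    stdev_pct = statistics.stdev(values) / mean * 100             # StDev % of mean
    minmax_pct = (max(values) - min(values)) / min(values) * 100  # Min-Max % diff
    return mean, stdev_pct, minmax_pct

# Made-up indexing throughput samples (req/s), for illustration only:
throughputs = [30100, 31200, 29800, 30900, 30500]
mean, sd_pct, mm_pct = summarize(throughputs)
print(f"expected={mean:.0f} req/s, stdev%={sd_pct:.1f}, min-max%={mm_pct:.1f}")
```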

Raw Data

@travisbenedict travisbenedict added enhancement Enhancement or improvement to existing feature or request untriaged labels Mar 14, 2022
@kotwanikunal kotwanikunal added benchmarking Issues related to benchmarking or performance. and removed untriaged labels Mar 15, 2022
BlackMetalz commented Apr 8, 2022

Can you share some steps on how to run these tests, @travisbenedict? I have no idea how to run a benchmark with this:

opensearch-project/opensearch-benchmark#106

travisbenedict (Author) commented

Hey @BlackMetalz

After starting OpenSearch on a node, I ran one of the following OpenSearch Benchmark commands, depending on the cluster type.

Clusters with security plugin disabled:
opensearch-benchmark execute_test --pipeline=benchmark-only --workload=nyc_taxis --target-hosts=<my_endpoint> --telemetry-params=node-stats-sample-interval:60 --telemetry=node-stats --kill-running-processes --workload-repository=private

Clusters with security plugin enabled:
opensearch-benchmark execute_test --pipeline=benchmark-only --workload=nyc_taxis --target-hosts=<my_endpoint> --client-options=basic_auth_user:'admin',basic_auth_password:'admin',verify_certs:false,use_ssl:true --telemetry-params=node-stats-sample-interval:60 --telemetry=node-stats --kill-running-processes --workload-repository=private

In my ~/.benchmark/benchmarks/workloads/private/ directory I had a modified version of the OpenSearch Benchmark Workloads repo, which contained the test_procedure file for nyc_taxis linked above.
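In case it's useful, here is a hypothetical sketch of that setup. The clone URL is the public opensearch-benchmark-workloads repo; the exact file path in the comment is an assumption rather than a confirmed layout:

```sh
# Clone the workloads repo into the "private" workload repository that
# --workload-repository=private points at.
mkdir -p ~/.benchmark/benchmarks/workloads
git clone https://github.com/opensearch-project/opensearch-benchmark-workloads.git \
    ~/.benchmark/benchmarks/workloads/private

# Then edit the nyc_taxis test procedure in place, e.g. (assumed path):
#   ~/.benchmark/benchmarks/workloads/private/nyc_taxis/test_procedures/default.json
```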

I hope this helps. Sorry for not including more details in the first place.

layavadi commented May 2, 2024

Is there a similar set of results for vector search, for reference?

dblock (Member) commented May 3, 2024

This is quite old, so I'm going to close this issue because it isn't calling for anything actionable. Since then, we've built https://opensearch.org/benchmarks, and there's a lot of material there.

@layavadi I don't believe we publish vector numbers there (yet); that's opensearch-project/opensearch-benchmark#103. However, AWS has a very detailed blog post on what vector search performance looks like in production and the tradeoffs involved, which I think can help.

@dblock dblock closed this as completed May 3, 2024