[FEATURE] Add support for large free text workloads #43

kotwanikunal · 2022-04-21T19:21:23Z

Is your feature request related to a problem?

The current workloads do not have a large free text documents which is representative of the real world scenarios
This issue was highlighted in [BUG] Indexing Performance Degraded in OpenSearch 1.3.+ OpenSearch#2916 where there was a big drop in indexing performance which was not uncovered by the recommended workloads during the 1.3.0 launch as well as recently run tests for 1.2.4 and 1.3.0 with the same configuration
We would like to identify such anomalies to further strengthen our coverage and impact analysis for releases and feature additions to the OpenSearch codebase

What solution would you like?

Addition of new customer dataset representative workloads with large free text

What alternatives have you considered?

Running performance tests with multiple existing workloads which have even smaller documents than nyc_taxis

Do you have any additional context?

N/A

The text was updated successfully, but these errors were encountered:

anasalkouz · 2022-04-21T22:30:01Z

@treddeni-amazon this is really important to support. Having such performance degradation issues will decrease our trust of the performance benchmark results.

travisbenedict · 2022-04-22T18:22:52Z

@kotwanikunal would the so workload fit this use case? It has freeform text fields - example doc.

kotwanikunal · 2022-04-25T16:37:41Z

@kotwanikunal would the so workload fit this use case? It has freeform text fields - example doc.

Thanks for the update. I did schedule some tests over the weekend for so and http-logs. We should have the results by tomorrow to see if these tests detect the mentioned performance drop.

kotwanikunal · 2022-04-26T19:19:03Z

The results for so and http-logs on 1.2.4 and 1.3.0 are similar in nature to our original findings. 1.3.0 seems to perform better in general.
We will need an additional workload with larger free text fields which is able to detect the performance drops.

CEHENKLE · 2022-05-09T22:00:57Z

@opensearch-project/benchmark-core Hey folks -- What are your thoughts on this? Is this an improvement we can get added?

IanHoang · 2022-05-15T20:06:44Z

@CEHENKLE @kotwanikunal Thanks for pointing this out. We will generate an additional workload with larger text fields as soon as possible. However, due to a high volume of tasks that we need to attend to, please expect delays.

ankitkala · 2022-07-18T04:05:43Z

@CEHENKLE @kotwanikunal Can you guys also verify whether the regression mentioned above could've been caught with 50% heap instead of the current 1 GB.
I'm asking this since elastic has been completely relying on these datasets for catching any regression. Nothing against adding new workloads but we should also improve our existing test setup.
We can also look for more thorough perf testing with different workloads and cluster configuration.

kotwanikunal added the enhancement New feature or request label Apr 21, 2022

kotwanikunal mentioned this issue Apr 21, 2022

[BUG] Performance tests for 1.3 unable to detect indexing performance degradation opensearch-project/OpenSearch#2985

Closed

ankitkala mentioned this issue Jul 22, 2022

[Discuss] Performance benchmarking improvements for Opensearch opensearch-project/OpenSearch#3983

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE] Add support for large free text workloads #43

[FEATURE] Add support for large free text workloads #43

kotwanikunal commented Apr 21, 2022

anasalkouz commented Apr 21, 2022

travisbenedict commented Apr 22, 2022

kotwanikunal commented Apr 25, 2022

kotwanikunal commented Apr 26, 2022

CEHENKLE commented May 9, 2022

IanHoang commented May 15, 2022

ankitkala commented Jul 18, 2022 •

edited

Loading

[FEATURE] Add support for large free text workloads #43

[FEATURE] Add support for large free text workloads #43

Comments

kotwanikunal commented Apr 21, 2022

anasalkouz commented Apr 21, 2022

travisbenedict commented Apr 22, 2022

kotwanikunal commented Apr 25, 2022

kotwanikunal commented Apr 26, 2022

CEHENKLE commented May 9, 2022

IanHoang commented May 15, 2022

ankitkala commented Jul 18, 2022 • edited Loading

ankitkala commented Jul 18, 2022 •

edited

Loading