-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature: Default spark-infrastructure Helm charts to V2 and ensure migration #300
Comments
Running into issues with the current implementation:
|
While testing I found a bug in our configuration store implementation. The PVC is supposed to be read only but it is miss-configured. Will add the resolution to DoD |
Currently running into an issue with the spark-event PVC that is being attached to the container. It is giving permissions errors when writing to the path |
Resolved the spark-event pvc error by updated the node host path. Working on testing the migration steps and scripts |
Default all new applications to use spark-infrastructure to the V2 charts. Mark any V1 users with a depreciation warning. Provide migration instructions for going from V1 -> V2. Update spark-events to be saved to a more dynamic PVC instead of a S3 bucket. Signed-off-by: Peter McClonski <mcclonski_peter@bah.com> #300 Resolve several issues with the V2 charts Correct the Spark-event PV host node mount point. Update the spark configmap name to be in line with our documentaion (-conf -> -config). Corrected a bug in the Configuration-store PVC being writable instead of read only.
Default all new applications to use spark-infrastructure to the V2 charts. Mark any V1 users with a depreciation warning. Provide migration instructions for going from V1 -> V2. Update spark-events to be saved to a more dynamic PVC instead of a S3 bucket. Signed-off-by: Peter McClonski <mcclonski_peter@bah.com> #300 Resolve several issues with the V2 charts Correct the Spark-event PV host node mount point. Update the spark configmap name to be in line with our documentaion (-conf -> -config). Corrected a bug in the Configuration-store PVC being writable instead of read only.
Default all new applications to use spark-infrastructure to the V2 charts. Mark any V1 users with a depreciation warning. Provide migration instructions for going from V1 -> V2. Update spark-events to be saved to a more dynamic PVC instead of a S3 bucket. Signed-off-by: Peter McClonski <mcclonski_peter@bah.com> #300 Resolve several issues with the V2 charts Correct the Spark-event PV host node mount point. Update the spark configmap name to be in line with our documentaion (-conf -> -config). Corrected a bug in the Configuration-store PVC being writable instead of read only.
Default all new applications to use spark-infrastructure to the V2 charts. Mark any V1 users with a depreciation warning. Provide migration instructions for going from V1 -> V2. Update spark-events to be saved to a more dynamic PVC instead of a S3 bucket. Signed-off-by: Peter McClonski <mcclonski_peter@bah.com> #300 Resolve several issues with the V2 charts Correct the Spark-event PV host node mount point. Update the spark configmap name to be in line with our documentaion (-conf -> -config). Corrected a bug in the Configuration-store PVC being writable instead of read only.
Default all new applications to use spark-infrastructure to the V2 charts. Mark any V1 users with a depreciation warning. Provide migration instructions for going from V1 -> V2. Update spark-events to be saved to a more dynamic PVC instead of a S3 bucket. Signed-off-by: Peter McClonski <mcclonski_peter@bah.com> #300 Resolve several issues with the V2 charts Correct the Spark-event PV host node mount point. Update the spark configmap name to be in line with our documentaion (-conf -> -config). Corrected a bug in the Configuration-store PVC being writable instead of read only.
Default all new applications to use spark-infrastructure to the V2 charts. Mark any V1 users with a depreciation warning. Provide migration instructions for going from V1 -> V2. Update spark-events to be saved to a more dynamic PVC instead of a S3 bucket. Signed-off-by: Peter McClonski <mcclonski_peter@bah.com> #300 Resolve several issues with the V2 charts Correct the Spark-event PV host node mount point. Update the spark configmap name to be in line with our documentaion (-conf -> -config). Corrected a bug in the Configuration-store PVC being writable instead of read only.
all test steps are passing! Test 1: testing the default v2 behavior
Test 2: testing the v1 -> v2 migration
Test 3: testing the Config-store pvc is a read only file system
|
Description
We have a long lived feature branch that needs to get merged in - feature/spark-infrastructure-v2-default. This branch sets the spark-inf helm deployment v2 as the default along with migration steps and other changes. Currently the manual actions call out using aissemble-spark-infrastructure-deploy and not the -v2 profile.
Technical Details:
The feature branch combines hive-metastore-db and hive-metastore-service into one chart. It then adds this one chart into the aissemble-spark-inf chart.
Also the thrift-service is bundled into the aissemble-spark-inf chart.
DOD
Test Strategy/Script
Test 1: testing the default v2 behavior
test-project-pipeline-models/src/main/resources/pipelines/
test-project-pipeline-models/src/main/resources/records/
test-project-pipeline-models/src/main/resources/dictionaries/
mvn clean install
repeatedly, resolving all presented manual actions until none remain.6.1. The profile
aissemble-spark-infrastructure-deploy-v2
is intest-project-deploy/pom.xml
mvn clean install
tilt up
kubectl exec -it <DATA_ACCESS_POD_NAME> -- bash
curl -X POST localhost:8080/graphql -H "Content-Type: application/json" -d '{ "query": "{ CustomRecord(table: \"my_new_table\") { customField } }" }'
and ensure that data including two records is returned, ie:{"data":{"CustomRecord":[{"customField":null},{"customField":null}]}}
Test 2: testing the v1 -> v2 migration
4.1. Update the pyproject.toml to include the snapshot repo
mvn clean install
tilt up
kubectl exec -it <DATA_ACCESS_POD_NAME> -- bash
curl -X POST localhost:8080/graphql -H "Content-Type: application/json" -d '{ "query": "{ CustomRecord(table: \"my_new_table\") { customField } }" }'
and ensure that data including two records is returned, ie:{"data":{"CustomRecord":[{"customField":null},{"customField":null}]}}
Test 3: testing the Config-store pvc is a read only file system
2.1. If you are on a WSL instance and do not have the project on a path the Rancher instance can reach then save the src file somewhere on your C drive and make note of the path
2.2. 300-helper.zip
touch /configurations/test.txt
References/Additional Context
N/A
The text was updated successfully, but these errors were encountered: