-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Split imports on preprocess #3389
Conversation
backend/src/main/java/com/bakdata/conquery/models/jobs/ImportJob.java
Outdated
Show resolved
Hide resolved
backend/src/main/java/com/bakdata/conquery/models/preproc/Preprocessed.java
Outdated
Show resolved
Hide resolved
backend/src/main/java/com/bakdata/conquery/models/jobs/ImportJob.java
Outdated
Show resolved
Hide resolved
extract methods for PreprocessorCommand#run
f1964db
to
f38d321
Compare
|
||
private final Map<String, Integer> starts; | ||
private final Map<String, Integer> lengths; | ||
// TODO make sure that everyone respects this is an end not a length |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
noch offen?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
maybe
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lol, war an mich selber gerichtet, habe es aber noch nicht angeschaut.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hier ist noch kein outdated dranne
backend/src/test/java/com/bakdata/conquery/util/support/StandaloneSupport.java
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ich finde das mit dem ConsistentHash cool, aber muss es nochmal durch denken.
backend/src/main/java/com/bakdata/conquery/commands/PreprocessorCommand.java
Show resolved
Hide resolved
backend/src/main/java/com/bakdata/conquery/commands/PreprocessorCommand.java
Show resolved
Hide resolved
preprocessJobs(jobs, buckets, config); | ||
|
||
|
||
log.info("Successfully Preprocess {} Jobs:", success.size()); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
log.info("Successfully Preprocess {} Jobs:", success.size()); | |
log.info("Successfully preprocessed {} jobs:", success.size()); | |
entity2Bucket.add(entity, bucket); | ||
|
||
return bucket; | ||
public void assignEntityBucket(String entity, int bucket) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
public void assignEntityBucket(String entity, int bucket) { | |
public void assignEntityToBucket(String entity, int bucket) { | |
backend/src/main/java/com/bakdata/conquery/io/storage/xodus/stores/CachedStore.java
Show resolved
Hide resolved
backend/src/main/java/com/bakdata/conquery/models/jobs/ImportJob.java
Outdated
Show resolved
Hide resolved
backend/src/main/java/com/bakdata/conquery/models/jobs/ImportJob.java
Outdated
Show resolved
Hide resolved
backend/src/main/java/com/bakdata/conquery/models/preproc/PreprocessedHeader.java
Outdated
Show resolved
Hide resolved
backend/src/test/java/com/bakdata/conquery/integration/tests/RestartTest.java
Show resolved
Hide resolved
backend/src/main/java/com/bakdata/conquery/models/preproc/Preprocessed.java
Show resolved
Hide resolved
backend/src/main/java/com/bakdata/conquery/models/preproc/PreprocessedReader.java
Show resolved
Hide resolved
backend/src/main/java/com/bakdata/conquery/commands/PreprocessorCommand.java
Show resolved
Hide resolved
backend/src/main/java/com/bakdata/conquery/io/storage/xodus/stores/CachedStore.java
Show resolved
Hide resolved
@@ -31,17 +31,15 @@ class ClusterImportHandler implements ImportHandler { | |||
@SneakyThrows | |||
@Override | |||
public void updateImport(Namespace namespace, InputStream inputStream) { | |||
ImportJob job = ImportJob.createOrUpdate( | |||
final Table table = ImportJob.createOrUpdate( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ja vorher war es besser, hat aber alles auf einmal im RAM gehalten, das will ich ja mit dem PR vermeiden. Aber der Name ist tatsächlich nicht mehr passend.
backend/src/main/java/com/bakdata/conquery/models/events/Bucket.java
Outdated
Show resolved
Hide resolved
backend/src/main/java/com/bakdata/conquery/models/preproc/Preprocessed.java
Show resolved
Hide resolved
/** | ||
* Handle validity and update logic. | ||
*/ | ||
public static Table createOrUpdate(DistributedNamespace namespace, boolean update, PreprocessedHeader header) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Vielleicht nochmal splitten, der Kommentar passt besser als der Funktionsname
backend/src/main/java/com/bakdata/conquery/models/preproc/Preprocessed.java
Show resolved
Hide resolved
backend/src/main/java/com/bakdata/conquery/mode/cluster/ClusterImportHandler.java
Outdated
Show resolved
Hide resolved
# Conflicts: # backend/src/main/java/com/bakdata/conquery/mode/cluster/ClusterImportHandler.java # backend/src/main/java/com/bakdata/conquery/models/jobs/ImportJob.java # backend/src/main/java/com/bakdata/conquery/models/preproc/PreprocessedHeader.java
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Bitte ausgiebig testen mit großen daten vor dem Mergen
|
||
private final Map<String, Integer> starts; | ||
private final Map<String, Integer> lengths; | ||
// TODO make sure that everyone respects this is an end not a length |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
maybe
220f5ab
to
92c3561
Compare
…terfere with submission
backend/src/main/java/com/bakdata/conquery/io/storage/StoreMappings.java
Outdated
Show resolved
Hide resolved
/** | ||
* select, then send buckets. | ||
*/ | ||
public static WorkerId sendBucket(Bucket bucket, WorkerInformation responsibleWorker) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
public static WorkerId sendBucket(Bucket bucket, WorkerInformation responsibleWorker) { | |
private static WorkerId sendBucket(Bucket bucket, WorkerInformation responsibleWorker) { |
backend/src/main/java/com/bakdata/conquery/mode/cluster/ClusterImportHandler.java
Outdated
Show resolved
Hide resolved
Bitte noch mal über die offenen Konversationen schauen |
# Conflicts: # backend/src/main/java/com/bakdata/conquery/models/jobs/CalculateCBlocksJob.java
…itions with high parallelism.
…ort process that loads concepts last, batching CalculateCBlocksJobs
# Conflicts: # backend/src/main/java/com/bakdata/conquery/models/datasets/concepts/tree/ConceptTreeCache.java
No description provided.