-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Upstream 2023 03 01 1 #271
Commits on Feb 8, 2023
-
[query] IBD implemented in terms of block matrices (hail-is#12629)
* [query] IBD implemented in terms of block matrices * cleanup * fix * get debugging info * more debugging * fix temp file location * address comments * minor fix * fix? * checkpoints
Configuration menu - View commit details
-
Copy full SHA for eb0f48b - Browse repository at this point
Copy the full SHA eb0f48bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 186714b - Browse repository at this point
Copy the full SHA 186714bView commit details
Commits on Feb 9, 2023
-
Configuration menu - View commit details
-
Copy full SHA for b666975 - Browse repository at this point
Copy the full SHA b666975View commit details
Commits on Feb 10, 2023
-
[query] Construct MatrixTable from Constituent Parts (hail-is#12659)
Configuration menu - View commit details
-
Copy full SHA for 6c8901f - Browse repository at this point
Copy the full SHA 6c8901fView commit details -
[query][scala-fs] more debugging information when memory fails (hail-…
…is#12678) * [query][scala-fs] more debugging information when memory fails * wording * log off the fast path
Configuration menu - View commit details
-
Copy full SHA for 77f4cb0 - Browse repository at this point
Copy the full SHA 77f4cb0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7d84db0 - Browse repository at this point
Copy the full SHA 7d84db0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 447ea87 - Browse repository at this point
Copy the full SHA 447ea87View commit details -
[batch] Add environment variable for batch id in worker (hail-is#12662)
* [batch] Add environment variable for batch id in worker * address comments?
Configuration menu - View commit details
-
Copy full SHA for fcac149 - Browse repository at this point
Copy the full SHA fcac149View commit details -
Configuration menu - View commit details
-
Copy full SHA for d2476f7 - Browse repository at this point
Copy the full SHA d2476f7View commit details
Commits on Feb 11, 2023
-
[batch] Profile more services not just batch-driver (hail-is#12681)
* [batch] Profile more services not just batch-driver * fixes * only add the googlecloudprofiler logger filter if invoked * fix HAIL_SHOULD_PROFILE * sort * add cloud env variable to auth * fix
Configuration menu - View commit details
-
Copy full SHA for 451fdcc - Browse repository at this point
Copy the full SHA 451fdccView commit details
Commits on Feb 12, 2023
-
[query] fix gsutil cp invocation (hail-is#12686)
`gsutil` is one of the most user hostile tools I have ever used. Here are some examples of why. I think what I now have committed is the only way to achieve the behavior we want without assuming anything about which objects are present at the target. ``` $ gsutil -m cp -r baz gs://danking/baz/ $ gsutil ls gs://danking/baz/ gs://danking/baz/baz/ ``` ``` $ gsutil rm -rf gs://danking/baz $ gsutil -m cp -r baz/ gs://danking/baz/ $ gsutil ls gs://danking/baz/ gs://danking/baz/baz/ ``` ``` $ gsutil rm -rf gs://danking/baz $ gsutil -m cp -r baz gs://danking/baz $ gsutil ls gs://danking/baz/ gs://danking/baz/1 gs://danking/baz/2 ``` ``` $ gsutil rm -rf gs://danking/baz $ touch foo $ gsutil cp foo gs://danking/baz/foo $ gsutil -m cp -r baz gs://danking/baz $ gsutil ls gs://danking/baz gs://danking/baz/foo gs://danking/baz/baz/ ``` ``` $ gsutil rm -rf gs://danking/baz $ gsutil cp foo gs://danking/baz/foo $ gsutil -m cp -r baz/\* gs://danking/baz/ $ gsutil ls gs://danking/baz gs://danking/baz/1 gs://danking/baz/2 gs://danking/baz/foo ```
Configuration menu - View commit details
-
Copy full SHA for 2aba588 - Browse repository at this point
Copy the full SHA 2aba588View commit details
Commits on Feb 13, 2023
-
[query] improve error message when rng_nonce is unparseable (hail-is#…
…12688) * [query] improve error message when rng_nonce is unparseable * Update ExecuteContext.scala
Configuration menu - View commit details
-
Copy full SHA for 52feb78 - Browse repository at this point
Copy the full SHA 52feb78View commit details -
[query] MatrixBlockMatrixWriter can be lowered (hail-is#12695)
Efficiency can wait.
Configuration menu - View commit details
-
Copy full SHA for 9c0854e - Browse repository at this point
Copy the full SHA 9c0854eView commit details
Commits on Feb 14, 2023
-
[compiler] Refactor compiled functions to take a HailTaskContext inst… (
hail-is#12597) * [compiler] Refactor compiled functions to take a HailTaskContext instead of partitionIndex Partition index is now unnecessary due to the completion of the randomness redesign. HailTaskContext will be used in a subsequent PR to add task-level cleanup to permit aggressive caching in generated code. * fixes * oops writeIRs * fix combine * fix other issues in combine * fix * fix combop nonsense * bleh * fix * fix * bump
Configuration menu - View commit details
-
Copy full SHA for 0b4218d - Browse repository at this point
Copy the full SHA 0b4218dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8b12ee8 - Browse repository at this point
Copy the full SHA 8b12ee8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 46aba89 - Browse repository at this point
Copy the full SHA 46aba89View commit details -
Configuration menu - View commit details
-
Copy full SHA for df3d682 - Browse repository at this point
Copy the full SHA df3d682View commit details -
[batch] Mitigate too many resources with same prices (hail-is#12654)
* [batch] Mitigate too many resources with same prices * add equality chcks * fix
Configuration menu - View commit details
-
Copy full SHA for d03121f - Browse repository at this point
Copy the full SHA d03121fView commit details
Commits on Feb 15, 2023
-
[query] Make caching in PartitionNativeIntervalReader more aggressive (…
…hail-is#12600) * [query] Make caching in PartitionNativeIntervalReader more aggressive Add finalizers to HailTaskContext to clean up open indices. * remove log * comment
Configuration menu - View commit details
-
Copy full SHA for 2fbf68b - Browse repository at this point
Copy the full SHA 2fbf68bView commit details -
[query] Add RouterFS to scala to permit local file system IO in QoB (h…
…ail-is#12667) * [query] Add RouterFS to scala to permit local file system IO in QoB * erasure * scala is stupid * override open/create cached * fix cast
Configuration menu - View commit details
-
Copy full SHA for 0f85414 - Browse repository at this point
Copy the full SHA 0f85414View commit details -
[compiler] Iterative
DistinctlyKeyed
Analysis (hail-is#12696)* [compiler] Iterative `DistinctlyKeyed` Analysis Use iterative tree traversals to prevent exceeding stack size for large IRs. * traverse all ir nodes
Configuration menu - View commit details
-
Copy full SHA for 1fe6f2a - Browse repository at this point
Copy the full SHA 1fe6f2aView commit details
Commits on Feb 16, 2023
-
[lowering] Rewrite maximal independent set to be its own value IR (ha…
…il-is#12295) * [lowering] Rewrite maximal independent set to be its own IR And use this new IR to enable lowered execution of maximal_independent_set * fix * updates * Update Copy.scala * Update __init__.py * Requireness.scala rule * fix bindings * fix * remove bad error * fix requireness * custom element binding for tiebreaker arguments * Add Array[Long] to hail value conversion * Make maximalIndependentSet return an IndexedSeq Functions.unwrapReturn expects scala IndexedSeq not java Array * fix GraphSuite * Add (bad) unwrap rule for arrays of structs * xfails * better uwrapReturn for arrays of structs * fix bad merge * correct Requiredness rules * ChildEnvWithoutBindings rule --------- Co-authored-by: Dan King <daniel.zidan.king@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 235d2bc - Browse repository at this point
Copy the full SHA 235d2bcView commit details -
Bump com.github.samtools:htsjdk from 3.0.2 to 3.0.4 in /hail (hail-is…
…#12700) Bumps [com.github.samtools:htsjdk](https://github.com/samtools/htsjdk) from 3.0.2 to 3.0.4. - [Release notes](https://github.com/samtools/htsjdk/releases) - [Commits](samtools/htsjdk@3.0.2...3.0.4) --- updated-dependencies: - dependency-name: com.github.samtools:htsjdk dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 76e92ac - Browse repository at this point
Copy the full SHA 76e92acView commit details -
[query] add
hl.utils.genomic_range_table
(hail-is#12679)* [query] add `hl.utils.genomic_range_table` CHANGELOG: In Query on Batch, `hl.balding_nichols_model` is slightly faster. Also added `hl.utils.genomic_range_table` to quickly create a table keyed by locus. It has grated on me for a while that `hl.balding_nichols_models` requires a whole pass to verify it is sorted even though it is plainly so. This change introduces the necessary infrastructure to convince Hail of that fact. * pylint * missing import * fix tests * add to __init__.py * better genomic range test tables
Configuration menu - View commit details
-
Copy full SHA for 68b89a3 - Browse repository at this point
Copy the full SHA 68b89a3View commit details -
Bump werkzeug from 2.2.2 to 2.2.3 in /docker (hail-is#12703)
Bumps [werkzeug](https://github.com/pallets/werkzeug) from 2.2.2 to 2.2.3. - [Release notes](https://github.com/pallets/werkzeug/releases) - [Changelog](https://github.com/pallets/werkzeug/blob/main/CHANGES.rst) - [Commits](pallets/werkzeug@2.2.2...2.2.3) --- updated-dependencies: - dependency-name: werkzeug dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 2490ed1 - Browse repository at this point
Copy the full SHA 2490ed1View commit details -
Configuration menu - View commit details
-
Copy full SHA for ae3d0d5 - Browse repository at this point
Copy the full SHA ae3d0d5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6056c05 - Browse repository at this point
Copy the full SHA 6056c05View commit details -
Bump de.undercouch.download from 5.3.0 to 5.3.1 in /hail (hail-is#12707)
Bumps [de.undercouch.download](https://github.com/michel-kraemer/gradle-download-task) from 5.3.0 to 5.3.1. - [Release notes](https://github.com/michel-kraemer/gradle-download-task/releases) - [Commits](michel-kraemer/gradle-download-task@5.3.0...5.3.1) --- updated-dependencies: - dependency-name: de.undercouch.download dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 17e5568 - Browse repository at this point
Copy the full SHA 17e5568View commit details
Commits on Feb 17, 2023
-
[query] Upgrade spark to 3.3.0 and dataproc to 2.1 (hail-is#12701)
* [query] Upgrade spark to 3.3.0 and dataproc to 2.1 * lint * lint * update zstd-jni * trust pyspark on patch versions * create specific exceptions to appease pylint
Configuration menu - View commit details
-
Copy full SHA for 75f351d - Browse repository at this point
Copy the full SHA 75f351dView commit details -
[query] lower logistic SKAT (hail-is#12643)
* [query] lower logistic SKAT CHANGELOG: `hl.skat(..., logistic=True)` now supported in the Batch backend. * minor doc fix * remove wrong sentence from docs * covariates are required now * support max_iter = 0 in logreg_fit * fix syntax * wip docs * docs iwp * final take on the math * docs fixes
Configuration menu - View commit details
-
Copy full SHA for 9c5851c - Browse repository at this point
Copy the full SHA 9c5851cView commit details -
[compiler] refactor BlockMatrix lowering to generate small IR (hail-i…
…s#12624) * wip * lower BlockMatrixNativeReader * lower BlockMatrixBinaryReader * lower BlockMatrixRandom * refactor BMS2, lower BlockMatrixWrite * lower GetElement and BlockMatrixMap * make variable names private * lower map2 * lower first broadcast cases * lower diagonal * lower remaining broadcasts * runtime bm sparsity abstractions * take BMType off BMSContexts * fixes * try different assertion * fixes * fix diagonal and xfails * fix requiredness test * fix transpose and enable test * don't use ExecuteContext.scoped in parser * lower BlockMatrixFilter * add withSparsity short-circuit * refactor letBindings and broadcasts * fix service xfails * rename EvalContext -> IRBuilder * move IRBuilder
Configuration menu - View commit details
-
Copy full SHA for 87997a7 - Browse repository at this point
Copy the full SHA 87997a7View commit details -
[batch] Fix adding new resources with same rates (hail-is#12698)
* [batch] More debugging info for resource rate changes * fix lint * fix actual bug * delint --------- Co-authored-by: Dan King <daniel.zidan.king@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 12b0942 - Browse repository at this point
Copy the full SHA 12b0942View commit details -
[batch] Mitigate test failures by extending batch client timeout (hai…
…l-is#12709) * [batch] Mitigate test failures by extending batch client timeout * fix
Configuration menu - View commit details
-
Copy full SHA for dbfd553 - Browse repository at this point
Copy the full SHA dbfd553View commit details -
[query] fix NPE introduced in hail-is#12555 (hail-is#12711)
Not all IOExceptions have a cause.
Configuration menu - View commit details
-
Copy full SHA for 5f7e4be - Browse repository at this point
Copy the full SHA 5f7e4beView commit details -
[batch] Support job logs that are not UTF-8 compatible (hail-is#12666)
* [batch] Support job logs that are not UTF-8 compatible * redundant * add test for non-utf-8 log * fix * fix test * linting * address comments * lint
Configuration menu - View commit details
-
Copy full SHA for f7b3f10 - Browse repository at this point
Copy the full SHA f7b3f10View commit details -
[query] improve the memory client (hail-is#12691)
* [query] improve the memory client Currently, the memory client buffers the entire output in memory which is likely to cause OOMs. For reasons that are not entirely clear to me, sometimes these OOMs get muffled by our system and instead lead to non-termination. I vaguely remember this happening before with `using`. I suspect there is something somewhat subtle wrong with that method, but I am not certain. Anyway, there are four big changes here: 1. Do not buffer the entire request body in memory when writing to memory. 2. Because of (1) we have to pull retry behavior all the way up to the top-level where we know how to recreate the body. 3. Because of (2) it is easier to provide a `write(url)(writerFunction)` style API, which I do here. 4. Again, because of (2), and because I want to preserve the file-object-like interface, I added a somewhat funky anonymous class which uses a second thread to facilitate the movement of data written into the OutputStream returned by `create` into the OutputStream of the HTTP connection. Point (4) probably bears more explanation. The root issue is the bad Apache HTTP Client interface. Instead of `request` returning an OutputStream, it takes an "entity". An entity knows how to write itself into the OutputStream of an HTTP request. This works fine if the "writer" code is pased as a function (as in my new `write` method), but that does not work if the control flow looks like: f = create(...) f.write(...) f.close() We avoid this limited API by initiating the request in a second thread which will eventually block waiting to receive data from a PipedInputStream. That PipedInputStream produces the data written to a PipedOutputStream. The `create` call returns a positioned OutputStream which just writes data into the PipedOutputStream and handles cleaning up the thread when it is closed. In a multi-core system, network requests should proceed in parallel to the client code. In a single-core system, the written data will buffer until `close` is called which will definitely yield control to the other thread. * restore the retry * fix * fix * fix oerride
Configuration menu - View commit details
-
Copy full SHA for abd47fa - Browse repository at this point
Copy the full SHA abd47faView commit details
Commits on Feb 18, 2023
-
Configuration menu - View commit details
-
Copy full SHA for b255123 - Browse repository at this point
Copy the full SHA b255123View commit details -
[query] rework flags and fix QoB flags (hail-is#12423)
Flags now use the same user configuration machinery we use for Batch and QoB. I am not certain this is the right choice. Feedback very welcome. The configuration_of function lets us uniformly treat any configuration by checking, in order: explicit argument, envvar, config file, or a fallback. I added a bit of code to allow us to support the envvars which do not conform to the new envvar scheme. I also removed a few flags that are no longer used. I kind of think these flags should actually be under a new section like "query_compiler" or something. @tpoterba, thoughts?
Configuration menu - View commit details
-
Copy full SHA for 1b2dba1 - Browse repository at this point
Copy the full SHA 1b2dba1View commit details -
[hailtop] Keep strong references to tasks (hail-is#12692)
* [hailtop] Keep strong references to tasks on the event loop * isort * linting
Configuration menu - View commit details
-
Copy full SHA for d867b79 - Browse repository at this point
Copy the full SHA d867b79View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2d65cf5 - Browse repository at this point
Copy the full SHA 2d65cf5View commit details -
[batch] Refactor resource billing checks with additional debugging in…
…fo (hail-is#12713) * [batch] Refactor resource billing checks with additional debugging info * delint
Configuration menu - View commit details
-
Copy full SHA for 537f8f7 - Browse repository at this point
Copy the full SHA 537f8f7View commit details
Commits on Feb 21, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 669a48b - Browse repository at this point
Copy the full SHA 669a48bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 59fbbb9 - Browse repository at this point
Copy the full SHA 59fbbb9View commit details
Commits on Feb 22, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 691d4bd - Browse repository at this point
Copy the full SHA 691d4bdView commit details -
Configuration menu - View commit details
-
Copy full SHA for 60ee155 - Browse repository at this point
Copy the full SHA 60ee155View commit details
Commits on Feb 23, 2023
-
[tests] Use token in remote_tmpdir path for tests (hail-is#12693)
* [tests] Use token in remote_tmpdir path * fix missing curly brace
Configuration menu - View commit details
-
Copy full SHA for 4b92fb6 - Browse repository at this point
Copy the full SHA 4b92fb6View commit details -
[qob] Update ServiceBackend logic to use distributed sort by default …
…and not with a default feature flag (hail-is#12719) * [qob] Don't use local sort to shuffle... * fix python * use empty dict when flags is None --------- Co-authored-by: Daniel Goldstein <danielgold95@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for a49c76a - Browse repository at this point
Copy the full SHA a49c76aView commit details -
[batch] Allow python jobs to have file resources too (hail-is#12726)
* [batch] Allow python jobs to have file resources too * add test
Configuration menu - View commit details
-
Copy full SHA for 37d1cad - Browse repository at this point
Copy the full SHA 37d1cadView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8104a1c - Browse repository at this point
Copy the full SHA 8104a1cView commit details
Commits on Feb 24, 2023
-
[qob] Enable liftovers for Query-on-Batch (hail-is#12699)
* [qob] Enable liftovers for Query-on-Batch * reorganize python side of things * linting * dedup some stuff on the local and spark backends * dedup local spark and py4j backend functionality * linting * address some comments * cache orderings * simplify * fix
Configuration menu - View commit details
-
Copy full SHA for 995994c - Browse repository at this point
Copy the full SHA 995994cView commit details -
[batch] Make driver parameters configurable in the UI (hail-is#12575)
* [batch] Make driver parameters configurable in the UI * delint * address comments * address comment
Configuration menu - View commit details
-
Copy full SHA for 4350aec - Browse repository at this point
Copy the full SHA 4350aecView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7fb8646 - Browse repository at this point
Copy the full SHA 7fb8646View commit details -
[hailtop.utils] Fix async_to_blocking to support keyboard interrupt w…
…ith nest_asyncio (hail-is#12732)
Configuration menu - View commit details
-
Copy full SHA for 60f47ac - Browse repository at this point
Copy the full SHA 60f47acView commit details
Commits on Feb 25, 2023
-
Configuration menu - View commit details
-
Copy full SHA for e7945a8 - Browse repository at this point
Copy the full SHA e7945a8View commit details
Commits on Feb 27, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 3cd6e4b - Browse repository at this point
Copy the full SHA 3cd6e4bView commit details
Commits on Feb 28, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 9e0081c - Browse repository at this point
Copy the full SHA 9e0081cView commit details
Commits on Mar 1, 2023
-
Merge commit '9e0081c' into upstream-2023-03-01_1
This the main merge about the memory service that's debatable, ```scala val (open, write) = if (n <= 50) { (fs.openCachedNoCompression _, fs.writeCached _) } else { ((x: String) => fs.openNoCompression(x), fs.writePDOS _) } ``` Merged to: ```scala val (open, write) = ((x: String) => fs.openNoCompression(x), fs.writePDOS _) ``` Conflicts: batch/batch/driver/instance_collection/pool.py batch/batch/driver/main.py batch/batch/driver/templates/pool.html batch/batch/inst_coll_config.py ci/ci/ci.py hail/src/main/scala/is/hail/backend/service/ServiceBackend.scala hail/src/main/scala/is/hail/backend/service/Worker.scala
Configuration menu - View commit details
-
Copy full SHA for 601daf5 - Browse repository at this point
Copy the full SHA 601daf5View commit details