Require Python imports to be qualified with repo name #7067

brandjon · 2019-01-09T14:56:16Z

For context see #6886, #7051, and discussion thread.

For all Python modules that are defined within the build (not standard library modules, not extra libraries installed non-hermetically on the system), we should require that they be imported using their fully qualified name, which includes the repo's canonical name. For instance, we should not allow importing Python modules defined in bazel packages //third_party/... or @some_repo//third_party/... via the import statement import third_party.[...], precisely because that's ambiguous and collides between same-named packages in different repos.

For cases where the package really should be top-level, like a repo that exposes an existing library like numpy to the build, the py_library can still use the imports attribute to put it on PYTHONPATH.

We need a design doc to cover this change and also how to handle importing repos from runfiles in the face of repo renaming.

The text was updated successfully, but these errors were encountered:

brandjon · 2019-03-18T16:58:47Z

Note that a flag for disabling exactly this behavior has existed for a long time, and #2636 tracks flipping it.

The problem is that we don't have a complete story for how repo-qualified names should work in the face of repo renaming. If your main repo is named @foo and you import it into other workspaces using repo rules with name = "foo", that's fine, everyone can just write import foo... in their Python programs. But if someone uses a repo rule with name = "bar", this means the runfiles symlink for the repository will be called "bar", and "foo" will be unknown on the PYTHONPATH.

A similar situation can exist for transitive dependencies due to repository name remapping (@dkelmer), but we don't currently rename symlinks so it isn't (yet) an issue. This'll probably change in the future because I think the symlinks need to be renamed to resolve diamond dependencies.

One solution might be to allow repos to declare their canonical name for the purpose of Python imports, independently of their workspace or repo name, and create some symlink structure in runfiles to make the canonical name available.

brandjon · 2019-03-19T16:40:18Z

I think it's very possible in practice that people will want the canonical top-level name for their repo's Python libraries to be different from the repo's name in its WORKSPACE file. For instance, rules_python may want to refer to its own Python modules as rules_python.experimental.examples... rather than as io_bazel_rules_python.experimental.examples....

Perhaps a more general solution would be to put neither the runfiles root nor any of the repo roots on PYTHONPATH automatically, but force users to use a mechanism like py_library.imports, possibly extended with the ability to supply arbitrary names for the top-level package.

c-nuro · 2019-07-26T18:00:58Z

Would it be possible to use import hook, or generating some advanced __init__.py which can be more multi-package aware(e.g. setting __path__) and automatically merge modules?

brandjon · 2019-07-26T18:42:28Z

Yes, that's my current thinking. Take for instance #7091, where Python will unconditionally prepend the directory containing the main script (after chasing symlinks!), causing unwanted visibility of source files that happen to be in the same dir as the main script. I think the best way to control that would involve injecting __init__.py logic into the build, whereas currently all our manipulation of the package path occurs in the stub script, before the user Python process is launched.

If we're adding __init__.py logic to manipulate the path anyway, we may as well also customize how module loading is done in other ways, to address this kind of issue. Instead of just adding things from the runfiles dir to PYTHONPATH, we can allow py_library and py_binary to have more control over what exact name an import is visible as, whether it overrides a standard library module, etc.

sgowroji · 2023-02-16T10:02:33Z

Hi there! We're doing a clean up of old issues and will be closing this one. Please reopen if you’d like to discuss anything further. We’ll respond as soon as we have the bandwidth/resources to do so.

brandjon added P2 We'll consider working on this in future. (Assignee optional) type: process team-Rules-Python Native rules for Python labels Jan 9, 2019

brandjon mentioned this issue Jan 9, 2019

py_runtime in third_party package conflicts with third_party.py.gflags import in Android rules #6886

Closed

brandjon mentioned this issue Mar 13, 2019

Python rules are not hermetic #890

Closed

This was referenced Mar 18, 2019

Flip the value of --experimental_python_import_all_repositories to false and remove the flag #2636

Open

Can't use the same name for workspace and top-level python package #6897

Closed

brandjon mentioned this issue Mar 19, 2019

Add experimental support for building wheels. bazelbuild/rules_python#159

Merged

This was referenced Apr 1, 2019

bazel run py_binary loads incorrect module with same name #3925

Closed

Python Namespace not handling correctly #6844

Closed

aaliddell mentioned this issue May 2, 2019

Inconsistent import behaviour between py_binary and par_binary google/subpar#97

Open

segiddins mentioned this issue Sep 5, 2020

Unable to point to git repo as drop-in bazel replacement for release google/xctestrunner#19

Closed

lberki added P4 This is either out of scope or we don't have bandwidth to review a PR. (No assignee) and removed P2 We'll consider working on this in future. (Assignee optional) labels Nov 18, 2020

tpudlik mentioned this issue Aug 30, 2022

py_binary rules default to importing non-sandboxed code #7091

Open

sgowroji added the stale Issues or PRs that are stale (no activity for 30 days) label Feb 16, 2023

sgowroji closed this as not planned Won't fix, can't repro, duplicate, stale Feb 16, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Require Python imports to be qualified with repo name #7067

Require Python imports to be qualified with repo name #7067

brandjon commented Jan 9, 2019

brandjon commented Mar 18, 2019

brandjon commented Mar 19, 2019

c-nuro commented Jul 26, 2019

brandjon commented Jul 26, 2019

sgowroji commented Feb 16, 2023

Require Python imports to be qualified with repo name #7067

Require Python imports to be qualified with repo name #7067

Comments

brandjon commented Jan 9, 2019

brandjon commented Mar 18, 2019

brandjon commented Mar 19, 2019

c-nuro commented Jul 26, 2019

brandjon commented Jul 26, 2019

sgowroji commented Feb 16, 2023