Skip to content
This repository has been archived by the owner on Nov 1, 2021. It is now read-only.

Implement input_method_v2 popups #2550

Merged
merged 2 commits into from
Sep 22, 2021

Conversation

tadeokondrak
Copy link
Contributor

No description provided.

@tadeokondrak
Copy link
Contributor Author

tadeokondrak commented Dec 16, 2020

Questions about the protocol (@dcz-purism?):

  • There's no protocol error for when the surface already has a role.

  • Some clients (for me, Alacritty) don't support displaying preedit text. I think on X11, the IME knows whether or not to draw the preedit in a popup window. Should a way to express this be added to text-input and input-method?

  • What's the purpose of having multiple input popups? How should they be positioned?

  • This one is very minor, but should IMEs have a way to control positioning of the popup in any way? In many screenshots I see online, there is a left margin and then the word itself is aligned to the text input rectangle, example

@dcz-purism
Copy link
Contributor

dcz-purism commented Dec 20, 2020

Thanks for the contribution! Is there a client to test this with?

There's no protocol error for when the surface already has a role.

What would be the benefit? Also, there's no defined concept of "roles", so one would have to be introduced.

EDIT: now I think I see what you mean: preedit surface is wrapping a wl_surface. That should probably be an error indeed.

Should a way to express [lack of preedit] be added to text-input and input-method?

Thanks for noticing this. I think my assumption was "if it doesn't report surrounding text, it doesn't support preedit", but those are somewhat independent, especially when the input method can spawn a popup. Perhaps that could be a bit field on the enable request. Or another state-changing request. Best submitted here: https://gitlab.freedesktop.org/wayland/wayland-protocols/-/merge_requests?label_name%5B%5D=text-input .

What's the purpose of having multiple input popups? How should they be positioned?

It was easier to design the protocol this way. It's up to the compositor.

This one is very minor, but should IMEs have a way to control positioning of the popup in any way?

This might be a good idea. I didn't try to tackle this problem because I have little experience with using popups.

@tadeokondrak
Copy link
Contributor Author

tadeokondrak commented Dec 20, 2020

I've been implementing this for https://github.com/tadeokondrak/anthywl.
Fcitx5 also has support, but I've been having issues getting it to work consistently.

I haven't gotten to the point of implementing this correctly in Sway, but I'm wondering if the protocol is missing a method similar to xdg-shell for responding to configure requests.

If a popup is extending past the edges of the parent surface, moving it close to the edge of the output should make it extend into the window instead. This means the text input rectangle changes, and the popup needs to be redrawn with the new one. The server doesn't know which surface commit actually does that, so it can't wait for the popup to redraw before moving the window.
Popups should probably stay where they were first created

Also, having to put a buffer on the screen to get the text input rectangle at all seems like it might lead to flickering where the first frame is drawn without any knowledge of it, then the rest are correctly drawn.

@dcz-purism
Copy link
Contributor

Those are all good points. When discussing the new version of input-method (currently not progressing due to lack of reviewers and me being focused on other things), the conclusion was not to implement the popup due to unforeseen bugs. Turns out you're finding them as you implement it.

As far as I know, no one else implements popups, so if you want to fix the protocol by introducing a new version, there would be no one to hold you up.

@emersion
Copy link
Member

Side note: I wonder if we should add input-method-v2 to wlr-protocols, because it's not hosted anywhere ATM.

@tadeokondrak tadeokondrak marked this pull request as draft December 30, 2020 20:22
@tadeokondrak tadeokondrak force-pushed the input-method-popups branch 4 times, most recently from 4f78d2d to 49583eb Compare January 25, 2021 23:58
@tadeokondrak
Copy link
Contributor Author

tadeokondrak commented Jan 26, 2021

While implementing text-input's set_cursor_rectangle request in foot, I realized that it really is a cursor rectangle (i.e. it's usually 0px wide) and not a rectangle of the text input itself.

However, input-method describes it differently:

    <event name="text_input_rectangle">
      <description summary="set text input area position">
        Notify about the position of the area of the text input expressed as a
        rectangle in surface local coordinates.

        This is a hint to the input method telling it the relative position of
        the text being entered.
      </description>
      <arg name="x" type="int"/>
      <arg name="y" type="int"/>
      <arg name="width" type="int"/>
      <arg name="height" type="int"/>
    </event>

Is this intentional?


I think multiple popups add considerable implementation complexity while having no guaranteed behaviour and no use case as far as I can tell. I think it's worth disallowing them.


I think the text_input_rectangle request isn't very useful without control over positioning of the popup.
Here are some screenshots of IME UIs that would need positioning control to be implemented:

I tried to come up with a protocol update making it possible:

    <!-- in zwp_input_popup_surface_v2 -->

    <enum name="anchor" bitfield="true" since="2">
      <entry name="top" value="1"/>
      <entry name="bottom" value="2"/>
      <entry name="left" value="4"/>
      <entry name="right" value="8"/>
    </enum>

    <event name="configure" since="2">
      <arg name="cursor_width" type="uint" summary="width of the cursor"/>
      <arg name="cursor_height" type="uint" summary="height of the cursor"/>
      <arg name="up_available" type="uint"
        summary="space available above of the cursor"/>
      <arg name="down_available" type="uint"
        summary="space available below of the cursor"/>
      <arg name="left_available" type="uint"
        summary="space available to the left of the cursor"/>
      <arg name="right_available" type="uint"
        summary="space available to the right of the cursor"/>
      <arg name="serial" type="uint" summary="serial of the configure event"/>
    </event>

    <event name="ack_configure" since="2">
      <arg name="anchor" type="uint" enum="anchor"/>
      <arg name="margin_x" type="int"
        summary="horizontal distance from the anchor"/>
      <arg name="margin_y" type="int"
        summary="vertical distance from the anchor"/>
      <arg name="serial" type="uint"
        summary="the serial from the configure event"/>
    </event>

This also makes it possible for IMEs to receive the cursor rectangle before actually rendering anything, preventing flicker if the IME decides to do anything with it.

configure would be sent by the compositor with the size of the cursor and the distance to the edges of the current output.
ack_configure would be sent by the client with the desired anchors and margins.

A typical case without any shortage of space where the IME wants 50px of margin before the text:
anchor=bottom|left margin_x=-50

If the window is approaching the right edge of the monitor:
anchor=bottom|right margin_x=50

If the window is approaching the bottom edge of the monitor:
anchor=top|left margin_x=-50

Feedback on this would be appreciated.

@dcz-purism
Copy link
Contributor

Is this intentional?

At least to some extent, as in I think I made it vague because I didn't have an opinion other than "allows the compositor to choose a placement of the pop up such as what's being edited is not obscured". I think it's reasonable at least to cover the entire preedit text within the rectangle. I actually never tested what GTK returns here, but I know the rectangle is being set.

I think it's worth disallowing them.

If you decide to add such a reuirement to a newer revision of the protocol, I will not have any arguments to oppose it ;)

How much positioning of the pop up is needed? The Wayland assumption is that surfaces don't know how they are positioned in terms of absolute coordinates, and I tend to agree with that. Also, I think while pop ups are less useful without control over positioning, they don't become useless at all. That'd be good as a first step.

I do like the idea with the anchor, although I would rather have the client provide the preferences together with the surface, and let the compositor figure out everything. With this proposal, the application is allowed to set constrains which are impossible to meet. Then the request turns into a suggestion, so the compositor must have positioning logic too. If the compositor must have positioning logic, we might as well feed it the general rule and let it handle the entirety of positioning.
Plus, it would result in lots of chatter when windows or display borders are being repositioned.

@tadeokondrak
Copy link
Contributor Author

tadeokondrak commented Jan 26, 2021

Thanks for the reply!

At least to some extent, as in I think I made it vague

From what I can tell the correct behaviour is:

  • If cursor_start == cursor_end, then the rectangle is just the cursor bar
  • If the cursor is a range, then the rectangle is of the cursor range, this is important for selecting candidates with multiple words.
  • If the cursor is hidden, I'm not sure what the rectangle should be. GTK treats it the same as the first case, including not hiding the cursor at all.

Knowing what the rectangle actually means is important if the IME wants to do anything with it, so I think the protocol should clarify this.

If you decide to add such a reuirement to a newer revision of the protocol, I will not have any arguments to oppose it ;)

Didn't realize it when I was writing my last comment, but they do make sense if positioning is added. I'll add back support to the PRs.

How much positioning of the pop up is needed? The Wayland assumption is that surfaces don't know how they are positioned in terms of absolute coordinates, and I tend to agree with that.

Yeah, I agree with the Wayland avoidance of absolute coordinates, but I don't think my anchor protocol idea is in conflict with that. More on it below.

My protocol idea allows the surface to have an arrow pointing to the cursor area, like the last screenshot in my last comment, without the arrow ever becoming out of place. I don't see any reason you'd need more positioning, but of course less positioning is enough at the expense of aesthetics.

Also, I think while pop ups are less useful without control over positioning, they don't become useless at all. That'd be good as a first step.

Agreed, I think positioning can be part of another PR.

I do like the idea with the anchor, although I would rather have the client provide the preferences together with the surface, and let the compositor figure out everything. With this proposal, the application is allowed to set constrains which are impossible to meet. Then the request turns into a suggestion, so the compositor must have positioning logic too. If the compositor must have positioning logic, we might as well feed it the general rule and let it handle the entirety of positioning.

I don't think there are any constraints that are impossible to meet. The popup trailing off the screen is completely okay, it's just something that should be avoided.

And if the compositor doesn't have the concept of output borders, it can just provide UINT_MAX to all the "available" parameters.

The client providing the preferences itself seems difficult because for the compositor to know how to position it, it needs to know its width, height, and "hotspot" (where the arrow would be pointing to, for example). But if the compositor decides to position it above instead of below for example, the arrow would switch to the other side, requiring back-and-forth with the client. Giving the client available space lets it know where the "hotspot" would be before actually needing to draw anything.

Plus, it would result in lots of chatter when windows or display borders are being repositioned.

I realized that popups shouldn't actually be repositioned to fit on the screen, normal xdg_popups don't do this either. The popup would be repositioned the next time it would normally be updated.

@dcz-purism
Copy link
Contributor

From what I can tell the correct behaviour is:

Do you mean "observed" behaviour? Or is that what you think would be optimal?

Knowing what the rectangle actually means is important if the IME wants to do anything with it

What could an IME want to do with it? I can see the fancy arrow-thing pointing to the cursor, but it doesn't need to be so accurate. To me in the beginning it was about giving the compositor enough info to make sure the text is not obscured. What are the other potential use cases?

I don't think there are any constraints that are impossible to meet. The popup trailing off the screen is completely okay, it's just something that should be avoided.

Ah, that's where we had different plans. The popup trailing off the screen was something I intended to never happen as long as the compositor can help it. This design gives the application the ability to push the pop up off the visible area.

To support my view of things, the xdg pop up which appears on right-click never appears off the screen, because (I presume, didn't actually check) the compositor takes control of its exact position.

But if the compositor decides to position it above instead of below for example, the arrow would switch to the other side, requiring back-and-forth with the client. Giving the client available space lets it know where the "hotspot" would be before actually needing to draw anything.

This is a good point. There is a way for Gtk to display such pop ups, I think it uses subsurfaces. They also display a fancy arrow pointing approximately to the click site. Might be wirth it to work out how exactly this happens. That would make this interface more aligned with what's already established, and help choose between our competing views.

@tadeokondrak
Copy link
Contributor Author

tadeokondrak commented Jan 26, 2021

Do you mean "observed" behaviour? Or is that what you think would be optimal?

A bit of both, I needed to find out what behaviour is best when actually implementing it, there's some discussion here of me narrowing it down. It's important for text-input clients to send it correctly becuase GNOME actually uses the arrow pointing to the cursor on its overlay.

What could an IME want to do with it?

Well, why is the text_input_rectangle event in input-method if the IME doesn't have any use for it?

A basic use I can think of is knowing whether or not the text is above or below the popup, to position extra information farther away from the cursor rather than moving the main content.
A more advanced use is the arrow pointing to the cursor.

Ah, that's where we had different plans. The popup trailing off the screen was something I intended to never happen as long as the compositor can help it.

Sure, I agree. By "completely okay" I mean that if the IME willingly positions the popup so part of it is off screen, what's the problem with it? There's no reason it would want to do that.

To support my view of things, the xdg pop up which appears on right-click never appears off the screen, because (I presume, didn't actually check) the compositor takes control of its exact position.

Well, if there is no space on the screen for the popup to fit in any direction, it goes off screen.

[I hit submit before I finished my post. Last part in a separate comment to follow]

@dcz-purism
Copy link
Contributor

What could an IME want to do with it?

I ask this question to arrive at the level of specificity we actually care about. So far the most specific we have are examples of caring about the general area, and that's easy to satisfy while giving the implementors freedom to do what's easiest to them.

There's no reason it would want to do that.

Okay, but no one wants the right click menu off the screen either. Yet the protocol designers decided to take this ability away anyway. What was their reasoning? Does it apply here? If we can't answer that, then do we have good reasons to believe that we aren't missing anything important when we make a different decision?

@tadeokondrak
Copy link
Contributor Author

I ask this question to arrive at the level of specificity we actually care about. So far the most specific we have are examples of caring about the general area, and that's easy to satisfy while giving the implementors freedom to do what's easiest to them.

Ah, I see what you mean.

During normal Japanese IME input, you have two phases.
First, you type the phonetic spelling in, then you hit Space to go into selection mode.
When entering selection mode, the IME splits your input into sections, and you can go through and resize those sections if they're incorrect.
Then, you can go through each section and choose which prediction to use. It's important that the IME is positioned in a way that makes it clear which section is being modified.

Yet the protocol designers decided to take this ability away anyway. What was their reasoning?

Yeah, looking at the xdg_positioner protocol does give me more ideas for how to design the input-method part.
It's more complex, but it manages to solve the repositioning problem without back-and-forth on every move.
I'll try to come up with a different protocol update.

@emersion
Copy link
Member

Maybe xdg-popup can be re-used for IMEs? The layer-shell is an example of a protocol that re-uses xdg-popups.

@tadeokondrak
Copy link
Contributor Author

tadeokondrak commented Jan 26, 2021

Maybe xdg-popup can be re-used for IMEs?

I thought of that, but I saw some issues with it. Looking closer, they can all probably be defined:

  • xdg_popup.grab would have to be defined, probably to be a protocol error.
  • xdg_popup.configure x/y would be relative to the cursor rectangle rather than the parent surface window geometry.
  • xdg_popup.popup_done would never be sent.
  • xdg_positioner.set_anchor_rect should probably be implicitly the cursor rectangle, with a protocol error if changed.
  • xdg_positioner.set_parent_size (v3) should be a protocol error.
  • xdg_positioner.set_parent_configure (v3) should be a protocol error.
  • There's no way for the IME to get the width of the cursor. I don't see a reason why it would need to, though.

Is changing the behaviour of a protocol when used through another like that okay?

@tadeokondrak
Copy link
Contributor Author

An issue in the protocol that has nothing to do with positioning:

If you're on one window typing with a popup open, then switch to another window, then the popup may stay on screen for however long it takes for the IME to realize it's been deactivated and reactivated.

@emersion
Copy link
Member

If you're on one window typing with a popup open, then switch to another window, then the popup may stay on screen for however long it takes for the IME to realize it's been deactivated and reactivated.

The compositor should probably close the popup when switching away?

@tadeokondrak
Copy link
Contributor Author

The compositor should probably close the popup when switching away?

There isn't a way for the compositor to close an input-method popup, it's always visible when the input method is active:

      The compositor should place it near the active text input area. It must
      be visible if and only if the input method is in the active state.

@emersion
Copy link
Member

Thsi doesn't look like a good design decision to me. There are many reasons why the compositor would want to close a popup: switching focus, opening a lockscreen, switching input methods... Why not use the popup_done event?

@tadeokondrak
Copy link
Contributor Author

tadeokondrak commented Jan 26, 2021

I'm referring to the existing input-method-v2 popups rather than xdg-popup right now, it doesn't have a popup_done event.

switching focus, opening a lockscreen, switching input methods...

These are done by deactivating the input method.

@emersion
Copy link
Member

Ah, I see. The compositor shouldn't rely on the client to withdraw the popup.

@tadeokondrak
Copy link
Contributor Author

The compositor shouldn't rely on the client to withdraw the popup.

It doesn't, but the situation is switching focus from one text-input client to another text-input client, directly from input box to input box. The input method has a single popup shared across all windows.
It doesn't receive the deactivate event in time, so it doesn't have time to redraw the popup again before activated again, leading to a flicker.

@tadeokondrak tadeokondrak marked this pull request as ready for review January 30, 2021 02:27
@emersion
Copy link
Member

Are all of the popup-related protocol issues documented in https://gitlab.freedesktop.org/wayland/wayland-protocols/-/issues/40?

Do we need protocol changes before this can be reviewed?

@tadeokondrak
Copy link
Contributor Author

Are all of the popup-related protocol issues documented in https://gitlab.freedesktop.org/wayland/wayland-protocols/-/issues/40?

I think so.

Do we need protocol changes before this can be reviewed?

No, this should still be useful as-is.
There is a single protocol change in this PR, which adds a protocol error (without bumping the version.)

Copy link
Member

@emersion emersion left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for working on this! Here's a first round of comments.

I see the popup interface is pretty bare-bones right now, it sounds like there can be a number of glitches because a de-synchronized state. We probably want to improve that with the next protocol version.

include/wlr/types/wlr_input_method_v2.h Outdated Show resolved Hide resolved
types/wlr_input_method_v2.c Outdated Show resolved Hide resolved
types/wlr_input_method_v2.c Outdated Show resolved Hide resolved
types/wlr_input_method_v2.c Show resolved Hide resolved
types/wlr_input_method_v2.c Outdated Show resolved Hide resolved
types/wlr_input_method_v2.c Outdated Show resolved Hide resolved
include/wlr/types/wlr_input_method_v2.h Outdated Show resolved Hide resolved
types/wlr_input_method_v2.c Show resolved Hide resolved
wl_signal_init(&popup_surface->events.unmap);
wl_signal_init(&popup_surface->events.destroy);

popup_surface_set_mapped(popup_surface,
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we allow already-mapped surfaces? I guess the protocol allows them, and disallowing them would be a breaking change. But before getting the text input box, the client can't really know how to draw the popup.

struct wlr_input_popup_surface_v2 *popup_surface,
struct wlr_box *sbox) {
zwp_input_popup_surface_v2_send_text_input_rectangle(
popup_surface->resource, sbox->x, sbox->y, sbox->width, sbox->height);
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Protocol issue: the compositor has no way to know when the client has taken this new information into account. Thus the compositor might draw an old buffer at a new position, resulting in a glitch.

@tadeokondrak
Copy link
Contributor Author

Should we allow already-mapped surfaces? I guess the protocol allows them, and disallowing them would be a breaking change. But before getting the text input box, the client can't really know how to draw the popup.

Protocol issue: the compositor has no way to know when the client has taken this new information into account. Thus the compositor might draw an old buffer at a new position, resulting in a glitch.

These are issues with the protocol, but even with a flawed protocol, this feature is useful.
I think that reviews on this PR and the Sway one would definitely carry over to the implementation of any future input-method popup protocol.
If a revised protocol is finished before the PRs are merged, it might make sense to only support the new one.

@emersion
Copy link
Member

These are issues with the protocol, but even with a flawed protocol, this feature is useful.

Yes, these are more of drive-by comments, I don't expect these to be fixed in this PR. :)

@emersion
Copy link
Member

Code looks pretty good to me.

@emersion
Copy link
Member

Pushed to the wrong branch maybe?

@tadeokondrak
Copy link
Contributor Author

Pushed to the wrong branch maybe?

Whoops, was testing the commits together and accidentally pushed. Didn't notice

@fosskers
Copy link

How do we feel about this PR? Do we need a patched wlroots with this PR in order to test swaywm/sway#5890 ?

Copy link
Member

@emersion emersion left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As I said before, code LGTM! Let's merge this even if the Sway PR isn't ready yet.

Thanks for your contribution!

@emersion emersion merged commit 30d3c76 into swaywm:master Sep 22, 2021
@fosskers
Copy link

I'm looking forward to documenting this whole setup.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Development

Successfully merging this pull request may close these issues.

5 participants