-
Notifications
You must be signed in to change notification settings - Fork 144
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support for multiple devices #372
Comments
some comments:
|
It's currently 31 characters (32 bytes including string terminator) of
Indeed, that's the idea. Supporting the old syntax would be annoying, and unclear what the behaviour would be.
Yes.
Sure, but no one's asked for it. Would this be useful to you? |
@mato thx for your quick reply
great! fine with me
not right now tbh, so best to wait until someone needs it. |
Just one comment. The name Two questions to clarify things for me; I'm assuming the answers are no. Will devices have to be acquired/opened before using them? will a handle be allocated at every acquire (like an fd in posix open)? |
Naming is hard. I'm trying to pick a name based on the purpose of the call, not the implementation.
You seem to be mixing intent/purpose and implementation here. Yes, you have to "acquire a handle to" a device, in order to use it? How else would you expect the libOS to get the handle? (Please ignore the implementation!) In terms of allocation, no. You'll always get the same handle back, but again, that happens to be a property of the implementation right now. Unlikely to change though. |
Sorry, i was thinking about the initialization of the device, like a TAP device getting initialized lazily in hvt only after the first acquire. That could be interesting to improve startup times even more (if the net device is only needed at the end of the execution for example). |
That would violate the security model and prevent capsicumizing hvt (see #366) and in future doing something similar with seccomp on Linux. |
Regarding the size of device names, for block devices I think its worth storing at least 64 bytes, because then a unikernel can reference a static device image with 256 bit hash encoded in hexadecimal. Also, UUIDs have a 36 byte encoding. UUIDs being a good way to refer to specfic GPT partitions or partition types. |
That's a very good point, I was thinking along the same lines myself. Regarding a hex-encoded 256-bit hash, with the final |
(Edited previous comment with actual alignment values). With a |
In #379 (comment), @cfcs brings up a good point, that we have not exhaustively defined how we want to represent the Solo5 and manifest ABI versions in the unikernel binary:
Currently, we have the following version information in the unikernel binary:
What we don't have at the moment is:
Question: Which of these do we actually want, going forward? My feeling is that to be as future-proof and flexible as possible we should encode all of the above in the binary. That would mean:
This poses the additional question, where in point (3) above, do we want to allow the tender/bindings internal ABI (e.g. in the case of hvt the hypercall ABI) to evolve separately from the Solo5 ABI (function signatures in solo5.h)? If yes, then we need two values here. Irrespective of the option we choose, I would use a single value for each, start with Thoughts? Especially @ricarkol @cfcs @hannesm. The choices are fairly subtle and will stay with us for some time, so please give this some thought. |
IIUC, (1) can change when the format needs to change (and existing tenders ( (4) is as well not very controversial imho (at the moment, I keep this in albatross separately in the certificate (https://github.com/hannesm/albatross/blob/50ed6a8d1ead169b3e322aaccb469e870ad72acc/src/vmm_asn.ml#L54), it would make sense to include this as part of the manifest! (3) is pretty open-ended -- from an operators point of view, I'd for sure like to have a solo5-hvt binary that can execute as many (differently, separately compiled) unikernels as possible: as long as the core solo5 ( (y) I would as well be fine with a single solo5 ABI version (core API + all modules), and a reliable way to extract this number from both the tender (which one it supports -- maybe a given (y), i don't think we need to evolve the internal ABI separately from the solo5 ABI. |
1., 2., and 4. are very clear yesses. Regarding 3:
This is important to us.
Not required, or at least not worth the maintenance cost. Regarding 4. I remember we had some discussion (at Marrakesh) about having universal unikernel binaries that could be executed on all backends. If we ever intend to do so, it might be a good idea to store flags in |
Sorry I'm a bit late with my comments.
re: 3): I think the main thing is keeping track of compatibility; if a site needs backwards compatiblity they can keep the tender binaries for previous versions around when we make breaking changes where it is not feasible to support multiple ABI versions from the latest tender. I think it's also reasonable to say that the site will always have the latest tender available (meaning not trying to deal with cases where the unikernels are using a more recent ABI version than the tender, and only dealing with cases where the unikernels are older than the tender). re: 1) 2) 4): That sounds reasonable.
|
Thanks for the feedback. Having given it some thought, here's what I'll do: Regarding (1) and (2): i.e. the "outer" and "inner" manifest versions, these will stay as-is. Regarding (3) and (4): I will add a separate ELF note which will be declared in the bindings (not the manifest!) and contain a major/minor "target abi version" and "target type". These will represent the internal ABI between a tender (or possibly for Muen, the hypervisor) and the bindings. See #386 for the infrastructure needed to support loading different ELF notes from different sources. Rationale: The internal ABI is represented by everything in Regarding the relationship to the Solo5 unikernel-facing API ( Regarding specific comments:
This is effectively what you'll get from (3) and (4) in an ABI ELF note, sans the "building a tender with a non-default or custom set of modules" -- If you do that you'll need to invent your own convention of some kind. I will add --abi-version or similar to the tender, and likewise mfttool (bintool? better name?) will be able to extract the expected values from a binary for you.
It is extremely unlikely that we'll go down the route of tenders supporting multiple ABI or manifest versions, mainly due to maintenance load and not wanting to keep old code around (that way lies QEMU!). But, as they say, "no is temporary, yes is forever" and the design does give us the option of doing this.
Unlikely to happen in any near future. If it did, then it'd be a new "universal" target type. What kind of flags did you have in mind? Might be worth adding a couple of
This is what I had in mind. Loading the ELF notes correctly is fiddly at best -- I certainly don't want to provide a library for it at this stage (and deal with the extra maintenance overhead), so it'd be exec() based at least in the medium term. Of course, if you want to write an ELF NOTE loader in OCaml for albatross, there's nothing stopping you from doing that :-) |
@mato that makes sense, I added an issue about sandboxing the |
Removed the release-engineering related steps from this issue as they're not directly related. The implementations here are now complete, so all done here! |
This issue sets out the high-level design of upcoming support for multiple devices in Solo5, from the point of view of the unikernel application ("developer") and deployment operator ("user"). It is intended to be read/discussed in conjunction with the work in progress implementation in PR #373.
Steps to completion:
(WIP) Implement support for multiple devices #373Multiple devices support, application manifest #375).Application manifest
We introduce the concept of an application manifest, which is defined by the developer at application build time, using the following JSON format and customarily named
manifest.json
:NAME is a human-readable unique identifier for the device. The intention is that the name conveys some meaning of the intended use ("wiring") of the device, eg.
frontend
orstorage
. NAME must be composed of alphanumeric characters only, and within 1..67 characters in length.TYPE is the type of device being declared, currently
BLOCK_BASIC
orNET_BASIC
.Note that NAME must be unique, and there is a maximum limit of 64 devices in the manifest.
At build time,
manifest.json
is pre-processed by the newly introducedsolo5-mfttool
, generating a C source file with a binary representation. This source file is then compiled using the normal Solo5 tool chain, and linked into the unikernel binary, where it is represented as an ELF NOTE.API changes
We introduce two new data types,
solo5_handle_t
to represent an opaque "handle" to a device, andsolo5_handle_set_t
to represent a set of up to 64 handles represented as a bitmask.The Solo5 public API will change as follows (abridged):
Notably,
solo5_yield()
now returnsvoid
and, if requested, asolo5_handle_set_t
as an out parameter with the set of handles ready for input. Thesolo5_..._info()
calls have been renamed tosolo5_..._acquire()
("acquire a handle forname
") and in addition to the existinginfo
structures return asolo5_handle_t
. The actual block and network I/O calls now require asolo5_handle_t
.Tender changes (spt and hvt)
From the operational side, the command line syntax used by the tenders for attaching devices to the unikernel changes as follows:
Network (TAP) devices
Block devices
In both cases,
NAME
must be specified and match the device declared asname
in the application manifest. The declared device'stype
must also match the device being attached.Before launching the unikernel, the tender will verify that all declared devices have been attached. This is intentional, and enforces the contract between the unikernel and tender, to ensure that a misconfigured combination of devices cannot be provided to the unikernel.
All those affected by this change, please review this design at your earliest convenience for any showstoppers. /cc @djwillia @ricarkol @hannesm @Kensan @ehmry
The text was updated successfully, but these errors were encountered: