
[WIP] Integration tests: Discussion #10791

Closed
wants to merge 28 commits

Conversation

@Stifael commented Oct 31, 2018

Background

Currently PX4 evolves primarily through feature addition, but new features are added without any guarantee that existing behavior does not break. To reduce this uncertainty, there is now a clear effort to expand flight testing through the @PX4/testflights team. However, many implementations cannot be verified directly by flying; they would require in-depth log analysis. In addition, many legacy implementations are no longer widely known: they dangle around in the code, waiting for someone to break them so that they can be re-fixed afterwards because a few users depend on them. With the current state of PX4 there is no way around this break-and-fix cycle (since the developer may simply not know about the feature), but it would be desirable that once such a broken legacy implementation has been re-fixed, that part of the code never breaks again, even if the feature remains unknown to everyone.
One possible approach to tackle this problem is to make use of integration tests. PX4 already has an integration-test framework based on ROS or DroneKit. However, there are only about five integration tests overall, and they have aged considerably. One reason the set of integration tests is not growing is probably that it is difficult to run these tests on a local machine without going through a nightmare of installing additional dependencies.
Luckily, dronecode_sdk has recently evolved quite a lot, which makes it very easy to set up simple missions or offboard maneuvers. Offboard support is not fully there yet, but that is mainly due to a lack of resources and time. With dronecode_sdk as part of the PX4 Firmware, every developer can easily generate simple maneuvers that can then be used for integration testing.
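To give an idea of what such a maneuver script looks like, here is a minimal takeoff-and-land sketch written against the SDK's present-day Python frontend (MAVSDK-Python). The PR itself builds the C++ SDK examples from a submodule, so the module name, connection URL and timings below are illustrative assumptions only:

# Minimal maneuver sketch using MAVSDK-Python (assumption for illustration;
# the PR itself wires in the C++ SDK as a submodule).
import asyncio
from mavsdk import System

async def takeoff_and_land():
    drone = System()
    await drone.connect(system_address="udp://:14540")  # default SITL port

    # wait until the simulated vehicle is discovered
    async for state in drone.core.connection_state():
        if state.is_connected:
            break

    await drone.action.arm()
    await drone.action.takeoff()
    await asyncio.sleep(10)          # hover for a fixed, deterministic time
    await drone.action.land()

if __name__ == "__main__":
    asyncio.run(takeoff_and_land())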

Integration test pipeline

The suggested integration test pipeline differs from the existing integration tests. Originally, an integration test consisted of a simulated maneuver with immediate test checking. The approach I suggest is based on log files:

  1. A simulated maneuver is executed. This generates a .ulg log file.
  2. The .ulg file is the input to the integration tests.

There are several advantages to writing integration tests based on .ulg files:
Self-contained development of simulated maneuvers
The simulated maneuvers no longer require any tests. They can be implemented in such a way that they could also be run in reality. For instance, a simple maneuver could be to fly a mission in a square where each waypoint is 5 m apart. Without any extra effort, that same mission can also be run on an actual vehicle. This means that if we have a set of maneuvers, the @PX4/testflights team could run these deterministic flight tests in addition to the usual flight tests.

Self-contained testing framework based on .ulg-file
We are completely free to choose any framework for testing. Since any .ulg file can be converted into Python data structures through pyulog, a good option would be a Python testing framework.
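As a minimal sketch of that idea, the following converts one logged topic of a .ulg file into a pandas DataFrame via pyulog (the file name is a placeholder and the topic is assumed to be present in the log):

import pandas as pd
from pyulog import ULog

def topic_to_dataframe(ulg_path, topic):
    """Convert one logged topic into a pandas DataFrame."""
    ulog = ULog(ulg_path)
    dataset = ulog.get_dataset(topic)   # raises if the topic was not logged
    df = pd.DataFrame(dataset.data)     # dict of numpy arrays -> DataFrame
    # ULog timestamps are microseconds since boot
    df.index = pd.to_timedelta(df["timestamp"], unit="us")
    return df

pos = topic_to_dataframe("log.ulg", "vehicle_local_position")
print(pos[["x", "y", "z"]].describe())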

The tests can be split into general tests and simulated tests:

  • General tests are simulation independent. For instance, the mission speed should never exceed MPC_XY_CRUISE, regardless of whether the flight is simulated or real. Consequently, one can write a test that checks the cruise speed during a mission; this test can be applied to a simulated maneuver as well as to the log file of an actual flight test (see the sketch after this list).
  • Simulated tests rely on the maneuver being deterministic. For instance, a simulated test could check whether the vehicle reached a predefined waypoint within a certain time.
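A minimal sketch of such a general test, written with pytest, pyulog and pandas (the log path, the tolerance and the crude mission windowing are assumptions):

import numpy as np
import pandas as pd
from pyulog import ULog

LOG_PATH = "log.ulg"  # placeholder; in CI this would point to the newest SITL log
AUTO_MISSION = 3      # vehicle_status NAVIGATION_STATE_AUTO_MISSION
TOLERANCE = 0.5       # m/s of slack on top of the parameter (an assumption)

def test_mission_speed_below_cruise():
    ulog = ULog(LOG_PATH)
    cruise = ulog.initial_parameters["MPC_XY_CRUISE"]

    status = pd.DataFrame(ulog.get_dataset("vehicle_status").data)
    pos = pd.DataFrame(ulog.get_dataset("vehicle_local_position").data)

    mission_t = status.loc[status["nav_state"] == AUTO_MISSION, "timestamp"]
    if mission_t.empty:
        return  # precondition not met: this log contains no mission flight

    # crude windowing: first to last mission sample (good enough for a sketch)
    in_mission = pos["timestamp"].between(mission_t.min(), mission_t.max())
    speed_xy = np.hypot(pos.loc[in_mission, "vx"], pos.loc[in_mission, "vy"])
    assert (speed_xy <= cruise + TOLERANCE).all()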

Current state

px4maneuvers

px4maneuvers is a simple project that uses dronecode_sdk to generate simple missions (https://github.com/Stifael/px4maneuvers). I added this project to the Firmware as a submodule (which is what this PR does). To run the example mission, all you need to do is:

  • update the submodule
  • build the example mission
  • run it as explained in the repo.

I added a maneuver.sh script that simplifies the build process. From the root of the PX4 Firmware, type:

chmod +x Tools/maneuver.sh
./Tools/maneuver.sh mission

In another terminal, start SITL as usual.

This project can now easily be extended. I also think it would help the active development of the SDK: if the PX4 community decides to use the SDK for integration testing, I am confident that developers will become more active in shaping and contributing to the SDK project as well.

uloganalysis

The name uloganalysis is not really a good name and will need to change eventually. That said, everything is just [WIP] anyway.
This project is here. Most of the details are provided in the README.md. The main objective of this project is to re-sample and merge the pyulog output into a pandas DataFrame (similar to px4tools). The goal is for this project to stay simple, without adding overly fancy analysis methods. It also should not really analyze anything; rather, it should provide convenient functions to add and extract information from the .ulg file. For instance, the roll, pitch and yaw errors can easily be computed from the vehicle_attitude and vehicle_attitude_setpoint messages. Another example is tilt, which is a fundamental constraint within the PX4 Firmware but is not logged. That information can be extracted with uloganalysis.
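
As a sketch of the kind of helper meant here: tilt, i.e. the angle between the body z-axis and the world z-axis, can be derived from the logged attitude quaternion and appended as an extra column (field names follow the ULog flattening convention q[0]..q[3] = (w, x, y, z); the log path is a placeholder):

import numpy as np
import pandas as pd
from pyulog import ULog

def add_tilt_column(ulg_path):
    """Return vehicle_attitude as a DataFrame with an extra 'tilt' column (radians)."""
    att = pd.DataFrame(ULog(ulg_path).get_dataset("vehicle_attitude").data)
    qx, qy = att["q[1]"], att["q[2]"]
    # rotation-matrix element R[2][2] of a unit quaternion (w, x, y, z)
    r22 = 1.0 - 2.0 * (qx**2 + qy**2)
    att["tilt"] = np.arccos(np.clip(r22, -1.0, 1.0))
    return att

att = add_tilt_column("log.ulg")
print("max tilt [deg]:", np.degrees(att["tilt"].max()))
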
Another goal of this project is to enable other projects such as testing, visualization, etc. For instance, PX4 Flight Review is a good tool for sharing general flight-related information. FlightPlot is a good tool for going into more detail and looking at messages that are not shown in Flight Review. However, computations such as the tilt computation are not possible there either. Hence, from a developer perspective, I prefer a setup that is more flexible. Consequently, uloganalysis could be used in any Python project (for instance Jupyter notebooks for sharing) and therefore allows the developer to make full use of the Python tooling.
Once this project has matured, it could be packaged and distributed via PyPI (everything should already be in place).

px4ulogtests

px4ulogtests is a project that consists of test scripts and is based on uloganalysis. The purpose of this project is to use it for integration testing, but also for actual flight testing and for creating reports. Currently this project contains only one test, in which the vehicle's tilt during Manual and Altitude flight is checked. This is a general test, which means it can be applied to simulated maneuvers as well as actual flight tests.

What is needed

First I need to know whether such a pipeline is something the PX4 community is interested in. If it is, then each project needs to be discussed and reviewed again. However, for personal reasons, I definitely do not want one large project that contains everything: to me it is crucial that uloganalysis stays independent. That said, I am fine with any project as long as it fulfills a similar purpose.
Then we also need to discuss how to integrate such a pipeline into the current CI.
Once we have agreed on a pipeline (or declined it entirely), and we still want to use the proposed projects, I would like to move these repos over to the PX4 umbrella.

@Stifael commented Oct 31, 2018

I don't want to add every single developer as a reviewer (it would feel like spamming everyone). Please just add your comments and thoughts.

@dagar dagar changed the title [WIP] Inegration tests: Discussion [WIP] Integration tests: Discussion Oct 31, 2018
@dagar commented Oct 31, 2018

@Stifael this sounds like a great idea. We have the CI capacity to run significantly more tests, and will soon have even more with @julianoes' faster-than-realtime work.

I'll spend some time reviewing the submodule project in detail. At a high level, the main things I want are to 1) make it easy to add new tests and 2) make it absolutely trivial to reproduce a failure locally.

@mrpollo commented Oct 31, 2018

@julianoes @JonasVautherin any word on the SDK submodule? Is this the “right” way to include the SDK as a dependency for testing?

@julianoes
@mrpollo: I would like it to install the latest release and test against that, but until that's ready the submodule is OK.

@bkueng commented Nov 5, 2018

Nice, thanks Dennis for kicking this off. This is definitely helpful.

In addition to Daniel's points, some things from my side:

  • Python would be my preferred choice as well, since the SDK now has a Python frontend (and we should use that)
  • basing the analysis on log files (vs. real-time feedback from the vehicle) has pros and cons:
    • + ability to run validation offline, after a (potentially real) flight
    • - no real-time checks, which might be required for some tests (e.g. timeouts if a waypoint is never reached, a VTOL never transitions, ...)
    • I also tend towards using the log file though
  • tests and ulog analysis are closely coupled in many cases (e.g. checking against a mission plan with changing velocities, sending attitude/velocity setpoints and afterwards checking them, etc.). So ideally a test and its validation should live close together (i.e. in a single file), to simplify the writing of tests.
  • uloganalysis: what's your take on plotting? I could see this being used in Flight Review as well, and the library could then also help with algorithm prototyping.

Some more test cases that should be considered by the design:

  • testing RC loss
  • low battery failsafe
  • GPS loss
  • testing different yaw behaviors in missions

@Stifael commented Nov 11, 2018

About the negative points:

  • no real-time checks, which might be required for some tests (e.g. timeouts if a waypoint is never reached, vtol never transitions, ...)

That is still possible even with post-hoc ulog analysis; it just depends on the test. For instance, one can write a simple mission where the vehicle flies from point A to point B and is expected to cover that distance within a certain time T. The generated ulog file can still be used to check whether the vehicle accomplished that simple maneuver (see the sketch below).
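A rough sketch of such a post-hoc timing check, assuming the waypoint positions in the local frame, the acceptance radius and the expected time T are known for the maneuver (all values below are placeholders):

import numpy as np
import pandas as pd
from pyulog import ULog

WAYPOINT_A = (0.0, 0.0)      # local x, y in metres (placeholder)
WAYPOINT_B = (5.0, 0.0)      # placeholder
ACCEPTANCE_RADIUS = 1.0      # metres (placeholder)
MAX_DURATION_S = 20.0        # expected time T (placeholder)

def first_time_within(pos, point):
    """Timestamp (seconds) at which the vehicle first gets close to 'point'."""
    dist = np.hypot(pos["x"] - point[0], pos["y"] - point[1])
    hit = pos.loc[dist < ACCEPTANCE_RADIUS, "timestamp"]
    assert not hit.empty, "waypoint never reached"
    return hit.iloc[0] * 1e-6    # ULog timestamps are in microseconds

def test_b_reached_within_time():
    pos = pd.DataFrame(ULog("log.ulg").get_dataset("vehicle_local_position").data)
    elapsed = first_time_within(pos, WAYPOINT_B) - first_time_within(pos, WAYPOINT_A)
    assert 0.0 <= elapsed <= MAX_DURATION_S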

So ideally a test and its validation should live close together (i.e. in a single file), to simplify the writing of tests.

Agreed. However, I suggest having two files: one file with tests that always have to be satisfied, and one file with tests that are specific to a simulated maneuver. For instance, the example above with the simple mission between waypoints A and B and expected time T is a very specific test that only applies to that particular mission. I would keep such tests separated from the other general tests.

Some more test cases that should be considered by the design:

  • testing RC loss

I think this should be possible with post analysis; it just depends on what you want to test. Since most PX4 flight behaviors are controlled by parameters, I think this is possible.

  • low battery failsafe

same as above

  • GPS loss

Same as above. Do you have anything related to GPS loss in mind that would not be possible to test through the ulog?

  • testing different yaw behaviors in missions

That is for sure possible.

In the simple tilt test that I added as an example here, the test uses a parameter to check the maximum tilt and uses the navigation state topic to check the tilt for Manual/Stabilized only. The same logic can be applied to yaw or any other message.

Right now I think it would be great to have a PR where the entire pipeline is executed. That, however, would require some help with the CI integration.

@bkueng commented Nov 12, 2018

That is still possible even with post ulog analysis, it just depends on the test.

I was thinking about aborting a test or taking some other action. How do you plan to handle that?

One file that has tests that always have to be satisfied, and one file with tests which are specific for a simulated test.

Sure, I'd separate these as well. I don't see the general checks as individual tests, but rather as a set of checks that needs to be satisfied in every test. These checks could be made available to all tests through a common base class, for example.
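A minimal sketch of that base-class idea (class names, the checks themselves and the hard-coded log path are hypothetical):

import numpy as np
import pandas as pd
from pyulog import ULog

LOG_PATH = "log.ulg"  # placeholder; would be provided per maneuver, e.g. via a fixture

class GeneralChecks:
    """Checks that must hold for every log, regardless of the maneuver."""

    def load(self, topic):
        return pd.DataFrame(ULog(LOG_PATH).get_dataset(topic).data)

    def test_no_nan_in_local_position(self):
        pos = self.load("vehicle_local_position")
        assert not pos[["x", "y", "z"]].isna().any().any()

class TestSquareMission(GeneralChecks):
    """Maneuver-specific checks; pytest also runs the inherited general checks."""

    def test_ends_close_to_start(self):
        pos = self.load("vehicle_local_position")
        assert np.hypot(pos["x"].iloc[-1], pos["y"].iloc[-1]) < 2.0  # placeholder radius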

Do you have anything in mind that would not be possible to test through ulog that is GPS loss related?

No, I think it will work.

@Stifael commented Nov 12, 2018

I was thinking about aborting a test or taking some other action. How do you plan to handle that?

What would be the use case for stopping a test?

These checks can be made available to all tests through a common base class for example.

I think we are talking about the same thing. Maybe we need to discuss it during the dev-call.

Right now, the way I have planned the testing is as follows:

General tests

All the tests are within one file. Each test can either be called as a single test, or all tests can be run at once. Here I would distinguish between CI and a regular log file.

Regular log file

For a regular log file, I would like to run that file against all of the general tests. This requires all tests to be written in such a way that a test can only fail if its base conditions are met and the actual check fails. Let's take the tilt example again: the test for tilt during Stabilized only makes sense if the vehicle_local_attitude message is present and the vehicle was in Stabilized for some time. If these conditions are not met, the test succeeds.
If we now have a regular log file, we can test it against the general tests, and a test will only fail where its conditions are met and its check fails. A sketch of this gating is shown below.
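A minimal sketch of that gating, using the Manual/Altitude tilt check as the example (the log path, the tilt limit, the nav_state constants and the use of pytest.skip instead of silently passing are my own assumptions):

import numpy as np
import pandas as pd
import pytest
from pyulog import ULog

LOG_PATH = "log.ulg"    # placeholder
MANUAL, ALTCTL = 0, 1   # vehicle_status nav_state values for Manual and Altitude
MAX_TILT_DEG = 45.0     # placeholder; would come from the relevant tilt parameter

def test_tilt_in_manual_and_altitude():
    ulog = ULog(LOG_PATH)
    try:
        att = pd.DataFrame(ulog.get_dataset("vehicle_attitude").data)
        status = pd.DataFrame(ulog.get_dataset("vehicle_status").data)
    except (KeyError, IndexError, ValueError):
        pytest.skip("required topics not logged")

    # align the flight mode to each attitude sample
    merged = pd.merge_asof(
        att.sort_values("timestamp"),
        status.sort_values("timestamp")[["timestamp", "nav_state"]],
        on="timestamp",
    )
    manual = merged[merged["nav_state"].isin([MANUAL, ALTCTL])]
    if manual.empty:
        pytest.skip("vehicle never flew in Manual or Altitude mode")

    # tilt from the quaternion, as in the earlier sketch
    r22 = 1.0 - 2.0 * (manual["q[1]"] ** 2 + manual["q[2]"] ** 2)
    tilt_deg = np.degrees(np.arccos(np.clip(r22, -1.0, 1.0)))
    assert (tilt_deg <= MAX_TILT_DEG).all()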

CI

For CI, we obviously need simulated maneuvers that generate ulog files for us. Some of the simulated maneuvers require a specific test from the simulated tests. In addition they might require tests from the general tests as well, but not all of them. In principle, one could run each simulated maneuver against all the general tests as well. However, I wonder whether that scales if we have about 100 simulated maneuvers and about 200 general tests.

My point is that I want to have both. Either you run the log file against all tests, or you run it against specific tests that you are interested in.

@bkueng commented Nov 13, 2018

What would be the use case for stopping a test?

For example when the vehicle is flying a mission and does not make any progress anymore.

My point is that I want to have both. Either you run the log file against all tests, or you run it against specific tests that you are interested in.

Sure, I see it the same way. I guess it's best to see that once it's implemented (is it already complete?).

read -p "Continue (y/n)?" answer

if [[ $answer == [Yy] ]]; then
mkdir $build_dir

Review comment: Maybe first check whether the submodule is checked out properly, otherwise this will fail and the script continues anyway.

fi

# start requested maneuver
cd $build_dir && ./maneuvers/$maneuver udp://:14540

Review comment: newline before EOF is missing.

@Stifael commented Nov 26, 2018

uloganalysis: what's your take on plotting? I could see this being used in Flight Review as well, and the library could then also help with algorithm prototyping.

Yes, that is the goal. It can be used by Flight Review.

I think it would make sense to have a simple example of this testing pipeline in action. To make that work, I will probably need @dagar's help. The CI would need to run the following components:

  1. make posix_sitl_default gazebo to start simulation
  2. ./Tools/maneuver.sh mission for the simulated mission
  3. pytest test_general --filepath=[absolute path to log] to run the tests

@Stifael force-pushed the integration-testing-ulog branch from d1b857e to af25acf on November 27, 2018
@Stifael force-pushed the integration-testing-ulog branch from af25acf to 37efe77 on December 4, 2018
@Stifael commented Dec 4, 2018

@dagar I have now added a test_runner.sh script, which does the following:

  1. start gazebo server and client, px4
  2. read newest ulog file
  3. run general test

Right now the logs are stored in the px4-src directory.

TODO:

  • create yaml config file that is read by run_tests.py
  • write a script to set up everything
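Regarding the first TODO item, a purely hypothetical sketch of what such a config could contain and how run_tests.py might read it (file layout, keys and values are all made up):

import yaml

EXAMPLE_CONFIG = """
log_dir: build/posix_sitl_default/logs   # where the SITL .ulg files end up (assumed)
maneuvers:
  - name: mission
    tests: [test_general, test_mission_square]
disabled_tests:
  - TestTilt::test_max_tilt
"""

config = yaml.safe_load(EXAMPLE_CONFIG)
for maneuver in config["maneuvers"]:
    print(maneuver["name"], "->", ", ".join(maneuver["tests"]))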

PhiAbs and others added 2 commits December 12, 2018 09:19
- move yaml to config directory
- deactivate tests based on TestClass name and method
@Stifael commented Dec 12, 2018

TODO

  • move px4ulogtest project into px4 src-tree
  • rename uloganalysis and push it to PyPI once matured
  • add to CI as an example
  • add jmavsim

@Stifael commented Feb 1, 2019

uloganalysis is now called pyulgresample: https://github.com/YUNEEC/pyulgresample
The main purpose of that project is to resample ulog data. Once I have finished the unit tests, I will upload it to PyPI, from which it can then be downloaded.

@Stifael Stifael added the devcall label Feb 1, 2019
@hamishwillee
uloganalysis is now called pyulgresample: https://github.com/YUNEEC/pyulgresample
The main purpose of that project is to resample ulog data. Once I have finished the unit tests, I will upload it to PyPI, from which it can then be downloaded.

@Stifael Can you also add some information about this in https://docs.px4.io/en/log/flight_log_analysis.html#analysis-tools ?

stale bot commented Jul 10, 2019

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@hamishwillee
Is this really stale?

@Stifael commented Jul 15, 2019

Yep. I will open a new PR instead and will close this one, since it is outdated by now.

@Stifael Stifael closed this Jul 15, 2019
@Stifael Stifael deleted the integration-testing-ulog branch July 15, 2019 11:36