✨ NEW: Module on troubleshooting calculations #353

ramirezfranciscof · 2021-06-11T15:37:10Z

I took the content of the former running computations section that was
specific for troubleshooting errors and created a new module for it.

It includes:

An introduction with a warning of the previous knowledge required,
together with a link to the section where to get it.
A setup subsection where they prepare the calculation; I propose to
use the builder of pw and give them the faulty input parameters, but
setting up the structure, kpoints and pseudo is left as exercise.
Dry run to simulate the submission.
Troubleshooting an error: check the process status, the logs, the
output, and the inputs.
Restarting the calculation using get_builder_restart.

Closes #354

ramirezfranciscof · 2021-06-11T15:41:47Z

@mbercx : In the end you didn't create any issues for the running calculation sections, right? Not necessary since we met and discussed, just making sure I was not ignoring some important indications that I missed.

@CasperWA hey you are my session buddy, take a look, I meid dis.

ramirezfranciscof · 2021-06-11T15:57:55Z

Comments on the general structure, content, or specifics apart: I think we can probably all agree that it is ok to have here the dry run, the verdi process report and the get_builder_restart, but I would like to know if you agree that this is also the place for the outputcat and inputcat/inputls (in which case I will also later remove them from the calculations/basic section). I think this is ok since the reason why one would like to take a look at the input and output files is typically related to troubleshooting.

mbercx · 2021-06-12T05:00:05Z

Thanks, @ramirezfranciscof! I just had a quick look, and it already looks great! Will leave a detailed review to @CasperWA, but to answer your questions:

In the end you didn't create any issues for the running calculation sections, right? Not necessary since we met and discussed, just making sure I was not ignoring some important indications that I missed.

No, you're right! I wouldn't want to deprive you of the satisfaction of closing an issue, so I opened #354. 😁

[...], but I would like to know if you agree that this is also the place for the outputcat and inputcat/inputls (in which case I will also later remove them from the calculations/basic section). I think this is ok since the reason why one would like to take a look at the input and output files is typically related to troubleshooting.

I think it's indeed a good idea to show them the error in the QE stdout using outputls/outputcat. However, I'm not sure if it's needed to remove the parts where they are used in the calculation/basic section. There the purpose was to get more details on a successful run (which I think is also a valid use case), here it's to debug. I think it's fine to show the two use cases, it's not really repetitive, but rather reminds them of this command.

ramirezfranciscof · 2021-06-14T08:21:32Z

I think it's indeed a good idea to show them the error in the QE stdout using outputls/outputcat. However, I'm not sure if it's needed to remove the parts where they are used in the calculation/basic section. There the purpose was to get more details on a successful run (which I think is also a valid use case), here it's to debug. I think it's fine to show the two use cases, it's not really repetitive, but rather reminds them of this command.

Mmm yeah I see what you mean. It is true you may want to check some stuff directly from the output, but I still have the feeling that 99% of those checks are "debug related" checks. For general status you just check the verdi process list (and only check the output if there is a problem there), and all the useful data from a successful run is supposed to be on the output nodes (I would argue that if there is something you need to be checking on the output file after a successful run, you probably have a good case for requesting the plugin developer the option to include that in an output node).

There is also the matter of "flow" of the tutorial: I agree using repetition strategically can be good, but also we need to be careful because too much also confuses people (I think we had some complaints in this direction last tutorial, no?). I would perhaps save that for more critical concepts; let me see how the section looks like when I remove that, if you still don't like it I can always put it back.

mbercx · 2021-06-22T08:20:02Z

Mmm yeah I see what you mean. It is true you may want to check some stuff directly from the output, but I still have the feeling that 99% of those checks are "debug related" checks. For general status you just check the verdi process list (and only check the output if there is a problem there), and all the useful data from a successful run is supposed to be on the output nodes (I would argue that if there is something you need to be checking on the output file after a successful run, you probably have a good case for requesting the plugin developer the option to include that in an output node).

I agree that typically you wouldn't get your results using these commands, but again: a tutorial isn't meant to always show you the 'correct' way to do things. I think it's still good for them to also have a look at the (probably familiar) QE stdout.

There is also the matter of "flow" of the tutorial: I agree using repetition strategically can be good, but also we need to be careful because too much also confuses people (I think we had some complaints in this direction last tutorial, no?).

Actually, based on the feedback, the vast majority of participants of last year did not thing there was too much repetition in the tutorial material, so I wouldn't worry too much about it.

ramirezfranciscof · 2021-06-25T11:44:18Z

@CasperWA 😬

CasperWA · 2021-06-25T11:59:16Z

@CasperWA grimacing

Chill, dude.

CasperWA

Looks good.

There are some issues with "you" versus "we" though. I think the "we" perspective is from the original text and you've used "you". Choose one. Indeed, one of these perspectives should be chosen for the whole tutorial. I personally would go for "you", since it feels less disrespectful when reading it...

There also something about Node versus node. I'm not completely sure about this, though?

docs/sections/calculations/errors.md

mbercx · 2021-06-25T13:56:51Z

There are some issues with "you" versus "we" though. I think the "we" perspective is from the original text and you've used "you". Choose one. Indeed, one of these perspectives should be chosen for the whole tutorial. I personally would go for "you", since it feels less disrespectful when reading it...

Good point, probably best to give this some thought and then try to be consistent throughout the tutorial! I've opened #360 for this.

mbercx · 2021-06-25T13:58:04Z

Sorry, accidentally touched my touch pad with the palm of my hand as I was hovering over the "Close" button 😅

CasperWA · 2021-06-28T08:07:13Z

Also, going through my sections I see that we now more commonly use node and not Node. This should then be done for all pages, only using the capitalized version if one is discussing/mentioning the actual Python class name.
Mentioning @mbercx and @chrisjsewell as well for this point.

mbercx · 2021-06-28T08:29:58Z

Also, going through my sections I see that we now more commonly use node and not Node. This should then be done for all pages, only using the capitalized version if one is discussing/mentioning the actual Python class name.

Fair point! Have opened #361 for this so I keep it in mind as I'm giving a pass to all sections.

ramirezfranciscof · 2021-06-28T15:52:28Z

@CasperWA thanks for the thorough first pass! I think I have now addressed all points (+rebased the branch) and it is ready for a second go.

docs/sections/calculations/errors.md

CasperWA · 2021-06-30T14:33:55Z

Ready to be re-reviewed or?

ramirezfranciscof · 2021-06-30T15:01:17Z

Ready to be re-reviewed or?

Ah sure. Since I just accepted all the modifications I assumed that would be it but if you have more comments let them come. The only thing remaining is the PK correction which I'm only waiting for @mbercx to confirm if I open a new issue or use the existing one.

CasperWA · 2021-06-30T15:03:57Z

Ready to be re-reviewed or?

Ah sure. Since I just accepted all the modifications I assumed that would be it but if you have more comments let them come. The only thing remaining is the PK correction which I'm only waiting for @mbercx to confirm if I open a new issue or use the existing one.

Just make a new one.

ramirezfranciscof · 2021-06-30T15:09:55Z

Ba-dam done, feel free to re-re-review @CasperWA .

CasperWA

I think this is fine when the final correction here has been implemented.

docs/sections/calculations/errors.md

I took the content of the former `running computations` section that was specific for troubleshooting errors and created a new module for it. It includes: 1. An introduction with a warning of the previous knowledge required, together with a link to the section where to get it. 2. A setup subsection where they prepare the calculation; I propose to use the builder of pw and give them the faulty input parameters, but setting up the structure, kpoints and pseudo is left as exercise. 3. Dry run to simulate the submission. 4. Troubleshooting an error: check the process status, the logs, the output, and the inputs. 5. Restarting the calculation using `get_builder_restart`.

Co-authored-by: Casper Welzel Andersen <43357585+CasperWA@users.noreply.github.com>

mbercx · 2021-06-30T15:34:32Z

The only thing remaining is the PK correction which I'm only waiting for @mbercx to confirm if I open a new issue or use the existing one.

Sorry, I hadn't noticed the "new" mention. 😅 But yeah, opening a new issue is good 👍

Major revamp of the "Running calculations" section: * Removed all things related to troubleshooting (moved to a different module, see #353 ) * Changed introduction to highlight the contrast between how data is organized when running normally vs running with AiiDA. This is in hope that it'll illustrate more explicitly "how to do with AiiDA the things I already know how to do on my own". * Separated main content into 2 main sections: preliminary setups (which are performed using verdi CLI) and preparing the calculation (which is performed using the verdi shell). This is to address the problem we typically have of people needing to go back and forth from bash to python.

ramirezfranciscof requested review from CasperWA and mbercx June 11, 2021 15:42

mbercx changed the title ~~NEW: Section on troubleshooting calculations~~ ✨ NEW: Module on troubleshooting calculations Jun 12, 2021

CasperWA linked an issue Jun 25, 2021 that may be closed by this pull request

✨ NEW: Module on "Troubleshooting" calculations #354

Closed

CasperWA suggested changes Jun 25, 2021

View reviewed changes

mbercx mentioned this pull request Jun 25, 2021

👌 IMPROVE: Use consistent tense/perspective throughout tutorial #360

Open

mbercx closed this Jun 25, 2021

mbercx reopened this Jun 25, 2021

mbercx mentioned this pull request Jun 28, 2021

👌 IMPROVE: Make sure the usage of node vs Node is consistent #361

Open

ramirezfranciscof requested a review from CasperWA June 28, 2021 15:52

CasperWA suggested changes Jun 29, 2021

View reviewed changes

ramirezfranciscof mentioned this pull request Jun 29, 2021

👌 IMPROVE: Calculation processes - Running external codes #366

Merged

ramirezfranciscof requested a review from CasperWA June 30, 2021 15:01

ramirezfranciscof mentioned this pull request Jun 30, 2021

👌 IMPROVE: Make sure the usage of PK vs PK is consistent #374

Open

CasperWA approved these changes Jun 30, 2021

View reviewed changes

docs/sections/calculations/errors.md Outdated Show resolved Hide resolved

ramirezfranciscof and others added 7 commits June 30, 2021 17:21

Apply suggestions from code review

6ff5308

Co-authored-by: Casper Welzel Andersen <43357585+CasperWA@users.noreply.github.com>

Update docs/sections/calculations/errors.md

ab02590

Co-authored-by: Casper Welzel Andersen <43357585+CasperWA@users.noreply.github.com>

Applied corrections.

a015332

Apply suggestions from code review

b935305

Co-authored-by: Casper Welzel Andersen <43357585+CasperWA@users.noreply.github.com>

Update docs/sections/calculations/errors.md

93a1daa

Co-authored-by: Casper Welzel Andersen <43357585+CasperWA@users.noreply.github.com>

Update docs/sections/calculations/errors.md

2458b58

Co-authored-by: Casper Welzel Andersen <43357585+CasperWA@users.noreply.github.com>

ramirezfranciscof merged commit 931e3d9 into aiidateam:tutorial-2021-intro-week Jun 30, 2021

ramirezfranciscof deleted the troubleshoot branch June 30, 2021 15:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

✨ NEW: Module on troubleshooting calculations #353

✨ NEW: Module on troubleshooting calculations #353

ramirezfranciscof commented Jun 11, 2021 •

edited

Loading

ramirezfranciscof commented Jun 11, 2021

ramirezfranciscof commented Jun 11, 2021

mbercx commented Jun 12, 2021

ramirezfranciscof commented Jun 14, 2021 •

edited

Loading

mbercx commented Jun 22, 2021

ramirezfranciscof commented Jun 25, 2021

CasperWA commented Jun 25, 2021

CasperWA left a comment

mbercx commented Jun 25, 2021 •

edited

Loading

mbercx commented Jun 25, 2021

CasperWA commented Jun 28, 2021

mbercx commented Jun 28, 2021

ramirezfranciscof commented Jun 28, 2021

CasperWA commented Jun 30, 2021

ramirezfranciscof commented Jun 30, 2021

CasperWA commented Jun 30, 2021

ramirezfranciscof commented Jun 30, 2021

CasperWA left a comment

mbercx commented Jun 30, 2021

✨ NEW: Module on troubleshooting calculations #353

✨ NEW: Module on troubleshooting calculations #353

Conversation

ramirezfranciscof commented Jun 11, 2021 • edited Loading

ramirezfranciscof commented Jun 11, 2021

ramirezfranciscof commented Jun 11, 2021

mbercx commented Jun 12, 2021

ramirezfranciscof commented Jun 14, 2021 • edited Loading

mbercx commented Jun 22, 2021

ramirezfranciscof commented Jun 25, 2021

CasperWA commented Jun 25, 2021

CasperWA left a comment

Choose a reason for hiding this comment

mbercx commented Jun 25, 2021 • edited Loading

mbercx commented Jun 25, 2021

CasperWA commented Jun 28, 2021

mbercx commented Jun 28, 2021

ramirezfranciscof commented Jun 28, 2021

CasperWA commented Jun 30, 2021

ramirezfranciscof commented Jun 30, 2021

CasperWA commented Jun 30, 2021

ramirezfranciscof commented Jun 30, 2021

CasperWA left a comment

Choose a reason for hiding this comment

mbercx commented Jun 30, 2021

ramirezfranciscof commented Jun 11, 2021 •

edited

Loading

ramirezfranciscof commented Jun 14, 2021 •

edited

Loading

mbercx commented Jun 25, 2021 •

edited

Loading