New Gaussian job and handlers #328

janosh · 2024-04-03T05:40:57Z

merging working branch gaussian into master. follow-up to #325. all credit to @rashatwi

Summary by CodeRabbit

New Features
- Introduced a new Gaussian Jobs package for managing Gaussian runs, including error handling and job setup.
- Added a range of mock input files for Gaussian calculations covering various molecular optimization scenarios.
- Enhanced pre-commit configuration to include more aggressive linting options.
Bug Fixes
- Implemented error handlers for common issues in Gaussian calculations such as wall time limits and SCF convergence.
Tests
- Added comprehensive test cases for Gaussian job management and error handling functionalities.
Chores
- Included new dependencies necessary for Gaussian-related functionalities in the project setup.

…rds and disabling symmetry

* Change better_guess func name in tests * Add type annotations to GaussianErrorHandler * `float` instead of `int | float` for mem

janosh · 2024-04-03T06:01:01Z

@rashatwi could you have a quick look at the CI logs why the test files can no longer be found?

rashatwi · 2024-04-03T06:06:41Z

@janosh The mentioned files in the error messages are in the correct location (under tests/files/gaussian), I'm not sure why this error is occurring. When I run locally, all the tests succeed. The .out.gz files are getting replaced by .out files (decompressed) before the tests run, which is causing the FileNotFoundError.

rashatwi · 2024-04-23T15:35:07Z

@janosh I fixed the issue with the test files on my forked repo. Should I create a new PR?

janosh · 2024-04-23T15:59:01Z

that would be great, thanks! sorry for letting this slip. was hoping to get back to it sooner

coderabbitai · 2024-04-24T20:45:01Z

Walkthrough

The latest update enhances the custodian package with new Gaussian computational chemistry capabilities, including job handling and error management. It introduces new modules for Gaussian jobs and handlers, along with extensive test cases and mock files for simulations. Additionally, the .pre-commit-config.yaml file has been updated to include more aggressive linting options.

Changes

Files	Summary
`.pre-commit-config.yaml`	Updated `ruff` hook arguments to include `--unsafe-fixes`.
`custodian/gaussian/...`	Introduced Gaussian job handling and error management modules.
`pyproject.toml`	Added dependencies for Gaussian functionality.
`tests/files/gaussian/...` (multiple `.com` files)	Added various Gaussian input files for testing different computational scenarios.
`tests/gaussian/...`	Added new test modules for Gaussian job and error handling functionalities.

Poem

🐇💻
In the world of code, where logic is king,
A rabbit hopped in, bringing Gaussian things.
With jobs and errors, it danced and twirled,
Crafting a domain where molecules unfurled.
Cheers to changes, big and small,
In the land of atoms, we now stand tall! 🎉🔬

Recent Review Details

Configuration used: CodeRabbit UI
Review profile: CHILL

Commits

Files that changed from the base of the PR and between 866661d and 67c79dd.

Files ignored due to path filters (21)

tests/files/gaussian/bad_file.out.gz is excluded by !**/*.gz
tests/files/gaussian/coord_inputs.out.gz is excluded by !**/*.gz
tests/files/gaussian/coords_dict_geom.out.gz is excluded by !**/*.gz
tests/files/gaussian/coords_string_geom.out.gz is excluded by !**/*.gz
tests/files/gaussian/found_coords.out.gz is excluded by !**/*.gz
tests/files/gaussian/insufficient_memory.out.gz is excluded by !**/*.gz
tests/files/gaussian/linear_bend.out.gz is excluded by !**/*.gz
tests/files/gaussian/missing_file.out.gz is excluded by !**/*.gz
tests/files/gaussian/missing_mol.out.gz is excluded by !**/*.gz
tests/files/gaussian/mol_opt.out.gz is excluded by !**/*.gz
tests/files/gaussian/opt_steps_better_guess.out.gz is excluded by !**/*.gz
tests/files/gaussian/opt_steps_cycles.out.gz is excluded by !**/*.gz
tests/files/gaussian/opt_steps_from_structure.out.gz is excluded by !**/*.gz
tests/files/gaussian/opt_steps_int_grid.out.gz is excluded by !**/*.gz
tests/files/gaussian/scf_convergence_algorithm.out.gz is excluded by !**/*.gz
tests/files/gaussian/scf_convergence_better_guess.out.gz is excluded by !**/*.gz
tests/files/gaussian/scf_convergence_cycles.out.gz is excluded by !**/*.gz
tests/files/gaussian/solute_solvent_surface.out.gz is excluded by !**/*.gz
tests/files/gaussian/syntax.out.gz is excluded by !**/*.gz
tests/files/gaussian/walltime.out.gz is excluded by !**/*.gz
tests/files/gaussian/zmatrix.out.gz is excluded by !**/*.gz

Files selected for processing (32)

.pre-commit-config.yaml (1 hunks)
custodian/gaussian/init.py (1 hunks)
custodian/gaussian/handlers.py (1 hunks)
custodian/gaussian/jobs.py (1 hunks)
pyproject.toml (1 hunks)
tests/files/gaussian/Checkpoint.chk (1 hunks)
tests/files/gaussian/Gau-mock.rwf (1 hunks)
tests/files/gaussian/Optimization.chk (1 hunks)
tests/files/gaussian/bad_file.com (1 hunks)
tests/files/gaussian/coord_inputs.com (1 hunks)
tests/files/gaussian/coords_dict_geom.com (1 hunks)
tests/files/gaussian/coords_string_geom.com (1 hunks)
tests/files/gaussian/found_coords.com (1 hunks)
tests/files/gaussian/insufficient_memory.com (1 hunks)
tests/files/gaussian/linear_bend.com (1 hunks)
tests/files/gaussian/missing_file.com (1 hunks)
tests/files/gaussian/missing_mol.com (1 hunks)
tests/files/gaussian/mol_opt.com (1 hunks)
tests/files/gaussian/opt_steps_better_guess.com (1 hunks)
tests/files/gaussian/opt_steps_cycles.com (1 hunks)
tests/files/gaussian/opt_steps_from_structure.com (1 hunks)
tests/files/gaussian/opt_steps_int_grid.com (1 hunks)
tests/files/gaussian/scf_convergence_algorithm.com (1 hunks)
tests/files/gaussian/scf_convergence_better_guess.com (1 hunks)
tests/files/gaussian/scf_convergence_cycles.com (1 hunks)
tests/files/gaussian/solute_solvent_surface.com (1 hunks)
tests/files/gaussian/syntax.com (1 hunks)
tests/files/gaussian/walltime.com (1 hunks)
tests/files/gaussian/zmatrix.com (1 hunks)
tests/gaussian/init.py (1 hunks)
tests/gaussian/test_handlers.py (1 hunks)
tests/gaussian/test_jobs.py (1 hunks)

Files skipped from review due to trivial changes (4)

tests/files/gaussian/Checkpoint.chk
tests/files/gaussian/Gau-mock.rwf
tests/files/gaussian/Optimization.chk
tests/gaussian/init.py

Additional Context Used

Ruff (1)

custodian/gaussian/__init__.py (1)

1-3: One-line docstring should fit on one line

Additional comments not posted (36)

tests/files/gaussian/bad_file.com (1)

1-9: The Gaussian input file parameters and structure appear correctly formatted and consistent with typical Gaussian job requirements.

tests/files/gaussian/missing_file.com (1)

1-11: The Gaussian input file parameters and structure appear correctly formatted and consistent with typical Gaussian job requirements.

tests/files/gaussian/missing_mol.com (1)

1-12: The Gaussian input file parameters and structure appear correctly formatted and consistent with typical Gaussian job requirements.

tests/files/gaussian/zmatrix.com (1)

1-24: The Gaussian input file parameters and z-matrix structure appear correctly formatted and consistent with typical Gaussian job requirements.

tests/files/gaussian/mol_opt.com (1)

1-24: The Gaussian input file parameters and molecular configuration appear correctly formatted and consistent with typical Gaussian job requirements.

tests/files/gaussian/opt_steps_cycles.com (1)

1-24: The Gaussian input file parameters and optimization settings appear correctly formatted and consistent with typical Gaussian job requirements.

tests/files/gaussian/opt_steps_from_structure.com (1)

1-24: The Gaussian input file parameters and optimization settings appear correctly formatted and consistent with typical Gaussian job requirements.

tests/files/gaussian/scf_convergence_algorithm.com (1)

1-27: Configuration setup for Gaussian job appears correct and well-defined.

tests/files/gaussian/scf_convergence_cycles.com (1)

1-27: Configuration setup for Gaussian job appears correct and well-defined.

tests/files/gaussian/scf_convergence_better_guess.com (1)

1-27: Configuration setup for Gaussian job with improved initial guess settings appears correct and beneficial.

tests/files/gaussian/opt_steps_int_grid.com (1)

1-27: Configuration setup for Gaussian job with specific optimization and integration grid settings appears correct.

tests/files/gaussian/coord_inputs.com (1)

1-27: Configuration setup for Gaussian job using z-matrix coordinates appears correct and well-defined.

tests/files/gaussian/found_coords.com (1)

1-27: Configuration setup for Gaussian job to use geometry from a checkpoint file appears correct and useful for continuing previous calculations.

tests/files/gaussian/opt_steps_better_guess.com (1)

1-27: Configuration setup for Gaussian job with optimization steps and a better initial guess appears correct and aimed at enhancing calculation efficiency.

tests/files/gaussian/coords_string_geom.com (1)

1-27: Configuration setup for Gaussian job using string-based geometry connectivity appears correct and well-defined.

tests/files/gaussian/coords_dict_geom.com (2)

1-4: Ensure the Gaussian directives are correctly configured for the intended computations.

6-24: Verify the molecular geometry and charge/multiplicity settings are appropriate for the intended chemical computations.

.pre-commit-config.yaml (1)

13-13: Confirm the inclusion of --unsafe-fixes aligns with the project's coding standards and safety requirements.

tests/files/gaussian/insufficient_memory.com (1)

6-34: Verify the molecular geometry and charge/multiplicity settings are appropriate for the intended chemical computations.

tests/files/gaussian/linear_bend.com (2)

1-4: Ensure the Gaussian directives are correctly configured for the intended computations.

6-47: Verify the molecular geometry and charge/multiplicity settings are appropriate for the intended chemical computations.

tests/files/gaussian/syntax.com (1)

6-47: Verify the molecular geometry and charge/multiplicity settings are appropriate for the intended chemical computations.

tests/files/gaussian/solute_solvent_surface.com (2)

1-4: Ensure the Gaussian directives are correctly configured for the intended computations, especially given the complexity of the solute-solvent system.

6-61: Verify the molecular geometry and charge/multiplicity settings are appropriate for the intended chemical computations.

tests/files/gaussian/walltime.com (2)

1-4: Ensure the Gaussian directives are correctly configured for the intended computations, especially in the context of wall time testing.

6-68: Verify the molecular geometry and charge/multiplicity settings are appropriate for the intended chemical computations.

tests/gaussian/test_jobs.py (1)

22-83: Ensure the test cases for GaussianJob are comprehensive and cover all necessary scenarios, including error handling and edge cases.

pyproject.toml (1)

44-44: Dependency addition approved.

The addition of the gaussian dependency with pymatgen and matplotlib is consistent with the PR's objectives to enhance Gaussian computational chemistry functionalities.

custodian/gaussian/jobs.py (2)

30-120: Implementation of GaussianJob class approved.

The GaussianJob class is well-structured and includes appropriate methods for managing Gaussian jobs. The use of subprocess for running commands and the file management functionalities (backup and suffix) are well implemented.

120-202: Implementation of generate_better_guess method approved.

The generate_better_guess method is well-designed, using a generator to yield GaussianJob instances for better initial guesses. The integration with GaussianErrorHandler and the management of input and output files are handled appropriately.

tests/gaussian/test_handlers.py (1)

31-374: Test cases implementation approved.

The test cases in tests/gaussian/test_handlers.py are comprehensive and well-implemented. They appropriately test various Gaussian error handlers across different scenarios, ensuring robust error handling in Gaussian jobs.

custodian/gaussian/handlers.py (5)

43-47: Class definition and documentation are clear and concise.

158-188: Method implementation for recursive lowercase conversion is efficient and correct.

815-820: Class definition and documentation for WallTimeErrorHandler are clear and concise.

824-848: Initialization of WallTimeErrorHandler is robust and correctly handles the setup of wall time and buffer time.

864-871: Method for checking wall time is correctly implemented and efficiently calculates time remaining.

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

Share

Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>.
- Generate unit testing code for this file.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai generate unit testing code for this file.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai generate interesting stats about this repository and render them as a table.
- @coderabbitai show all the console.log statements in this repository.
- @coderabbitai read src/utils.ts and generate unit testing code.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (invoked as PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger a review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai help to get help.

Additionally, you can add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.

CodeRabbit Configration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 9

coderabbitai · 2024-04-24T20:52:48Z

custodian/gaussian/__init__.py

+"""
+This package implements various Gaussian Jobs and Error Handlers.
+"""


Condense the multi-line docstring into a single line to adhere to Python best practices.

- """ - This package implements various Gaussian Jobs and Error Handlers. - """ + """This package implements various Gaussian Jobs and Error Handlers."""

Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation.

Suggested change

"""

This package implements various Gaussian Jobs and Error Handlers.

"""

"""This package implements various Gaussian Jobs and Error Handlers."""

coderabbitai · 2024-04-24T20:52:48Z