Skip to content

Conversation

@yaoyi92
Copy link

@yaoyi92 yaoyi92 commented Feb 18, 2025

Summary

Discussed in #1123. Following the same strategy as ase tasks, output ForceFieldStructureTaskDocument or ForceFieldMoleculeTaskDocument based on the input type of mol_or_struct.

Checklist

Work-in-progress pull requests are encouraged, but please put [WIP] in the pull request
title.

Before a pull request can be merged, the following items must be checked:

  • Code is in the standard Python style.
    The easiest way to handle this is to run the following in the correct sequence on
    your local machine. Start with running ruff and ruff format on your new code. This will
    automatically reformat your code to PEP8 conventions and fix many linting issues.
  • Doc strings have been added in the Numpy docstring format.
    Run ruff on your code.
  • Type annotations are highly encouraged. Run mypy to
    type check your code.
  • Tests have been added for any new functionality or bug fixes.
  • All linting and tests pass.

Note that the CI system will run all the above checks. But it will be much more
efficient if you already fix most errors prior to submitting the PR. It is highly
recommended that you use the pre-commit hook provided in the repository. Simply run
pre-commit install and a check will be run prior to allowing commits.

@yaoyi92 yaoyi92 force-pushed the mol_for_forcefield branch 3 times, most recently from bfbb00f to 6e8c8e0 Compare February 18, 2025 18:42
@yaoyi92 yaoyi92 changed the title [WIP]: Molecule for forcefield Molecule for forcefield Feb 18, 2025
@yaoyi92
Copy link
Author

yaoyi92 commented Feb 18, 2025

Dear @JaGeo, can you help take a look at this PR at your convenience?

@JaGeo
Copy link
Member

JaGeo commented Feb 18, 2025

Thank you!

I am tagging @utf and @esoteric-ephemera as well as we have to decide how to handle molecular and structural outputs in general.

@yaoyi92
Copy link
Author

yaoyi92 commented Feb 18, 2025

Here is my understanding of how to handle molecular and structural outputs. I saw in atomate2, two approaches exist. In FHI-aims, and cp2k engines, their taskdoc inherents from both StructureMetadata, MoleculeMetadata and the output structure takes the same type of the input structure. While, in ase engine, the implementation is to output StructureTaskDoc or MoleculeTaskDoc based on the input structure. Since, the forcefield inherents mostly from ase, I picked the second approach. The FHI-aims/cp2k approach is cleaner in software engineering. However, StructureMetadata and MoleculeMetadata seems to have different symmetry types. I am not sure whether a conflict will happen is symmetry is used. My understanding is to make them fully compatible with each other would require a rework at the emmet-core level.

@esoteric-ephemera
Copy link
Collaborator

Hey @yaoyi92, thanks for making this extension to the forcefields! The split that occurs for StructureMetadata and MoleculeMetadata-based task documents in ASE is to ensure there are no collisions between attributes with the same names.

I would avoid merging the ememt-core classes - we're already in the process of reworking the TaskDoc and other schemas in emmet-core to enforce stricter typing

To @JaGeo's point: we have too many disparate schemas for each code, and it would likely be best to have them originate with the same base document models. Maybe we should start an issue for this?

@JaGeo
Copy link
Member

JaGeo commented Feb 20, 2025

@esoteric-ephemera I agree with your points and I also believe we should think about the task docs more broadly.

structure: Molecule | Structure,
prev_dir: str | Path | None = None,
) -> ForceFieldTaskDocument:
) -> ForceFieldStructureTaskDocument | ForceFieldMoleculeTaskDocument:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am not sure why this isn't done for all other ForceFieldMakers? There, I only see a replacement with ForceFieldStructureTaskDocument.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Which makers are you referring to? I only found two makers that have an explicit make function, ForceFieldRelaxMaker and ForceFieldMDMaker. It seems, the other makers inherent from these two.

Comment on lines 236 to 255
@classmethod
def from_ase_task_doc(
cls, ase_task_doc: AseTaskDoc, **task_document_kwargs
) -> AseStructureTaskDoc:
"""Create an AseStructureTaskDoc for a task that has ASE-compatible outputs.
Parameters
----------
ase_task_doc : AseTaskDoc
Task doc for the calculation
task_document_kwargs : dict
Additional keyword args passed to :obj:`.AseStructureTaskDoc()`.
"""
task_document_kwargs.update(
{k: getattr(ase_task_doc, k) for k in _task_doc_translation_keys},
structure=ase_task_doc.mol_or_struct,
)
return cls.from_structure(
meta_structure=ase_task_doc.mol_or_struct, **task_document_kwargs
)

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is not needed anymore?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This function is only called from the forcefields schemas.
I moved this function to src/atomate2/forcefields/schemas.py.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I see. However, it is a general static method. Why not keeping it here?

Copy link
Author

@yaoyi92 yaoyi92 Feb 26, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have added the method back in this commit. 7b32166

A list of tags for the task.
task_document_kwargs : dict (deprecated)
Additional keyword args passed to :obj:`.ForceFieldTaskDocument()`.
Additional keyword args passed to :obj:`.ForceFieldStructureTaskDocument()`.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Additional keyword args passed to :obj:`.ForceFieldStructureTaskDocument()`.
Additional keyword args passed to :obj:`.ForceFieldStructureTaskDocument()` or :obj:`.ForceFieldMoleculeTaskDocument()`.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, I updated these descriptions in 4e9f061.

@yaoyi92 yaoyi92 force-pushed the mol_for_forcefield branch from 7b32166 to 4e9f061 Compare March 7, 2025 17:18
@JaGeo
Copy link
Member

JaGeo commented Mar 7, 2025

I am happy to merge after these changes. I just fear we will have a CI failure now. mp-api and monty need to be updated

@yaoyi92 yaoyi92 force-pushed the mol_for_forcefield branch from 4e9f061 to d8592fd Compare April 18, 2025 15:22
@yaoyi92
Copy link
Author

yaoyi92 commented Apr 18, 2025

Dear @JaGeo, just check, do we have the CI issue fixed? I think this pull request is read to merge.

@JaGeo JaGeo enabled auto-merge (squash) April 18, 2025 15:50
@JaGeo
Copy link
Member

JaGeo commented Apr 18, 2025

Thank you. If the tests workflow run through, it will get merged!

@JaGeo JaGeo disabled auto-merge April 18, 2025 15:52
@JaGeo
Copy link
Member

JaGeo commented Apr 18, 2025

@yaoyi92 still some failures, unfortunately.

@yaoyi92
Copy link
Author

yaoyi92 commented Apr 18, 2025

It seems my modification is incompatible with some new commits, let me see what I can do.

@yaoyi92 yaoyi92 force-pushed the mol_for_forcefield branch from d1da370 to 2f5a3a1 Compare April 18, 2025 16:35
@yaoyi92
Copy link
Author

yaoyi92 commented Apr 18, 2025

I fixed the incompatibility issue, how ever it seems there is still some error related to this merge of version bump of mdanalysis #1176.

yaoyi92 added 5 commits April 21, 2025 14:18
…ent based on the input type of mol_or_struct.

change name ForceFieldTaskDocument => ForceFieldStructureTaskDocument

output ForceFieldStructureTaskDocument or ForceFieldMoleculeTaskDocument based on type of mol_or_struct

update ForceFieldTaskDocument => ForceFieldStructureTaskDocument in the tests

import Union from typing

include Union in forcefield/md.py

take the suggestions from the formatter

take ruff's suggestions

try again with ruff format

ruff format again

try again ruff

ruff again

fix the mypy error

Take inputs of both Molecule and Structure

update docstring

add molecule test for forcefield
@yaoyi92 yaoyi92 force-pushed the mol_for_forcefield branch from 55fffef to 95fd127 Compare April 21, 2025 19:22
@yaoyi92
Copy link
Author

yaoyi92 commented Apr 21, 2025

Dear @JaGeo, can you help run the workflow again, I believe the testing can be fixed by this commit 95fd127.

@JaGeo
Copy link
Member

JaGeo commented Jun 16, 2025

@yaoyi92 hey, do you still want to get it merged? Sorry for not following up!

@yaoyi92
Copy link
Author

yaoyi92 commented Jun 17, 2025

@yaoyi92 hey, do you still want to get it merged? Sorry for not following up!

Yes, if the CI works fine now. I can try to merge it.

@yaoyi92
Copy link
Author

yaoyi92 commented Jun 22, 2025

Looks like all checks passed. Can you help merge it @JaGeo ?

@yaoyi92 yaoyi92 requested a review from JaGeo July 3, 2025 03:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants