Handle NAGLCharges by j-wags · Pull Request #1206 · openforcefield/openff-interchange

j-wags · 2025-04-23T02:25:03Z

Description

Implements NAGLCharges as currently drafted in SMIRNOFF EP 11.

This interfaces with a PR in the OpenFF Toolkit at openforcefield/openff-toolkit#2048.

The spec change also includes the definition of some new model fetching and verification behavior, which can be found at openforcefield/openff-nagl-models#61, but which isn't required for these tests to pass.

codecov · 2025-04-23T02:29:17Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 93.89%. Comparing base (35def0c) to head (281c1a8).
⚠️ Report is 43 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1206      +/-   ##
==========================================
+ Coverage   93.87%   93.89%   +0.01%     
==========================================
  Files          72       72              
  Lines        6258     6275      +17     
==========================================
+ Hits         5875     5892      +17     
  Misses        383      383


j-wags · 2025-07-09T23:27:00Z

openff/interchange/_tests/conftest.py

            match="all_permutations",
            distance="0.8 * angstrom ** 1",
-            charge_increment1="0.0 * elementary_charge ** 1",
+            charge_increment1="0.123 * elementary_charge ** 1",


This needed to be a nonzero value for a NAGL test. Happy to make a separate fixture or put this in the test directly to avoid possible cross-contamination.

j-wags · 2025-07-09T23:40:52Z

openff/interchange/_tests/unit_tests/smirnoff/test_nonbonded.py

+        numpy.testing.assert_allclose(expected_charges_unitless, assigned_charges_unitless)
+
+
+class TestNAGLChargesErrorHandling:


All these tests below here are largely AI-assisted and curated by me, which is to say that I'm not offended at all if you think some are only marginally valuable and deserve deletion.

mattwthompson

This is looking good - I'm really happy that the implementation was relatively simple; since it "quacks" much like other charge assignment methods, the investment in the complexity that already exists in smirnoff/_nonbonded.py is finally paying off in a way.

Since this is such a critical piece of infrastructure, I want to be more thorough than normal when adding it in. There are a few tests/categories of tests that I think are missing:

interactions with charge_from_molecules
charge assignment logging
making sure a system containing a large molecule takes a reasonable amount of time


mattwthompson · 2025-07-10T15:58:12Z

openff/interchange/_tests/unit_tests/smirnoff/test_nonbonded.py

+
+        hexane_diol.assign_partial_charges("openff-gnn-am1bcc-0.1.0-rc.3.pt")
+        # Leave the ToolkitAM1BCC tag in openff-2.1.0 to ensure that the NAGLCharges handler takes precedence
+        ff = ForceField("openff-2.1.0.offxml")


Is there a reason to use 2.1.0 specifically? There are some other Sage version(s) already in fixtures and re-using those would cut down on some repetitive code

Nope! I'll remove usages of these where possible and replace with fixtures

mattwthompson · 2025-07-10T16:00:48Z

openff/interchange/_tests/unit_tests/smirnoff/test_nonbonded.py

+
+            with pytest.raises(MissingPackageError, match="NAGL software isn't present"):
+                Interchange.from_smirnoff(force_field=ff, topology=hexane_diol.to_topology())


What would happen if a user did charge_from_molecules=[hexane_diol]?

That should succeed, and I've added to this test to ensure it does.


mattwthompson · 2025-07-10T16:31:20Z

openff/interchange/smirnoff/_nonbonded.py

+            from openff.toolkit.utils.toolkits import GLOBAL_TOOLKIT_REGISTRY
+
+            partial_charge_method = parameter_handler.model_file
+            if "NAGL" not in GLOBAL_TOOLKIT_REGISTRY.__repr__():
+                raise MissingPackageError(
+                    "The force field has a NAGLCharges section, but the NAGL software isn't "
+                    "present in GLOBAL_TOOLKIT_REGISTRY",
+                )
+


I think this block is doing the toolkit's job for it - what would it look like if this was removed? I have to assume the toolkit handles this error and it would bubble up to the user through Interchange in a legible way

So... This is masking a nasty side effect of shoving everything through Molecule.assign_partial_charges, which is that the Toolkit treats model names like any other charge method (that is, it doesn't know that the call is supposed to get routed to NAGLToolkitWrapper). If we didn't have this, the error raised here would be:

E ValueError: No registered toolkits can provide the capability "assign_partial_charges" for args "()" and kwargs "{'molecule': Molecule with name '' and SMILES '[H][O][C]([H])([H])[C]([H])([H])[C]([H])([H])[C]([H])([H])[C]([H])([H])[C]([H])([H])[O][H]', 'partial_charge_method': 'openff-gnn-am1bcc-0.1.0-rc.3.pt', 'use_conformers': None, 'strict_n_conformers': False, 'normalize_partial_charges': True, '_cls': <class 'openff.toolkit.topology.molecule.Molecule'>}"
E Available toolkits are: [ToolkitWrapper around The RDKit version 2025.03.4]
E ToolkitWrapper around The RDKit version 2025.03.4 <class 'openff.toolkit.utils.exceptions.ChargeMethodUnavailableError'> : partial_charge_method 'openff-gnn-am1bcc-0.1.0-rc.3.pt' is not available from RDKitToolkitWrapper. Available charge methods are {'mmff94': {}, 'gasteiger': {}}

Which doesn't tell the user the important thing they need to know in this case: they either need to install NAGL or get it in their ToolkitRegistry.

So I'm in favor of leaving this as-is.

Actually, this made me realize there's a family of edge cases to consider, where NAGL charge assignment fails for various reasons. For example:

If NAGL just doesn't cover some elements in the molecule, then lower-precedence charge models should be attempted (NOT the current behavior, I'm committing a test for this)

I wonder if there's already a problem with this sort of fallback if a force field only had ToolkitAM1BCC and CIMH?

The error message printed when this happens SHOULDN'T just be from the most recent charge method to fail, but instead should include reasons why EACH attempted charge method failed (otherwise the user might just see one error from CIMH when they really want to know why their NAGLCharges tag didn't work)

If the NAGL model file isn't found and can't be fetched, that should be an immediate failure (this is the case now)

If the NAGL hash comparison fails, that should be an immediate failure (this is the case now)
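
The failure-handling rules listed above can be sketched in plain Python. This is only an illustration under stated assumptions, not Interchange's implementation: `ChargeMethodFailed`, `FatalChargeError`, `assign_with_fallback`, and the toy charge methods are all hypothetical names. It assumes methods are tried in precedence order, that recoverable failures are collected so the final error reports every attempted method, and that fatal failures (such as a missing model file or a hash mismatch) abort immediately.

```python
class ChargeMethodFailed(Exception):
    """Recoverable: this method cannot charge the molecule (hypothetical)."""


class FatalChargeError(Exception):
    """Unrecoverable: e.g. model file missing or hash mismatch (hypothetical)."""


def assign_with_fallback(molecule, methods):
    """Try (name, callable) pairs in precedence order.

    Returns (name, charges) from the first method that succeeds. Recoverable
    failures are collected so the final error lists why each method failed.
    FatalChargeError is deliberately not caught, so it aborts immediately.
    """
    failures = []
    for name, method in methods:
        try:
            return name, method(molecule)
        except ChargeMethodFailed as error:
            failures.append(f"{name}: {error}")
    raise ValueError(
        "No charge method could assign charges:\n" + "\n".join(failures)
    )


# Toy stand-ins for real charge methods; a molecule is just a list of elements.
def nagl_like(molecule):
    if "Xe" in molecule:
        raise ChargeMethodFailed("element Xe is not covered by the model")
    return [0.0] * len(molecule)


def increment_like(molecule):
    return [0.0] * len(molecule)


def broken_model(molecule):
    raise FatalChargeError("model hash does not match")


# The higher-precedence method fails recoverably, so the next one is used.
used, charges = assign_with_fallback(
    ["Xe"],
    [("NAGLCharges", nagl_like), ("ChargeIncrementModel", increment_like)],
)
print(used, charges)
```

Putting `broken_model` first in the list would raise `FatalChargeError` without trying any lower-precedence method, which mirrors the "immediate failure" cases.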

I understand the highlighted code to be trying to do the job of making sure that NAGL is installed/available/etc. before running charge assignment on it. If we did s/NAGL/RDKit/ and s/charge assignment/conformer generation/ would you grant that this wiring is supposed to be handled by the toolkit?

I agree that error message is not useful for a user, but I would rather have the error bubble up in one place and it be handled as gracefully as possible. This may be a sizable block of a try followed by potentially many excepts but that feels safer and more maintainable than trying to handle this particular error here. I have yet to think through all of these failure modes, but I would rather the toolkit tell me NAGL isn't installed, a file isn't found, etc. and report that back to the user here. Probably some of these cases are already not reported to the user well and could be improved.

Put a different way, I'm concerned that this opens the door to Interchange being responsible for making sure charge assignment methods and optional dependencies are properly lined up whereas Molecule is the choke point of all paths.

If we did s/NAGL/RDKit/ and s/charge assignment/conformer generation/ would you grant that this wiring is supposed to be handled by the toolkit?

Yes.

I agree with this being resolved into a bunch of try/excepts, and trying not to make Interchange responsible for so many details. I'll give that a shot, there's a little magic in ToolkitRegistry.call (the raise_exception_types named argument) that may make this more straightforward.
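
As a rough sketch of the try/except approach discussed here: the caller lets the lower layer raise its own error and only translates it into something a user can act on. The exception classes below are local stand-ins that borrow names from this thread, and `low_level_assign` and `assign_nagl_charges` are hypothetical helpers, not the Toolkit or Interchange API.

```python
class MissingPackageError(Exception):
    """Local stand-in for a user-facing 'optional dependency missing' error."""


class ChargeMethodUnavailableError(Exception):
    """Local stand-in for the low-level 'no toolkit knows this method' error."""


def low_level_assign(method, available_methods):
    # Hypothetical stand-in for the toolkit call; it knows nothing about NAGL.
    if method not in available_methods:
        raise ChargeMethodUnavailableError(
            f"partial_charge_method '{method}' is not available"
        )
    return f"charges from {method}"


def assign_nagl_charges(model_file, available_methods):
    """Delegate to the lower layer and translate its error for the user."""
    try:
        return low_level_assign(model_file, available_methods)
    except ChargeMethodUnavailableError as error:
        # Chain the original error so the low-level details stay visible.
        raise MissingPackageError(
            f"The force field requests NAGL model '{model_file}', but no "
            "available toolkit can run it. Install NAGL or add it to the "
            "toolkit registry."
        ) from error


try:
    assign_nagl_charges("example-model.pt", {"mmff94", "gasteiger"})
except MissingPackageError as error:
    print(error)
```

Chaining with `from error` keeps the original traceback available for debugging while the top-level message states what the user needs to do.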

Awesome - I hope we can get this smoothness without a ton of effort or complexity here. (If not, a less elegant solution which gets things working okay may be appropriate!)


j-wags

Notes to self:

Tell matt that test_charge_assignment_logging layout is a pain to make changes to

Tell matt that almost all new tests are AI generated at my prompting and while I've reviewed and revised them they were way less work to make than they appear so I'm fine to just trash or consolidate any of them

The pytest.mark.slow decorator doesn't appear to be doing anything (the tests are run unconditionally and pytest raises an error if I pass it --runslow)

Review Codecov diff and see why it's so sad https://app.codecov.io/gh/openforcefield/openff-interchange/pull/1206?dropdown=coverage&src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=checks&utm_campaign=pr+comments&utm_term=openforcefield


mattwthompson

Fantastic work - I found one detail which I think warrants further consideration, but all other comments are minor and non-blocking

mattwthompson · 2025-08-07T16:24:37Z

openff/interchange/_tests/unit_tests/smirnoff/test_nonbonded.py

+            # No error should be raised if using charge_from_molecules
+            sage_with_nagl_charges.create_interchange(
+                topology=hexane_diol.to_topology(),
+                charge_from_molecules=[hexane_diol],
+            )


Worth double-checking that the charges are correct (I think re-using code from a similar test would be fine). It's possible that something weird happens when ForceField.create_interchange is only given a registry with one toolkit

mattwthompson · 2025-08-07T16:27:07Z

openff/interchange/_tests/unit_tests/smirnoff/test_nonbonded.py

+    @pytest.mark.xfail(
+        reason="charge assignment handler fallback behavior not yet implemented",
+        raises=ValueError,
+    )


FTR I'm happy with pushing this implementation into the future, it's an edge case which a user ought to not come across without trying funky things


mattwthompson · 2025-08-07T16:34:50Z

openff/interchange/_tests/unit_tests/smirnoff/test_nonbonded.py

+        # Create a very large molecule
+        large_molecule = Molecule.from_smiles("C" * 200)  # 200-carbon alkane chain
+
+        start_time = time.time()


It shouldn't make a difference with 30 seconds as the threshold, and it's probably already "warmed up" from another test, but I wonder whether the model is already loaded into memory at this step (loading frequently takes a few seconds, compared to <1 second of runtime).

mattwthompson · 2025-08-07T16:43:37Z

openff/interchange/_tests/conftest.py

+def sage_with_nagl_charges(sage):
+    sage.get_parameter_handler(


Suggested change

-def sage_with_nagl_charges(sage):
-    sage.get_parameter_handler(
+def sage_nagl(sage):
+    sage.deregister_parameter_handler("ToolkitAM1BCC")

(blocking) this fixture should drop AM1-BCC from Sage

IIUC, that would get it to match what's going to be used in production

It's hard to imagine this breaking tests since AM1BCC should not be sniffed with the tests as written, but I would really like to avoid surprises in releases 2.3.0+ if this breaks anything

test_nagl_charges_precedence_over_am1bcc would break with this change, but I think the behavior (what happens when there are both? then read the title of the test back to oneself) is worth testing, so maybe just add ToolkitAM1BCCHandler back in that test only

I can take or leave the name change, but I think there is value in differentiating "Sage which happens to use NAGL charges" from "Sage which really is based on NAGL charges"

Agree, great catch. Implementing now.

Implemented


mattwthompson · 2025-08-07T16:47:47Z

Taking the liberty to click "Ready for review" since ... uh, I'm already doing that


mattwthompson

Changes look good; nothing surprising seems to have come up.

I looked at the code coverage report and

The non-test/non-tooling diff is quite small
The coverage is great

Merge main in, update whatever docs you can think of, and I think we're good?

j-wags · 2025-08-19T15:27:22Z

@mattwthompson I've updated the releasehistory and think this is ready to merge. This is a big PR so I'd like to turn it over to you to push the merge button. I propose calling the release 0.4.5, since that saves me a package build (OFFTK is currently depending on interchange =0.4) but no big deal if you'd prefer to change the releasenotes and call it 0.5.0.


j-wags added 2 commits April 22, 2025 19:07

initial implementation of NAGLChargesHandler

0df511b

have testing env use naglcharges toolkit branch

482128c

j-wags mentioned this pull request Apr 23, 2025

Implement NAGLChargesHandler openforcefield/openff-toolkit#2048

Merged

j-wags self-assigned this Jun 13, 2025

j-wags mentioned this pull request Jun 27, 2025

Implement fetching by doi and custom hashes openforcefield/openff-nagl-models#61

Merged

j-wags and others added 6 commits July 8, 2025 16:59

Merge branch 'main' into naglcharges-handler

d7aa607

adding a bunch of tests, some todos remain

d8f7070

[pre-commit.ci] auto fixes from pre-commit.com hooks

b27d68d

for more information, see https://pre-commit.ci

I guess valueerror is fine

e592850

Merge remote-tracking branch 'origin/fix-1254' into naglcharges-handler

da7686f

update vsite charge test

6856153

j-wags commented Jul 9, 2025


j-wags changed the title ~~[WIP] Handle NAGLCharges~~ Handle NAGLCharges Jul 9, 2025

j-wags commented Jul 9, 2025


j-wags marked this pull request as ready for review July 9, 2025 23:41

j-wags requested a review from mattwthompson as a code owner July 9, 2025 23:41

j-wags assigned mattwthompson and unassigned j-wags Jul 10, 2025

mattwthompson requested changes Jul 10, 2025


mattwthompson added this to the 0.5.0 milestone Jul 10, 2025

j-wags and others added 9 commits July 11, 2025 13:23

Apply suggestions from code review

d0c522d

Co-authored-by: Matt Thompson <mattwthompson@protonmail.com>

Replace usages of Interchange.from_smirnoff with ForceField.create_in…

272092a

…terchange

remove repeated nagl FF creation and replace with new fixture

6655de5

add check that charge_from_molecules takes precedence over NAGLCharges

d0f52a4

Tighten all total charge tolerances in new tests down to 1e-10

18e7abc

add test for NAGL charge assignment failure falling back to lower pre…

2f14578

…cedence charge model that CAN assign charges

[pre-commit.ci] auto fixes from pre-commit.com hooks

39f61ef

for more information, see https://pre-commit.ci

Implement correct(er) error handling/reporting behavior for NAGLCharges

8abbd36

test new error handling logic

f869775

j-wags added 2 commits August 4, 2025 08:29

have docs env fetch dev branches of toolkit and nagl-models

3625f3e

Increase allowed execution time on nagl runtime tests

f69c249

j-wags commented Aug 5, 2025


j-wags commented Aug 6, 2025


route all charge assignment through cached _compute_partial_charges m…

2af90cf

…ethod

jameseastwood assigned mattwthompson and unassigned j-wags Aug 7, 2025

mattwthompson mentioned this pull request Aug 7, 2025

Fix "slow" test configuration #1287

Open

mattwthompson requested changes Aug 7, 2025


mattwthompson marked this pull request as ready for review August 7, 2025 16:47

j-wags added 4 commits August 7, 2025 10:11

remove toolkitam1bcc from sage_with_nagl_charges test fixture

1a8a4d1

rename sage_with_nagl_charges test fixture to sage_nagl

1a1beda

remove unnecessary ToolkitAM1BCCHandler registration+deregistration f…

7944479

…rom CIMH test

point to main branch of nagl-models now that PR is merged

518b8b0

mattwthompson approved these changes Aug 7, 2025


mattwthompson removed their assignment Aug 12, 2025

jameseastwood assigned j-wags Aug 15, 2025

mattwthompson and others added 4 commits August 18, 2025 11:13

Merge remote-tracking branch 'upstream/main' into naglcharges-handler

9ec2ff7

Drop development versions

4fca8fd

clean up pre-merge

5676133

update releasehistory

e60656c

mattwthompson added 3 commits August 19, 2025 10:50

Merge remote-tracking branch 'upstream/main' into naglcharges-handler

d4fb4e9

Merge remote-tracking branch 'upstream/naglcharges-handler' into nagl…

3c9ff5c

…charges-handler

Merge remote-tracking branch 'upstream/main' into naglcharges-handler

281c1a8

mattwthompson merged commit 1354fc8 into main Aug 19, 2025
16 checks passed

j-wags assigned mattwthompson and unassigned j-wags Aug 19, 2025

mattwthompson deleted the naglcharges-handler branch January 5, 2026 22:04


3 participants
