
Regularization in higher dimensions #354

Open · wants to merge 10 commits into base: main

Conversation

@YuanbinLiu (Collaborator) commented Mar 7, 2025

We have upgraded the functionality to support high-dimensional convex hull calculations with regularization. Previously, it was limited to 3D convex hulls, but we are now able to handle higher-dimensional cases, such as 4D convex hulls for three-element systems.

  • Test 1: Verify whether the new code can produce the same convex hull results as the old version for the 3D case.

  • Test 2: Evaluate whether the new code can handle high-dimensional (>3D) convex hulls.
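For context, here is a minimal sketch of what Test 2 might exercise, assuming scipy's `ConvexHull` (the data and names are hypothetical, not the PR's actual test). qhull handles arbitrary dimension, and each row of `hull.equations` stores a facet normal plus an offset:

```python
import numpy as np
from scipy.spatial import ConvexHull

# Hypothetical data: 50 random points in 4D, standing in for the
# (composition, energy) points of a three-element system.
rng = np.random.default_rng(0)
points = rng.random((50, 4))

hull = ConvexHull(points)

# Each facet equation is (normal..., offset): a point x lies inside the
# hull iff normal . x + offset <= 0 for every facet.
print(hull.equations.shape[1])  # 5 = 4 normal components + 1 offset

# Sanity check: the centroid of the hull vertices must be interior.
centroid = points[hull.vertices].mean(axis=0)
inside = np.all(hull.equations[:, :-1] @ centroid + hull.equations[:, -1] <= 1e-9)
print(inside)  # True
```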

@JaGeo (Collaborator) commented Mar 7, 2025

Thank you @YuanbinLiu ! Some tests are still failing.

Can we handle anything beyond 4D? If not, should we add this limitation to the documentation?

@YuanbinLiu (Collaborator, Author) replied:

> Thank you @YuanbinLiu ! Some tests are still failing.
>
> Can we handle anything beyond 4D? If not, should we add this limitation to the documentation?

It's not limited to 4D; higher dimensions are also feasible. I'll find time to check for errors, but the tests passed before I pushed the PR.

@JaGeo (Collaborator) commented Mar 7, 2025

@YuanbinLiu Thanks for the explanation!

Test results can depend on the operating system.

@YuanbinLiu (Collaborator, Author) replied:

> @YuanbinLiu Thanks for the explanation!
>
> Test results can depend on the operating system.

I have tried it several times, and all tests passed. I didn't find any bugs. Can anyone test it on a different system?

```python
        f"Point and points must have the same dimensionality. Got {pn.shape[0]} and {preg.shape[1]}."
    )
hull = ConvexHull(preg)
return np.all(np.dot(hull.equations[:, :-1], pn) + hull.equations[:, -1] <= 1e-12)
```
Review comment (Collaborator):

Is this floating-point comparison maybe brittle?

Reply (Collaborator, Author):

It shouldn't be caused by this. I tested it on different computers with new environments, and it worked fine.
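On the brittleness question: the check is the standard half-space test, and the constant absorbs round-off in the facet equations. A sketch with the tolerance made an explicit parameter (a hypothetical helper, not the PR's code):

```python
import numpy as np
from scipy.spatial import ConvexHull

def point_in_hull(point, hull, tol=1e-12):
    """Half-space test: a point is inside (or on) the hull iff it satisfies
    every facet inequality normal . x + offset <= tol.

    tol guards against round-off; loosening it (e.g. 1e-9) trades a little
    strictness for robustness across platforms.
    """
    return bool(np.all(hull.equations[:, :-1] @ point + hull.equations[:, -1] <= tol))

square = ConvexHull(np.array([[0., 0.], [1., 0.], [0., 1.], [1., 1.]]))
print(point_in_hull(np.array([0.5, 0.5]), square))  # True: interior point
print(point_in_hull(np.array([2.0, 2.0]), square))  # False: outside
```

Making `tol` a parameter would at least let a failing platform be diagnosed by loosening it in one place.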

```python
fraction_list = [[1.0]] + [[0.0]] + [[0.5]] * 8
calc_hull_ND = calculate_hull_nd(label)

assert np.allclose(calc_hull_3D.equations, calc_hull_ND.equations, atol=1e-6)
```
Review comment (Collaborator):

Different tolerance needed?

```python
get_e_dist_hull_ND = get_e_distance_to_hull_nd(
    calc_hull_ND, atom, {3: -0.28649227, 17: -0.25638457}, "REF_energy"
)
assert np.allclose(get_e_dist_hull_3D, get_e_dist_hull_ND, atol=1e-6)
```
Review comment (Collaborator):

Same here?

@JaGeo (Collaborator) commented Mar 7, 2025

@YuanbinLiu could you check all numerical tolerances again, especially for the failing test? It is likely just the tolerance.

```diff
@@ -245,7 +245,7 @@ def test_regularization_for_three_element_system(test_dir, memory_jobstore, clea
         for at in atoms
     ]

-    assert all(d >= -1e-6 for d in des)
+    assert np.all(np.array(des) >= -1e6)
```
Review comment (Collaborator):

Shouldn't it be -1e-6?

Review comment from @naik-aakash (Collaborator), Mar 8, 2025.

Reply from @YuanbinLiu (Collaborator, Author), Mar 10, 2025:

If some configurations have a distance from the convex hull exceeding 1e6, they will be excluded from the training set afterwards.
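A sketch of the exclusion logic described above (the function and variable names are hypothetical, not the PR's code):

```python
import numpy as np

# Configurations further than this from the convex hull are dropped
# from the training set; the test's -1e6 lower bound mirrors this cutoff
# and only guards against numerically nonsensical (large negative) values.
MAX_HULL_DISTANCE = 1e6

def filter_by_hull_distance(distances, structures, cutoff=MAX_HULL_DISTANCE):
    """Keep only structures whose distance to the hull is within the cutoff."""
    keep = np.asarray(distances) <= cutoff
    return [s for s, k in zip(structures, keep) if k]

print(filter_by_hull_distance([0.1, 2e6, 0.0], ["a", "b", "c"]))  # ['a', 'c']
```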

@naik-aakash (Collaborator) commented Mar 8, 2025

> @YuanbinLiu could you check all numerical tolerances again, especially for the failing test? It is likely just the tolerance.

Are the values supposed to be nonzero, positive, or negative? Is there a specific range these values are supposed to fall in? Also, with this new function supporting n dimensions, do we still need the 3D method, or can it be replaced by the new function now? Can you comment on this, @YuanbinLiu?

@YuanbinLiu (Collaborator, Author) replied:

> @YuanbinLiu could you check all numerical tolerances again, especially for the failing test? It is likely just the tolerance.
>
> Are the values supposed to be nonzero, positive, or negative? Is there a specific range these values are supposed to fall in? Also, with this new function supporting n dimensions, do we still need the 3D method, or can it be replaced by the new function now? Can you comment on this, @YuanbinLiu?

They should be non-negative (i.e., >= 0). And we no longer need the 3D method, as that case is already handled by the new function.

```diff
@@ -699,6 +699,7 @@ def preprocess_data(
     regularization: bool = False,
     retain_existing_sigma: bool = False,
     scheme: str = "linear-hull",
+    element_order: list = None,
```
Review comment (Collaborator):

Suggested change:

```diff
-    element_order: list = None,
+    element_order: list | None = None,
```

Annotating a `None` default as plain `list` is a type mismatch; the annotation should allow `None`.
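The suggested annotation can be sketched as follows (a reduced, hypothetical signature, not the full `preprocess_data`):

```python
from __future__ import annotations  # makes `list | None` valid before Python 3.10

def preprocess_data(element_order: list | None = None) -> list:
    """Hypothetical reduced signature illustrating the `list | None` annotation."""
    if element_order is None:
        # Fall back to a fresh empty ordering; never use a mutable default.
        element_order = []
    return element_order

print(preprocess_data())              # []
print(preprocess_data(["Li", "Cl"]))  # ['Li', 'Cl']
```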

```python
    norm = np.cross(n_d[2] - n_d[0], n_d[1] - n_d[0])
    plane_norm = norm / np.linalg.norm(norm)
else:
    A = n_d[:-1] - n_d[0]
```
Review comment (Collaborator):

Maybe clearer variable names would be better?
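For reference, the general-dimension branch presumably recovers the hyperplane normal from the null space of the edge matrix `A`; one common way to do that is via the last right singular vector of an SVD. A sketch with more descriptive names (a hypothetical helper, not the PR's code):

```python
import numpy as np

def hyperplane_normal(vertices):
    """Unit normal of the hyperplane through d affinely independent points in R^d.

    For d == 3 a cross product suffices; in general the normal spans the
    null space of the (d-1) x d edge matrix, recovered here from the last
    right singular vector of its SVD.
    """
    vertices = np.asarray(vertices, dtype=float)
    edge_matrix = vertices[1:] - vertices[0]   # shape (d-1, d)
    _, _, vt = np.linalg.svd(edge_matrix)
    normal = vt[-1]                            # basis vector of the null space
    return normal / np.linalg.norm(normal)

# The xy-plane through three points: the normal is a unit vector along ±z.
n = hyperplane_normal([[0., 0., 0.], [1., 0., 0.], [0., 1., 0.]])
print(n)
```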
