feat: Add random state feature. #150

john-halloran · 2025-06-06T19:48:37Z

feat: Added random_state feature for reproducibility.

sbillinge

This is great!

We have to decide how much testing we will add. Ideal is 100% coverage, optimal is probably less.

Maybe write the docstrings so I can understand what the class does, then we can decide what to test?

sbillinge · 2025-06-06T20:58:34Z

src/diffpy/snmf/snmf_class.py

+        components=None,
+        random_state=None,
+    ):
+
        self.MM = MM


more descriptive name?

Changed to n_components, which is what sklearn.decomposition.NMF uses.

sbillinge · 2025-06-06T20:59:08Z

src/diffpy/snmf/snmf_class.py

+        MM,
+        Y0=None,
+        X0=None,
+        A=None,


more descriptive name?

There are many different standards for what to name these matrices. Zero agreement between sources that use NMF. I'm inclined to eventually use what sklearn.decomposition.non_negative_factorization uses, which would mean MM->X, X->W, Y->H. But I'd like to leave this as is for the moment until there's a consensus about what would be the most clear or standard. If people will be finding this tool from the sNMF paper, there's also an argument for using the X, Y, and A names because that was used there.

sbillinge · 2025-06-06T21:00:48Z

src/diffpy/snmf/snmf_class.py

@@ -4,8 +4,20 @@


 class SNMFOptimizer:
-    def __init__(self, MM, Y0=None, X0=None, A=None, rho=1e12, eta=610, max_iter=500, tol=5e-7, components=None):


we need a docstring here and in the init. Please see scikit-package FAQ about how to write these. Also, look at Yucong's code or diffpy.utils?

Added one here. The package init dates back to the old codebase, but as soon as that is updated it will get a docstring as well.

sbillinge · 2025-06-06T21:01:50Z

src/diffpy/snmf/snmf_class.py

@@ -15,23 +27,22 @@ def __init__(self, MM, Y0=None, X0=None, A=None, rho=1e12, eta=610, max_iter=500
        # Capture matrix dimensions
        self.N, self.M = MM.shape
        self.num_updates = 0
+        self.rng = np.random.default_rng(random_state)


can we have a more descriptive variable name? Is this a range? What is the range?

sbillinge · 2025-06-06T21:02:32Z

src/diffpy/snmf/snmf_class.py

        if self.A is None:
-            self.A = np.ones((self.K, self.M)) + np.random.randn(self.K, self.M) * 1e-3  # Small perturbation
+            self.A = np.ones((self.K, self.M)) + self.rng.normal(0, 1e-3, size=(self.K, self.M))


K and M are probably good names if the matrix decomposition equation is in hte docstring, so they get defined there.

john-halloran · 2025-06-06T21:09:28Z

This is great!

We have to decide how much testing we will add. Ideal is 100% coverage, optimal is probably less.

Maybe write the docstrings so I can understand what the class does, then we can decide what to test?

Thanks, will work on resolving these. To be clear, for things like the docstrings would you prefer I make new PRs, get those merged, then rebase this one, or just add to this existing PR?

john-halloran · 2025-06-08T06:23:10Z

For now, I will assume anything given as feedback in this PR could be in scope to include.

feat: Add random state feature.

d8d4e11

sbillinge reviewed Jun 6, 2025

View reviewed changes

Add class docstring

ae45726

components->n_components

3d7c8b6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Add random state feature. #150

feat: Add random state feature. #150

john-halloran commented Jun 6, 2025

Uh oh!

sbillinge left a comment

Uh oh!

sbillinge Jun 6, 2025

Uh oh!

john-halloran Jun 8, 2025

Uh oh!

sbillinge Jun 6, 2025

Uh oh!

john-halloran Jun 8, 2025

Uh oh!

sbillinge Jun 6, 2025

Uh oh!

john-halloran Jun 8, 2025

Uh oh!

sbillinge Jun 6, 2025

Uh oh!

sbillinge Jun 6, 2025

Uh oh!

john-halloran commented Jun 6, 2025

Uh oh!

john-halloran commented Jun 8, 2025

Uh oh!

Uh oh!

		@@ -4,8 +4,20 @@


		class SNMFOptimizer:
		def __init__(self, MM, Y0=None, X0=None, A=None, rho=1e12, eta=610, max_iter=500, tol=5e-7, components=None):

feat: Add random state feature. #150

Are you sure you want to change the base?

feat: Add random state feature. #150

Conversation

john-halloran commented Jun 6, 2025

Uh oh!

sbillinge left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

john-halloran commented Jun 6, 2025

Uh oh!

john-halloran commented Jun 8, 2025

Uh oh!

Uh oh!