
Updated parse_policy_info function in augment.py #13509

Open

wants to merge 1 commit into master
Conversation

LakshmiKalaKadali (Collaborator)

This PR adds more flexible control over the randomness in the augmentation process by changing `level += tf.random.normal([], dtype=tf.float32)` to `level += level_std * tf.random.normal([], dtype=tf.float32)` in the function `_parse_policy_info`, instead of always using a standard deviation of 1.
```diff
@@ -1869,7 +1869,7 @@ def _parse_policy_info(name: str,
   func = NAME_TO_FUNC[name]

   if level_std > 0:
-    level += tf.random.normal([], dtype=tf.float32)
+    level += level_std * tf.random.normal([], dtype=tf.float32)
```
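
For context, a minimal, self-contained sketch of the perturbation this line performs after the change. This is not the repository code; the `perturb_level` helper and the `_MAX_LEVEL = 10.0` constant are stand-ins for the surrounding logic in `augment.py`:

```python
import tensorflow as tf

_MAX_LEVEL = 10.0  # stand-in for the constant used in augment.py


def perturb_level(level: float, level_std: float) -> tf.Tensor:
  """Illustrative helper: adds Gaussian noise to an augmentation level.

  Before this PR the noise always had standard deviation 1 whenever
  level_std > 0; after this PR the standard deviation is level_std itself.
  """
  level = tf.constant(level, dtype=tf.float32)
  if level_std > 0:
    level += level_std * tf.random.normal([], dtype=tf.float32)
  # Keep the perturbed level in the valid range, mirroring the clipping the
  # real code applies downstream.
  return tf.clip_by_value(level, 0.0, _MAX_LEVEL)


# With level_std=0.5 the perturbation is roughly half as wide as before.
print(float(perturb_level(5.0, 0.5)))
```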
Member

This will change the behavior completely. I am worried about the effects of this. Have you done any tests verifying it won't break existing results that use this augmentation?
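
To make the behavior change concrete: with the same random draw, the old and new formulas agree only when `level_std == 1.0`, so any existing config with a different `level_std` will see different augmentation levels. A quick, illustrative comparison (not part of the PR):

```python
import tensorflow as tf

tf.random.set_seed(42)
noise = tf.random.normal([], dtype=tf.float32)

level, level_std = 5.0, 0.3
old_level = level + noise              # pre-PR: noise std fixed at 1
new_level = level + level_std * noise  # post-PR: noise std = level_std
print(float(old_level), float(new_level))  # differ whenever level_std != 1
```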

Member

It's OK to bring more flexibility, but the default behavior should keep backward compatibility.
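
One hypothetical way to add the flexibility while keeping the default behavior unchanged (a sketch, not something proposed in the PR) is to introduce a separate scaling parameter, here called `noise_scale`, that defaults to 1.0:

```python
import tensorflow as tf


def perturb_level(level: tf.Tensor,
                  level_std: float,
                  noise_scale: float = 1.0) -> tf.Tensor:
  """Hypothetical backward-compatible variant (not from the PR).

  level_std keeps its original role: noise is only added when it is > 0.
  noise_scale is a new, hypothetical parameter defaulting to 1.0, so the
  default behavior matches the current code exactly; callers who want
  wider or narrower noise can override it.
  """
  if level_std > 0:
    level += noise_scale * tf.random.normal([], dtype=tf.float32)
  return level
```

With the default, existing configs behave exactly as today; only callers that explicitly set `noise_scale` get the new behavior.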

@yeqingli added the `ready to pull` (create internal pr review and merge automatically) label on Jan 14, 2025
@LakshmiKalaKadali removed the `ready to pull` (create internal pr review and merge automatically) label on Jan 20, 2025
@OrangeDoro left a comment

Hi! I'm a grad student working on a research project about using large language models to automate code review. Based on your commit 06cbb37 and the changes in official/vision/ops/augment.py, my tool generated this comment:

  1. Null Value Checks: The function _parse_policy_info does not check if the parameters are None before using them. It is advisable to add checks at the beginning of the function to ensure that these parameters are valid.
  2. Data Type and Range Validation: Ensure that level_std is validated before it is used in the multiplication. If level_std is negative or not a number, it could lead to unexpected behavior. Consider adding checks to ensure that level_std is a non-negative float.
  3. Type Checks: There are no checks to ensure that the types of the parameters are as expected. For instance, replace_value should be a list of integers, and level_std should be a float.
  4. Clipping of Level: Verify that the new value of level after scaling does not exceed _MAX_LEVEL or fall below 0, especially if level_std is large.
  5. Handling Abnormal Page Data: The code does not handle cases where level might exceed _MAX_LEVEL after the addition of noise. It is important to ensure that the input to tf.clip_by_value is valid.
  6. Functionality of level_to_arg: Ensure that the functions mapped in args can handle the new range of level values correctly. If any of these functions expect level to be within a specific range, the scaling could lead to errors or unexpected behavior.
  7. Function Argument Validation: The function level_to_arg returns a dictionary of functions based on the name parameter. There should be a check to ensure that name is valid and exists in the args dictionary.
  8. Scaling of Random Normal Value: Ensure that level_std is intended to be a scaling factor for the randomness; otherwise, this could introduce unintended behavior.
  9. Error Handling: Consider implementing error handling for cases where the random generation or subsequent calculations fail. This can prevent the application from crashing or behaving unpredictably.
  10. Testing: Implement unit tests that cover various scenarios, including edge cases where level_std is 0, very small, or very large, to ensure that the changes do not introduce any logical errors in the overall functionality.
  11. Testing for Variability: Add tests to verify the variability of level when level_std is set to different values (e.g., 0, positive values). Ensure that the output level reflects the expected range based on the input level_std.
  12. Boundary Tests: Add tests to check the behavior of the level variable when level_std is set to 0. The output should equal the input level.
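
For what it's worth, items 10-12 could be covered by a small test along these lines, written against the illustrative `perturb_level` helper sketched earlier in this thread (again, an assumption, not code from the repository):

```python
import tensorflow as tf

_MAX_LEVEL = 10.0


def perturb_level(level, level_std):
  # Same illustrative helper sketched earlier in the thread.
  level = tf.constant(level, dtype=tf.float32)
  if level_std > 0:
    level += level_std * tf.random.normal([], dtype=tf.float32)
  return tf.clip_by_value(level, 0.0, _MAX_LEVEL)


class LevelPerturbationTest(tf.test.TestCase):

  def test_zero_std_leaves_level_unchanged(self):
    # level_std == 0 skips the noise branch, so the level passes through.
    self.assertAllClose(perturb_level(5.0, 0.0), 5.0)

  def test_larger_std_gives_wider_spread(self):
    # Repeated draws with a larger level_std should spread out more.
    def spread(std, n=200):
      samples = [float(perturb_level(5.0, std)) for _ in range(n)]
      mean = sum(samples) / n
      return sum((s - mean) ** 2 for s in samples) / n

    tf.random.set_seed(0)
    small = spread(0.1)
    tf.random.set_seed(0)
    large = spread(2.0)
    self.assertGreater(large, small)


if __name__ == "__main__":
  tf.test.main()
```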

As part of my research, I'm trying to understand how useful these comments are in real-world development. If you have a moment, I'd be super grateful if you could quickly reply to these two yes/no questions:

  1. Does this comment provide suggestions from a dimension you hadn’t considered?

  2. Do you find this comment helpful?

Thanks a lot for your time and feedback! And sorry again if this message is a bother.

Labels
`models:official` (models that come under official repository)