Improve error message with mismatched valtype/dtype #2739

jacobhinkle · 2024-08-01T14:03:46Z

We already check types when binding arguments, but now we explicitly state which arguments mismatch, and in which positions.

Fixes #2670 by improving the error message. Before this change the error message was

Expected T0_g[ iS0{i1} ] to be bound to an at::Tensor, but got l

After this PR, the error is

Expected input 0, T0_g[ iS0{i0} ], to be an at::Tensor but got scalar 2

jacobhinkle · 2024-08-01T14:03:57Z

!build

tfogal

That error message is a great improvement! Thank you.

jacobhinkle · 2024-08-02T13:22:23Z

This is not an ideal fix since it replicates (poorly) the validation that already exists in ExpressionEvaluator::bind. I did it this way to be able to provide the argument number, which we lose once we get to the bind call. I'll try again to just improve the error message locally within bind and omit the argument number for now. In the future, it might be nice to be able to declare error contexts with guards so that additional info is printed. For example in bindInputs we could have:

for (const auto i : c10::irange(inputs.size())) {
  ErrorContext e("Error binding input " + std::to_string(i));
  expr_eval.bind(inputs[i], *args[i], true);
}

Then if NVF_CHECK or NVF_ERROR fails inside the lifetime of e, we would get an error message like "Error binding input 2: <actual error>". That's a more general problem that we should handle separately (I'll post an issue for it). For now I will just try to clean up the bind error messages and revert the current changes.
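To make the guard idea above concrete, here is a minimal sketch of how such an error context could work, assuming a thread-local stack of context strings. Nothing here is an existing nvFuser API: ErrorContext is the proposed name from the comment, while checkWithContext (a stand-in for NVF_CHECK) and bindAll (a toy replacement for the bindInputs loop) are hypothetical helpers introduced only for illustration.

```cpp
#include <cstddef>
#include <stdexcept>
#include <string>
#include <utility>
#include <vector>

// Hypothetical RAII guard: pushes a message on construction, pops it on
// destruction. Not part of nvFuser; a sketch of the idea only.
class ErrorContext {
 public:
  explicit ErrorContext(std::string msg) {
    stack().push_back(std::move(msg));
  }
  ~ErrorContext() {
    stack().pop_back();
  }
  ErrorContext(const ErrorContext&) = delete;
  ErrorContext& operator=(const ErrorContext&) = delete;

  // Joins all active contexts, outermost first, e.g. "outer: inner: ".
  static std::string prefix() {
    std::string out;
    for (const auto& s : stack()) {
      out += s + ": ";
    }
    return out;
  }

 private:
  // One stack per thread, so guards on different threads do not interfere.
  static std::vector<std::string>& stack() {
    thread_local std::vector<std::string> contexts;
    return contexts;
  }
};

// Hypothetical stand-in for NVF_CHECK: the prefix is captured before the
// throw, while the guard is still alive.
inline void checkWithContext(bool cond, const std::string& msg) {
  if (!cond) {
    throw std::runtime_error(ErrorContext::prefix() + msg);
  }
}

// Toy version of the bindInputs loop: inputs_ok[i] says whether binding
// input i would succeed. Returns the error text, or "" if all bind.
inline std::string bindAll(const std::vector<bool>& inputs_ok) {
  try {
    for (std::size_t i = 0; i < inputs_ok.size(); ++i) {
      ErrorContext e("Error binding input " + std::to_string(i));
      checkWithContext(inputs_ok[i], "expected an at::Tensor");
    }
  } catch (const std::runtime_error& err) {
    return err.what();
  }
  return "";
}
```

With this sketch, a failure on the third input produces a message that starts with "Error binding input 2: ". The guard costs one push and one pop per iteration even when nothing fails, which may matter on a latency-sensitive path.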

jacobhinkle · 2024-08-05T14:18:42Z

I fixed this up by determining the input position inside the bind_ call when a check fails. I also added some cases to the test to check that our error messages work properly for other kinds of malformed inputs.
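The approach described above, looking up the input position only once a check has already failed, can be sketched roughly as follows. This is an illustration under stated assumptions, not the code merged in this PR: Val is a simplified stand-in for the real IR node, and describeVal and bindChecked are hypothetical helper names.

```cpp
#include <algorithm>
#include <stdexcept>
#include <string>
#include <vector>

// Simplified, hypothetical stand-in for an IR value.
struct Val {
  std::string name;
  bool expects_tensor;
};

// Called only on the failure path: a linear search through the fusion
// inputs to recover the position of the offending value.
inline std::string describeVal(
    const std::vector<const Val*>& fusion_inputs,
    const Val* value) {
  auto it = std::find(fusion_inputs.begin(), fusion_inputs.end(), value);
  if (it == fusion_inputs.end()) {
    // Not a fusion input, so there is no position to report.
    return value->name;
  }
  return "input " + std::to_string(it - fusion_inputs.begin()) + ", " +
      value->name + ",";
}

// Hypothetical bind-time check. The success path does a single comparison;
// the search for the position happens only when the check fails.
inline void bindChecked(
    const std::vector<const Val*>& fusion_inputs,
    const Val* value,
    bool arg_is_tensor,
    const std::string& arg_repr) {
  if (value->expects_tensor && !arg_is_tensor) {
    throw std::runtime_error(
        "Expected " + describeVal(fusion_inputs, value) +
        " to be an at::Tensor but got scalar " + arg_repr);
  }
}
```

The design choice here is that the successful path pays nothing for the richer message; the cost of finding the position is incurred only when an error is about to be thrown anyway.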

tfogal

Re-+1'ing the new changes.

But I also want to plug #1758: I'd really encourage an up-front pass to validate as soon as we can. As we found with latency work (#2136), ExprEvaluator is on the critical path, and so I worry about adding more checks in there.

Admittedly--these checks have to be on the critical path (since we can only validate when we see what inputs we actually receive). But it would probably behoove us to do an up-front check for UX purposes and avoid checks inside each Expr evaluation.
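As a rough illustration of the up-front pass suggested above, the sketch below validates all inputs once, before any expression is evaluated, and collects every mismatch instead of stopping at the first. ArgKind, kindName, and validateInputs are hypothetical names chosen for this example; the real check would need to compare dtypes and ranks as well.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical, simplified classification of an argument.
enum class ArgKind { Tensor, Scalar };

inline std::string kindName(ArgKind k) {
  return k == ArgKind::Tensor ? "tensor" : "scalar";
}

// One pass over all inputs, run once per call before evaluation starts.
// Returns one message per mismatch; an empty vector means the inputs match.
inline std::vector<std::string> validateInputs(
    const std::vector<ArgKind>& expected,
    const std::vector<ArgKind>& provided) {
  std::vector<std::string> errors;
  if (expected.size() != provided.size()) {
    errors.push_back(
        "Expected " + std::to_string(expected.size()) + " inputs but got " +
        std::to_string(provided.size()));
    return errors;
  }
  for (std::size_t i = 0; i < expected.size(); ++i) {
    if (expected[i] != provided[i]) {
      errors.push_back(
          "Input " + std::to_string(i) + ": expected " +
          kindName(expected[i]) + " but got " + kindName(provided[i]));
    }
  }
  return errors;
}
```

Reporting all mismatches together lets a user fix several bad arguments in one round trip, and it keeps the per-expression evaluation path free of argument checks.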

csrc/expr_evaluator.cpp

jacobhinkle requested review from tfogal and kevinstephano August 1, 2024 14:03

tfogal approved these changes Aug 1, 2024

jacobhinkle added 3 commits August 5, 2024 12:33

Revert changes to executor_utils and kernel_cache

2793b57

This can be handled locally in bind_

Clean up error messages in expr_evaluator.cpp and add tests

2f77819

Merge remote-tracking branch 'origin/main' into input_tensor_check

a2cc2f9

tfogal approved these changes Aug 5, 2024

Review thread on csrc/expr_evaluator.cpp (outdated, resolved)

Remove unused include

b92088e

jacobhinkle merged commit 37289f4 into main Aug 6, 2024
5 checks passed

jacobhinkle deleted the input_tensor_check branch August 6, 2024 12:26
