difference in the confidence values after stop word removal #831

cahuja1992 · 2019-07-18T05:42:56Z

Even though stop word removal is enabled, the confidence values for an utterance with and without stop words are different.

Example:
"turn on the wifi" vs "turn on wifi"; here, "the" is the stop word.
After looking into the code, I suspect that the confidence calculation also takes the number of tokens into account.

adrienball · 2019-07-18T08:31:17Z

@cahuja1992

> Even though the stop word removal is enabled

Are you referring to the ignore_stop_words parameter of LookupIntentParserConfig?

This parser tries to find an exact match with one of the training samples, and it can do so by ignoring stop words. If it does not find a match, then the probabilistic intent parser is used and this one will not ignore stop words.
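The two-stage behaviour described above can be pictured with a small toy sketch. It is not the Snips NLU implementation: the stop word list, the keyword set, the scoring rule, and every function name below are assumptions made up for illustration. It only shows why an utterance that hits the exact-match stage is insensitive to stop words, while one that falls through to a statistical stage is not.

```python
# Toy illustration only; none of these names come from Snips NLU.
STOP_WORDS = {"the", "a", "an"}      # assumed stop word list
KEYWORDS = {"turn", "on", "wifi"}    # assumed vocabulary of the toy classifier


def normalize(text, ignore_stop_words=True):
    """Lowercase, split on whitespace, optionally drop stop words."""
    tokens = text.lower().split()
    if ignore_stop_words:
        tokens = [t for t in tokens if t not in STOP_WORDS]
    return " ".join(tokens)


def build_lookup(training_samples, ignore_stop_words=True):
    """Map each normalized training utterance to its intent."""
    return {normalize(u, ignore_stop_words): intent
            for u, intent in training_samples}


def toy_probabilistic_score(text):
    """Stand-in for a statistical parser: it sees every token,
    stop words included, so the score depends on the token count."""
    tokens = text.lower().split()
    if not tokens:
        return 0.0
    return sum(t in KEYWORDS for t in tokens) / len(tokens)


def parse(text, lookup):
    """Try the exact-match stage first, then fall back."""
    key = normalize(text)
    if key in lookup:
        # Fixed score for an exact match in this toy.
        return lookup[key], 1.0, "lookup"
    return "turnOnWifi", toy_probabilistic_score(text), "probabilistic"


# Case 1: one formulation is in the training data, so both match exactly.
lookup = build_lookup([("turn on the wifi", "turnOnWifi")])
print(parse("turn on the wifi", lookup))
print(parse("turn on wifi", lookup))

# Case 2: neither formulation is in the training data, so both fall through
# to the toy statistical stage and get different scores.
other_lookup = build_lookup([("enable wifi", "turnOnWifi")])
print(parse("turn on the wifi", other_lookup))
print(parse("turn on wifi", other_lookup))
```

In the first case both utterances normalize to the same key and get the same score; in the second case the extra token changes the toy score, which is the kind of difference reported in this issue.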

In your case, is one of the two formulations ("turn on the wifi" or "turn on wifi") in your training data ?

cahuja1992 · 2019-07-18T15:29:06Z

@adrienball
Okay, got it. In that case, what I need is stop word removal at prediction time as well: can the stop words be removed from the input utterance before parsing?
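One possible workaround, sketched here as an assumption rather than a library feature, is to strip stop words from the utterance in application code before handing it to the engine. The stop word list and the helper name `strip_stop_words` are hypothetical; the call to the engine is left as a comment because it needs a fitted engine.

```python
# Hypothetical pre-processing helper; not part of Snips NLU.
STOP_WORDS = frozenset({"the", "a", "an"})  # assumed stop word list


def strip_stop_words(text, stop_words=STOP_WORDS):
    """Return the utterance with stop words removed (case-insensitive match)."""
    kept = [tok for tok in text.split() if tok.lower() not in stop_words]
    return " ".join(kept)


print(strip_stop_words("turn on the wifi"))  # prints: turn on wifi

# With a fitted engine, the cleaned text would be parsed instead of the raw one:
# result = engine.parse(strip_stop_words(user_input))
```

Two caveats with this approach: any character ranges in the parsing result would refer to the stripped text rather than to the original utterance, and a statistical model trained on utterances that still contain stop words may behave differently on stripped input, so applying the same stripping to the training utterances is probably needed for consistent scores.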
