Add new parsing apis #724

adrienball · 2018-12-11T18:16:27Z

Description:
This PR introduces 2 new parsing APIs.
The first one allows to run intent classification only, and get back a list of intents along with their probability:

>>> nlu_engine.get_intents("turn the lights on")
[
 {
   "intentName": "turnLightOn",
   "probability": 0.6363648460343694
 },
 {
   "intentName": null,
   "probability": 0.2580088944934134
 },
 {
   "intentName": "turnLightOff",
   "probability": 0.22791834836267366
 },
 {
   "intentName": "setTemperature",
   "probability": 0.181781583254962
 }
]

The second API allows to extract the slots when the intent is already known:

>>> engine.get_slots(u"Hey, lights on in the lounge !", "turnLightOn")
[
 {
   "range": {
     "start": 22,
     "end": 28
   },
   "rawValue": "lounge",
   "value": {
     "kind": "Custom",
     "value": "living room"
   },
   "entity": "room",
   "slotName": "room"
 }
]

On top of these two new APIs, an optional top_n parameter is added to the parse method, allowing to perform intent parsing on the top_n most likely intents.

This should address #623, and cover the feature brought by @deeiip in #715.

Checklist:

My PR is ready for code review
I have added some tests, if applicable, and run the whole test suite, including linting tests
I have updated the documentation, if applicable

codecov-io · 2018-12-12T10:51:27Z

Codecov Report

Merging #724 into develop will increase coverage by 0.27%.
The diff coverage is 91.24%.

@@             Coverage Diff             @@
##           develop     #724      +/-   ##
===========================================
+ Coverage    88.16%   88.44%   +0.27%     
===========================================
  Files           65       66       +1     
  Lines         3853     3963     +110     
  Branches       735      765      +30     
===========================================
+ Hits          3397     3505     +108     
- Misses         347      349       +2     
  Partials       109      109

ClemDoum · 2018-12-14T10:34:00Z

snips_nlu/utils.py

@@ -73,24 +69,29 @@ def classproperty(func):
 # pylint: enable=invalid-name


-def type_error(expected_type, found_type):
-    return TypeError("Expected %s but found: %s" % (expected_type, found_type))
+def type_error(expected_type, found_type, object_label=None):


I find the naming a bit misleading now.

ClemDoum · 2018-12-14T10:34:18Z

snips_nlu/utils.py



 def missing_key_error(key, object_label=None):
    if object_label is None:
-        return KeyError("Missing key: '%s'" % key)
-    return KeyError("Expected %s to have key: '%s'" % (object_label, key))
+        raise DatasetFormatError("Missing key: '%s'" % key)


I find the naming of the function a bit misleading now.

ClemDoum · 2018-12-14T10:36:24Z

snips_nlu/exceptions.py

+    format"""
+
+
+class IntentFormatError(SnipsNLUError):


Shouldn't these last 2 inherit from the DatasetFormatError ?

ClemDoum · 2018-12-14T10:45:21Z

snips_nlu/intent_classifier/log_reg_classifier.py

@@ -116,33 +116,49 @@ def get_intent(self, text, intents_filter=None):
            NotTrained: When the intent classifier is not fitted

        """
+        intents_results = self._get_intents(text, intents_filter)
+        if not intents_results or intents_results[0][RES_INTENT_NAME] is None:
+            return None


I'm wondering why we did this choice at the time.
I would prefer to prefer to have {"intentName": null, "probability": 0.6} as for the other intents.

Maybe this is the right time.

I would also move this logic to the IntentClassifier since the logic of returning the most likely intents, something like:

class IntentClassifier(with_metaclass(ABCMeta, ProcessingUnit)): @abstractmethod def fit(self, dataset): pass def get_intent(self, text, intents_filter): return self.get_intents(text)[0] @abstractmethod def get_intents(self, text): """Performs intent classification on the provided *text* and returns the list of intents ordered by decreasing probability The length of the returned list is exactly the number of intents in the dataset + 1 for the None intent .. note:: The probabilities returned along with each intent are not guaranteed to sum to 1.0. They should be considered as scores between 0 and 1. """ pass

since I don't think the logic of returning the most likely intent out of the intents list will change from one IntentClassifier from another.

But then you don't pass the intents_filter parameter.

timtutt · 2019-01-26T05:04:27Z

When will there be a new release that includes this PR?

ClemDoum · 2019-01-30T08:36:35Z

Hi @timtutt a new release should be available by the end of the week or early next week

adrienball requested a review from ClemDoum December 11, 2018 18:16

adrienball force-pushed the task/new-parsing-apis branch from 76cfd38 to 3e27bd5 Compare December 11, 2018 18:20

adrienball mentioned this pull request Dec 11, 2018

Feature top n intent #623 #715

Closed

3 tasks

adrienball force-pushed the task/new-parsing-apis branch from 1b033fa to ec1a307 Compare December 12, 2018 10:51

adrienball added 6 commits December 12, 2018 14:59

Group Snips NLU errors in dedicated file

d3a106c

Add get_slots API

ed40e0f

Add get_intents and parse top intents APIs

3dabf60

Update pylint

720f60a

Update documentation

d44150d

Fix tests

cc4bef5

adrienball force-pushed the task/new-parsing-apis branch from ec1a307 to cc4bef5 Compare December 12, 2018 14:07

ClemDoum requested changes Dec 14, 2018

View reviewed changes

Improve code after review

d1ad768

ClemDoum approved these changes Dec 14, 2018

View reviewed changes

adrienball merged commit c439a81 into develop Dec 14, 2018

adrienball deleted the task/new-parsing-apis branch December 14, 2018 15:33

adrienball mentioned this pull request Jan 25, 2019

Add new parsing apis snipsco/snips-nlu-rs#107

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add new parsing apis #724

Add new parsing apis #724

adrienball commented Dec 11, 2018 •

edited

Loading

codecov-io commented Dec 12, 2018 •

edited

Loading

ClemDoum Dec 14, 2018

ClemDoum Dec 14, 2018

ClemDoum Dec 14, 2018

adrienball Dec 14, 2018

ClemDoum Dec 14, 2018

ClemDoum Dec 14, 2018

ClemDoum Dec 14, 2018

adrienball Dec 14, 2018

timtutt commented Jan 26, 2019

ClemDoum commented Jan 30, 2019

Add new parsing apis #724

Add new parsing apis #724

Conversation

adrienball commented Dec 11, 2018 • edited Loading

codecov-io commented Dec 12, 2018 • edited Loading

Codecov Report

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

timtutt commented Jan 26, 2019

ClemDoum commented Jan 30, 2019

adrienball commented Dec 11, 2018 •

edited

Loading

codecov-io commented Dec 12, 2018 •

edited

Loading