
Commit 11346a6

Merge pull request #39 from opendatagroup/1.8-doc-updates
1.8 doc updates
2 parents c9ec780 + 9ca814d commit 11346a6

14 files changed (+983 -135 lines)

Archived/Product Documentation/Java Model Runner.md

+8-10
````diff
@@ -14,14 +14,14 @@ This page describes how to load and run models in each of these cases.
 
 ## Generic Java models
 
-A generic Java model can execute arbitrary Java code. In order to run this model in FastScore, it must implement a particular model interface: the `IJavaModel` interface. This interface includes `begin`, `action`, and `end` methods, analogous to Python and R models.
+A generic Java model can execute arbitrary Java code. In order to run this model in FastScore, it must implement a particular model interface: the `FastScoreModel` interface. This interface includes `begin`, `action`, and `end` methods, analogous to Python and R models. Note that only the `action` method is required; the other methods have default implementations and only need to be overridden when the model uses them.
 
 ``` java
-import fastscore.IJavaModel;
+import fastscore.FastScoreModel;
 
-public class MyModel implements IJavaModel
+public class MyModel implements FastScoreModel
 {
-
+    @Override
     public void begin()
     {
         ...
@@ -32,6 +32,7 @@ public class MyModel implements IJavaModel
         ...
     }
 
+    @Override
     public void end()
     {
         ...
@@ -76,7 +77,7 @@ A Spark model must follow the same conformance guidelines as a generic Java mode
 Here is an example Spark model that assumes that the `LogisticRegressionModel` was previously created and saved under the `scalaLogisticRegressionWithBFGSModel` folder and then uploaded to FastScore as an attachment.
 
 ``` java
-import fastscore.IJavaModel;
+import fastscore.FastScoreModel;
 import org.apache.spark.SparkConf;
 import org.apache.spark.SparkContext;
 import org.apache.spark.sql.SparkSession;
@@ -86,14 +87,15 @@ import org.apache.spark.mllib.classification.LogisticRegressionModel;
 import org.apache.spark.mllib.linalg.Vector;
 import org.apache.spark.mllib.linalg.Vectors;
 
-public class MLLibLRModel implements IJavaModel {
+public class MLLibLRModel implements FastScoreModel {
 
     LogisticRegressionModel _lrModel;
 
     public MLLibLRModel() {
         System.out.println("MLLib Linear Regression model");
     }
 
+    @Override
     public void begin() {
         SparkConf conf = new SparkConf();
         conf.setAppName("ML Lib LR Model");
@@ -120,10 +122,6 @@ public class MLLibLRModel implements IJavaModel {
         }
 
     }
-
-    public void end() {
-
-    }
 }
 ```
````
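To make the conformance rules concrete, here is a minimal sketch of a model that overrides only the required method. The stand-in interface below is declared locally so the example compiles on its own; it is not FastScore SDK code, and the `action` signature is assumed here for illustration only (the real `fastscore.FastScoreModel` may differ).

```java
// Stand-in for fastscore.FastScoreModel, declared locally so this sketch is
// self-contained; the real interface ships with the FastScore SDK and its
// action() signature may differ from the one assumed here.
interface FastScoreModel {
    default void begin() {}      // optional lifecycle hook (default no-op)
    String action(String datum); // the only required method
    default void end() {}        // optional lifecycle hook (default no-op)
}

// A minimal conforming model: it implements only the required action()
// method and inherits the no-op begin() and end() defaults.
class EchoModel implements FastScoreModel {
    @Override
    public String action(String datum) {
        return "echo: " + datum;
    }
}
```

Because `begin` and `end` carry default implementations, a model overrides them only when it has setup or teardown work to do, as `MLLibLRModel` does for its Spark context.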

@@ -0,0 +1,122 @@
---
title: "Time Streams"
excerpt: "A stream that delivers timestamps instead of data"
---
# Time Streams

## Overview

Many analytic models have a notion of time. Such models may receive time-related
values as part of their inputs, or they may obtain them through standard
library calls. The latter adds an external dependency to the model: FastScore
cannot control the time value the model gets from the system. The time
streams proposed here remove that dependency.

A time stream feeds timestamps to the model according to the settings in its
stream descriptor. Time streams enable the following two important use cases:

* A 'fake' time for simulations, verification, and model training;
* Periodic model activations without actual data.

## A time stream descriptor

An example of a time stream descriptor:
``` json
{
  "Transport": {
    "Type": "time",
    "Period": 2.0
  },
  "Schema": {
    "type": "long",
    "logicalType": "timestamp-millis"
  }
}
```

The above stream delivers a timestamp to the model every 2s.
The Transport element of a time stream supports the following properties:

Property | Type | Required | Default | Description
---------|------|----------|---------|------------
Type | string | Yes | | Set to "time" or "Time"
TimeZero | string or null | No | null | The beginning of simulated time (ISO 8601)
Delay | number | No | 0.0 | Wait this number of seconds before sending the first timestamp
Period | number | No | 1.0 | Time between timestamps, in seconds
MaxCount | integer or null | No | null | Generate no more than this number of timestamps
Overflow | string | No | "all" | How to handle out-of-sync timestamps ("skip" or "all")
TimeZero controls the simulated time. The difference between normal and
simulated time is calculated when the stream is instantiated. The model
receives the following timestamps: TimeZero + Delay, TimeZero + Delay +
Period, TimeZero + Delay + 2 * Period, and so on. If TimeZero is omitted or
set to null, the time stream uses the current time.

The number of timestamps delivered to the model can be capped using the
MaxCount property. After the stream generates this many timestamps, it
signals EOF.

If the model is slow, it may not be able to process all timestamps on time.
The stream behaviour with respect to out-of-sync timestamps depends on the
Overflow property. If Overflow is "all", the stream delivers all timestamps
regardless of their timeliness. This may result in batches of timestamps
delivered at the same time. If Overflow is "skip", only timely timestamps are
delivered, and each skipped timestamp produces a warning message.
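The delivery schedule described above (TimeZero + Delay + k * Period, capped by MaxCount) can be sketched in Java. This is an illustration only; `TimeStreamSchedule` and its parameters are hypothetical names, not part of FastScore.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, for illustration only: compute the instants at which
// a time stream with the given settings delivers timestamps, expressed as
// timestamp-millis values.
class TimeStreamSchedule {
    static List<Long> schedule(long timeZeroMillis, double delaySec,
                               double periodSec, int maxCount) {
        List<Long> out = new ArrayList<>();
        for (int k = 0; k < maxCount; k++) {
            // TimeZero + Delay + k * Period, converted to milliseconds
            out.add(timeZeroMillis + (long) ((delaySec + k * periodSec) * 1000.0));
        }
        return out;
    }
}
```

For example, with TimeZero = 0, no delay, Period = 2.0, and MaxCount = 3, the stream would deliver timestamps 0, 2000, and 4000 before signalling EOF.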

The Transport element may be set to just "time" to assume default values for
all properties.

As with any boundary-preserving stream, the Envelope property of a time stream
must be either omitted or set to null. The Encoding property must be either
omitted or set to "bert".

The Avro schema has a special logical type for timestamps, or rather two such
types: one for millisecond resolution and another for microsecond resolution.
We only support millisecond timestamps (timestamp-millis).

The time stream must not use batching: a timestamp must be delivered to the
model immediately, without buffering. Thus, Batching must be omitted or set to
null.

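The constraints just described (Envelope and Batching omitted or null, Encoding omitted or "bert") can be checked mechanically. Below is a sketch that assumes the descriptor has already been parsed into a map; `TimeStreamChecks` is a hypothetical name, not a FastScore API.

```java
import java.util.Map;

// Hypothetical validator, for illustration only: enforce the time-stream
// rules for Envelope, Encoding, and Batching described above.
class TimeStreamChecks {
    static boolean isValid(Map<String, Object> descriptor) {
        Object envelope = descriptor.get("Envelope"); // must be omitted or null
        Object batching = descriptor.get("Batching"); // must be omitted or null
        Object encoding = descriptor.get("Encoding"); // omitted or "bert"
        return envelope == null
            && batching == null
            && (encoding == null || "bert".equals(encoding));
    }
}
```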
The simplest valid time stream descriptor looks as follows:
``` json
{
  "Transport": "time"
}
```

It is equivalent to the following stream descriptor:
``` json
{
  "Transport": {
    "Type": "time",
    "TimeZero": null,   // normal time
    "Delay": 0.0,       // no delay
    "Period": 1.0,      // every 1s
    "MaxCount": null,   // indefinite length
    "Overflow": "all"   // deliver out-of-sync timestamps
  },
  "Envelope": null,     // no envelope
  "Encoding": "bert",   // internal encoding
  "Schema": {
    "type": "long",
    "logicalType": "timestamp-millis"
  },
  "Batching": null      // no batching
}
```

Time streams are input only.

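The shorthand-to-defaults equivalence above can be sketched as a small expansion routine. This is a hypothetical helper for illustration only; FastScore itself performs any such normalization internally according to the defaults table.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, for illustration only: expand the "Transport": "time"
// shorthand into the full set of default transport properties.
class TimeTransportDefaults {
    static Map<String, Object> expand(Object transport) {
        Map<String, Object> full = defaults();
        if (transport instanceof Map) {
            @SuppressWarnings("unchecked")
            Map<String, Object> given = (Map<String, Object>) transport;
            full.putAll(given); // explicit settings override the defaults
        }
        return full; // the shorthand string form keeps all defaults
    }

    static Map<String, Object> defaults() {
        Map<String, Object> d = new LinkedHashMap<>();
        d.put("Type", "time");
        d.put("TimeZero", null);  // normal time
        d.put("Delay", 0.0);      // no delay
        d.put("Period", 1.0);     // every 1s
        d.put("MaxCount", null);  // indefinite length
        d.put("Overflow", "all"); // deliver out-of-sync timestamps
        return d;
    }
}
```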
## Other considerations

There is a somewhat similar situation with access to a random number
generator. Model verification may need to 'replay' the sequence of random
numbers used by the model. Or, the model may need cryptographically strong
random numbers from a hardware source. The mapping to the FastScore stream
concept is less obvious here. A practical workaround could be to seed the RNG
using a timestamp provided by a simulated time stream.

## More info

TODO

Getting Started/FastScore Specs/index.md

+40-39
```diff
@@ -9,42 +9,43 @@
 | H2O || | CPU utilization (data deserialization) ||
 | Matlab || | Sensors ||
 | C || | Default sensors installed ||
-| | | | Dashboard sensor support ||
-| **Certified Deployment Options** | | | | |
-| Linux || | **Workflow, Concurrency, Scaling, etc** | |
-| AWS || | Single model complex analytic workflows ||
-| On-premise || | Multi-model complex analytic workflows ||
-| Private Cloud || | Single machine scaling ||
-| Public Cloud || | Infrastructure Scaling (multi-server, cloud, etc) ||
-| Azure || | Intra-engine concurrecy ||
-| Google Cloud || | Multi-engine concurrency ||
-| MacOS || | Model state persistence checkpointing ||
-| | | | Model state staring ||
-| **Data Source Types** | | | Multiple input/output streams ||
-| REST || | | |
-| Kafka || | **Third Party Orchestrators** ||
-| File || | Mesos/Marathon/DCOS ||
-| ODBC || | Swarm ||
-| HTTP || | Kubernetes ||
-| Experimental (TCP/UDP/Exec) || | | |
-| Kafka (Authenticated) || | **Model Management and AnalyticOps** | |
-| S3 (Authenticated) || | Store/Edit/Select Models ||
-| | | | Store/Edit/Select Streams ||
-| **Schema Definition Formats** | | | Store/Edit/Select Schemas ||
-| Avro Schema || | | |
-| Avro Schema Extensions (Restrictions) || | **Machine Learning Integration** | |
-| | | | R [ R ] ||
-| **Data Encoding Formats** | | | scikit-learn [ Python ] ||
-| Raw || | ml.lib [POJO ] ||
-| JSON || | H2O [POJO] ||
-| Avro-binary || | Tensorflow [ Python, R ] ||
-| UTF-8 || | | |
-| SOAP/RPC || | **Integration and Management Interfaces** | |
-| | | | RESTful API ||
-| **Environment Management** | | | GUI Dashboard ||
-| Import Policy || | CLI ||
-| | | | Model deploy Jupyter ||
-| **FastScore SDK** | | | | |
-| Python 2 || | **Authentication and Access Control** | |
-| Python 3 || | LDAP Authentication ||
-| Scala/Java || | Dashboard LDAP Authentication ||
+| Scala || | Dashboard sensor support ||
+| | | | | |
+| **Certified Deployment Options** | | | **Workflow, Concurrency, Scaling, etc** | |
+| Linux || | Single model complex analytic workflows ||
+| AWS || | Multi-model complex analytic workflows ||
+| On-premise || | Single machine scaling ||
+| Private Cloud || | Infrastructure Scaling (multi-server, cloud, etc) ||
+| Public Cloud || | Intra-engine concurrency ||
+| Azure || | Multi-engine concurrency ||
+| Google Cloud || | Model state persistence checkpointing ||
+| MacOS || | Model state sharing ||
+| | | | Multiple input/output streams ||
+| **Data Source Types** | | | | |
+| REST || | **Third Party Orchestrators** ||
+| Kafka || | Mesos/Marathon/DCOS ||
+| File || | Swarm ||
+| ODBC || | Kubernetes ||
+| HTTP || | | |
+| Experimental (TCP/UDP/Exec) || | **Model Management and AnalyticOps** | |
+| Kafka (Authenticated) || | Store/Edit/Select Models ||
+| S3 (Authenticated) || | Store/Edit/Select Streams ||
+| | | | Store/Edit/Select Schemas ||
+| **Schema Definition Formats** | | | | |
+| Avro Schema || | **Machine Learning Integration** | |
+| Avro Schema Extensions (Restrictions) || | R [ R ] ||
+| | | | scikit-learn [ Python ] ||
+| **Data Encoding Formats** | | | ml.lib [ POJO ] ||
+| Raw || | H2O [ POJO ] ||
+| JSON || | Tensorflow [ Python, R ] ||
+| Avro-binary || | | |
+| UTF-8 || | **Integration and Management Interfaces** | |
+| SOAP/RPC || | RESTful API ||
+| | | | GUI Dashboard ||
+| **Environment Management** | | | CLI ||
+| Import Policy || | Model deploy Jupyter ||
+| | | | | |
+| **FastScore SDK** | | | **Authentication and Access Control** | |
+| Python 2 || | LDAP Authentication ||
+| Python 3 || | Dashboard LDAP Authentication ||
+| Scala/Java || | |
```
