platinummonkey
diff --git a/.travis.yml b/.travis.yml (-12)
diff --git a/README.md b/README.md (+128 -3)
diff --git a/core/doc.go b/core/doc.go (+2)
diff --git a/doc.go b/doc.go (+2)
diff --git a/docs/archetypes/default.md b/docs/archetypes/default.md (-6)
diff --git a/docs/assets/moving_percentile_reference.pdf b/docs/assets/moving_percentile_reference.pdf (-219 KB)
diff --git a/docs/config.toml b/docs/config.toml (-5)
diff --git a/docs/content/_index.md b/docs/content/_index.md (-157)
diff --git a/docs/content/_references.md b/docs/content/_references.md (-3)
diff --git a/docs/themes/hugo-theme-learn b/docs/themes/hugo-theme-learn
diff --git a/examples/doc.go b/examples/doc.go (+2)
diff --git a/grpc/doc.go b/grpc/doc.go (+2)
diff --git a/limit/doc.go b/limit/doc.go (+2)
diff --git a/limit/functions/doc.go b/limit/functions/doc.go (+2)
diff --git a/limiter/doc.go b/limiter/doc.go (+2)
diff --git a/measurements/doc.go b/measurements/doc.go (+2)
diff --git a/metric_registry/datadog/registry.go b/metric_registry/datadog/registry.go (+1)
diff --git a/metric_registry/doc.go b/metric_registry/doc.go (+2)
diff --git a/metric_registry/gometrics/registry.go b/metric_registry/gometrics/registry.go (+1)
diff --git a/patterns/doc.go b/patterns/doc.go (+2)
diff --git a/patterns/pool/doc.go b/patterns/pool/doc.go (+2)
diff --git a/patterns/example_fixed_pool_test.go b/patterns/pool/example_fixed_pool_test.go (renamed, +1 -1)
diff --git a/patterns/example_generic_pool_test.go b/patterns/pool/example_generic_pool_test.go (renamed, +1 -1)
diff --git a/patterns/example_lifo_fixed_pool_test.go b/patterns/pool/example_lifo_fixed_pool_test.go (renamed, +1 -1)
@@ -8,8 +8,6 @@ addons:
     packages:
       - python-pygments
 install:
-  - wget https://github.com/gohugoio/hugo/releases/download/v0.48/hugo_0.48_Linux-64bit.deb
-  - sudo dpkg -i hugo_0.48_Linux-64bit.deb
   - go get golang.org/x/tools/cmd/cover
   - go get github.com/mattn/goveralls
   - go get golang.org/x/lint/golint
@@ -19,13 +17,3 @@ script:
   - go vet ./...
   - go test -v -race -covermode=atomic -coverprofile=coverage.out ./...
   - $(go env GOPATH | awk 'BEGIN{FS=":"} {print $1}')/bin/goveralls -coverprofile=coverage.out -service=travis-ci -repotoken=${COVERALLS_TOKEN}
-after_success:
-  - cd docs && hugo && mv docs/* .
-deploy:
-  provider: pages
-  skip-cleanup: true
-  github-token: $GITHUB_TOKEN
-  keep-history: true
-  on:
-    branch: master
-  local-dir: docs
@@ -1,7 +1,132 @@
+[![GoDoc](https://godoc.org/github.com/platinummonkey/go-concurrency-limits?status.svg)](https://godoc.org/github.com/platinummonkey/go-concurrency-limits)
 [![Build Status](https://travis-ci.org/platinummonkey/go-concurrency-limits.svg?branch=master)](https://travis-ci.org/platinummonkey/go-concurrency-limits) [![Coverage Status](https://img.shields.io/coveralls/github/platinummonkey/go-concurrency-limits/master.svg)](https://coveralls.io/github/platinummonkey/go-concurrency-limits)
+[![Releases](https://img.shields.io/github/release/platinummonkey/go-concurrency-limits.svg)](https://github.com/platinummonkey/go-concurrency-limits/releases) [![Releases](https://img.shields.io/github/downloads/platinummonkey/go-concurrency-limits/total.svg)](https://github.com/platinummonkey/go-concurrency-limits/releases)
 
-# Overview
+# Background
 
-Go Implementation of Netflix/concurrency-limits Java  Library that implements and integrates concepts from TCP congestion control to auto-detect concurrency limits to achieve optimal throughput with optimal latency.
+When thinking of service availability, operators traditionally think in terms of RPS (requests per second). Stress tests
+are normally performed to determine the RPS at which the service tips over. RPS limits are then set somewhere
+below this tipping point (say 75% of this value) and enforced via a token bucket. However, in large distributed systems
+that auto-scale, this value quickly goes out of date and the service falls over by becoming non-responsive as it is
+unable to gracefully shed excess load. Instead of thinking in terms of RPS, we should be thinking in terms of
+concurrent requests, where we apply queuing theory to determine the number of concurrent requests a service can handle
+before a queue starts to build up, latencies increase and the service eventually exhausts a hard limit such as CPU,
+memory, disk or network. This relationship is covered very nicely by Little's Law, where
+Limit = Average RPS * Average Latency.
 
-For more information [Docs](http://accelerate-experience.com/go-concurrency-limits/)
+Concurrency limits are very easy to enforce but difficult to determine as they would require operators to fully 
+understand the hardware services run on and coordinate how they scale. Instead we'd prefer to measure or estimate the 
+concurrency limits at each point in the network. As systems scale and hit limits each node will adjust and enforce its 
+local view of the limit. To estimate the limit we borrow from common TCP congestion control algorithms by equating a 
+system's concurrency limit to a TCP congestion window.
+
+Before applying the algorithm we need to set some ground rules.
+
+- We accept that every system has an inherent concurrency limit that is determined by a hard resource, such as the number of CPU cores.
+- We accept that this limit can change as a system auto-scales.
+- For large and complex distributed systems it's impossible to know all the hard resources.
+- We can use latency measurements to determine when queuing happens.
+- We can use timeouts and rejected requests to aggressively back off.
+
+# Limit Algorithms
+
+## Vegas
+
+Delay based algorithm where the bottleneck queue is estimated as
+
+```
+L * (1 - minRTT/sampleRTT)
+```
+
+At the end of each sampling window the limit is increased by 1 if the queue is less than alpha (typically a value
+between 2 and 3 requests) or decreased by 1 if the queue is greater than beta (typically a value between 4 and 6 requests).
+
+## Gradient2
+
+This algorithm attempts to address bias and drift when using minimum latency measurements. To do this the algorithm
+tracks the divergence between two exponential averages over a long and a short time window. Using
+averages the algorithm can smooth out the impact of outliers for bursty traffic. Divergence duration is used as a proxy
+to identify a queueing trend, at which point the algorithm aggressively reduces the limit.
+
+# Enforcement Strategies
+
+## Simple
+
+In the simplest use case we don't want to differentiate between requests and so enforce a single gauge of the number of 
+inflight requests. Requests are rejected immediately once the gauge value equals the limit.
+
+## Partitioned
+
+For a slightly more complex system, it's desirable to partition requests to different backends/services. For example,
+you might shard by customer id modulo 64 and use the remainder as a unique backend identifier to target
+the request. This allows specific partitions to begin failing while others are operating normally.
+  
+## Percentage
+
+For more complex systems it's desirable to provide certain quality of service guarantees while still making efficient 
+use of resources. Here we guarantee specific types of requests get a certain percentage of the concurrency limit. For 
+example, a system that takes both live and batch traffic may want to give live traffic 100% of the limit during heavy 
+load and is OK with starving batch traffic. Or, a system may want to guarantee that 50% of the limit is given to write 
+traffic so writes are never starved.
+
+# Integrations
+
+## GRPC
+
+A concurrency limiter may be installed either on the server or client. The choice of limiter depends on your use case. 
+For the most part it is recommended to use a dynamic delay based limiter such as the VegasLimit on the server and 
+either a pure loss based (AIMDLimit) or combined loss and delay based limiter on the client.
+
+### Server Limiter
+
+The purpose of the server limiter is to protect the server from either increased client traffic (batch apps or retry 
+storms) or latency spikes from a dependent service. With the limiter installed the server can ensure that latencies 
+remain low by rejecting excess traffic with `Status.UNAVAILABLE` errors.
+
+In this example a GRPC server is configured with a single adaptive limiter that is shared among batch and live traffic 
+with live traffic guaranteed 90% of throughput and 10% guaranteed to batch. For simplicity we just expect the client to 
+send a "group" header identifying it as 'live' or 'batch'. Ideally this should be done using TLS certificates and a 
+server side lookup of identity to grouping. Any requests not identified as either live or batch may only use excess 
+capacity.
+
+```golang
+import (
+    gclGrpc "github.com/platinummonkey/go-concurrency-limits/grpc"
+)
+
+// setup grpc server with this option
+serverOption := grpc.UnaryInterceptor(
+    gclGrpc.UnaryServerInterceptor(
+        gclGrpc.WithLimiter(...),
+        gclGrpc.WithServerResponseTypeClassifier(..),
+    ),
+)
+```
+
+### Client Limiter
+
+There are two main use cases for client side limiters. A client side limiter can protect the client service from its 
+dependent services by failing fast and serving a degraded experience to its client instead of having its latency go up 
+and its resources eventually exhausted. For batch applications that call other services a client side limiter acts as a 
+backpressure mechanism ensuring that the batch application does not put unnecessary load on dependent services.
+
+In this example a GRPC client will use a blocking version of the VegasLimit to block the caller when the limit has been 
+reached.
+
+```golang
+import (
+    gclGrpc "github.com/platinummonkey/go-concurrency-limits/grpc"
+)
+
+// setup grpc client with this option
+dialOption := grpc.WithUnaryInterceptor(
+    gclGrpc.UnaryClientInterceptor(
+        gclGrpc.WithLimiter(...),
+        gclGrpc.WithClientResponseTypeClassifier(...),
+    ),
+)
+```
+
+# References Used
+1. Original Java implementation - Netflix - https://github.com/netflix/concurrency-limits/
+1. Windowless Moving Percentile - Martin Jambon - https://mjambon.com/2016-07-23-moving-percentile/
@@ -0,0 +1,2 @@
+// Package core provides the package interfaces.
+package core
@@ -0,0 +1,2 @@
+// Package go_concurrency_limits provides primitives for concurrency control in complex systems.
+package go_concurrency_limits
@@ -0,0 +1,2 @@
+// Package examples contains examples of using this package to solve concurrency problems.
+package examples
@@ -0,0 +1,2 @@
+// Package grpc provides GRPC client/server mixins to add concurrency control.
+package grpc
@@ -0,0 +1,2 @@
+// Package limit provides several useful limit implementations.
+package limit
@@ -0,0 +1,2 @@
+// Package functions provides additional helper functions to the limit package.
+package functions
@@ -0,0 +1,2 @@
+// Package limiter provides common limiter implementations that are useful.
+package limiter
@@ -0,0 +1,2 @@
+// Package measurements provides measurement reading implementations.
+package measurements
@@ -1,3 +1,4 @@
+// Package datadog implements the metric registry interface for a Datadog provider.
 package datadog
 
 import (
 
@@ -0,0 +1,2 @@
+// Package metric_registry provides common implementations of metric registries.
+package metric_registry
@@ -1,3 +1,4 @@
+// Package gometrics implements the metric registry interface for a gometrics provider.
 package gometrics
 
 import (
 
@@ -0,0 +1,2 @@
+// Package patterns provides common patterns as higher level abstractions from the library building blocks.
+package patterns
@@ -0,0 +1,2 @@
+// Package pool provides common pool patterns for concurrency control.
+package pool
@@ -1,4 +1,4 @@
-package patterns
+package pool
 
 import (
 	"context"
 
@@ -1,4 +1,4 @@
-package patterns
+package pool
 
 import (
 	"context"
 
@@ -1,4 +1,4 @@
-package patterns
+package pool
 
 import (
 	"context"