
New tool to save HTTP stats

4 min read
Creator of Spider

Spider can be very useful for extracting system statistics.
But its short retention time (due to the high data volume) limits what you can do with these stats.

This new tool extracts HTTP statistics from Spider's captured data and saves them into another index with a longer retention time.

The tool:

  • is deployed as a Docker image, and may be executed from a crontab or as a Kubernetes job
  • takes as parameters (among others):
    • the relative period to extract stats from,
    • the grouping axes,
    • the field on which to compute stats,
    • any filter to customize source data

Dashboard

As an example, I designed a first dashboard in Kibana (but it could also be done in Grafana):

(Screenshots: Dashboard SSS)

The dashboard can be filtered by service, API, status, and external/internal requests.

Configuration

Example:

{
  "statsPeriod": {
    "start": "now/h-1h",
    "end": "now/h"
  },
  "whisperer": "",
  "spiderEsNode": "http://...",
  "monitoringEsNode": "http://...",
  "env": "...",
  "innerClusterNetwork": "10.1.0.0/24",
  "stats": [
    {
      "label": "External requests",
      "groups": [
        { "label": "server", "field": "stats.dst.name" },
        { "label": "template", "field": "req.template" },
        { "label": "status", "field": "stats.statusCode" }
      ],
      "statField": "stats.duration",
      "limitTo": "EXTERNALS"
    },
    {
      "label": "Internal requests",
      "groups": [
        { "label": "server", "field": "stats.dst.name" },
        { "label": "template", "field": "req.template" },
        { "label": "status", "field": "stats.statusCode" }
      ],
      "statField": "stats.duration",
      "limitTo": "INTERNALS"
    },
    {
      "label": "Controls by Park",
      "groups": [
        { "label": "server", "field": "stats.dst.name" },
        { "label": "template", "field": "req.template" },
        { "label": "status", "field": "stats.statusCode" },
        { "label": "park", "field": "stats.tags.park" }
      ],
      "statField": "stats.duration",
      "limitTo": "EXTERNALS",
      "filter": "req.template:\"POST /controls\""
    }
  ]
}

Source

  • spiderEsNode: Spider ES server where the data is
  • whisperer: Id of the whisperer capturing the data to export
  • env: Label of the environment (used in the exported stat structure)
  • innerClusterNetwork: Network used in the system under capture
  "whisperer": "u1BHa8ZlT8GP8qPGQDkqRw",
"spiderEsNode": "https://user:pass@node:port",
"env": "SIT1",
"innerClusterNetwork": "10.1.0.0/24",

Destination

  • monitoringEsNode: Target Elasticsearch node

"monitoringEsNode": "https://user:pass@node:port"

Period

Define the relative period to extract the stats from, using Elasticsearch date math. The example below covers the previous full hour:

 "statsPeriod": {
"start": "now/h-1h",
"end": "now/h"
}

Stats

List of stats to export:

  • label: Tag for the extracted stats
  • groups: Define the grouping fields for the stats, with their label in the extracted stats
  • statField: The field on which the stats are computed
  • limitTo: When present, limits the stats to EXTERNALS or INTERNALS calls, based on the network defined in the config
  • filter: Adds a custom filter to limit the stats to... whatever you need

Output

In the target index, an extracted stat looks like this:

{
  "env" : "SIT1",
  "label" : "External requests",
  "start" : "2022-06-19T20:00:00.077Z",
  "stop" : "2022-06-19T20:59:59.821Z",
  "timespan" : {
    "gte" : "2022-06-19T20:00:00.077Z",
    "lte" : "2022-06-19T20:59:59.821Z"
  },
  "group" : {
    "server" : "businesscustomer_parks_service",
    "template" : "SEARCH /areas",
    "status" : 200
  },
  "stats" : {
    "count" : 9,
    "min" : 0.046460999999999995,
    "max" : 0.356437,
    "avg" : 0.17882433333333334,
    "sum" : 1.609419,
    "stddev" : 0.09696893834740185
  },
  "percentiles" : {
    "50" : 0.18603499999999998,
    "90" : 0.32588659999999997,
    "95" : 0.356437,
    "99" : 0.356437
  },
  "distribution" : {
    "0" : 7,
    "250" : 2
  }
}
  • start, stop, and timespan: Period of the stat, timespan being a date range for easier queries
  • group: Values of the grouping fields
  • stats: Simple stats (count, min, max, avg, sum, standard deviation)
  • percentiles: The most meaningful percentiles
  • distribution: Distribution histogram of the calls, in 250 ms steps (here: 7 calls under 250 ms, 2 calls between 250 ms and 500 ms)
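
As an illustration, here is a minimal query sketch over these documents: it fetches the "External requests" stats of the last 24 hours and averages stats.avg per server. It assumes the spider-search-http-stats alias created by the init script (see below), that timespan is mapped as a date_range, and that label and group.server are keyword fields; the URL, credentials, and mappings are assumptions to adapt to your setup. Note the result is an unweighted average of the stored averages.

curl -s -H 'Content-Type: application/json' \
  'https://user:pass@node:port/spider-search-http-stats/_search' -d '
{
  "size": 0,
  "query": {
    "bool": {
      "filter": [
        { "term": { "label": "External requests" } },
        { "range": { "timespan": { "gte": "now-24h", "relation": "intersects" } } }
      ]
    }
  },
  "aggs": {
    "per_server": {
      "terms": { "field": "group.server" },
      "aggs": {
        "avg_duration": { "avg": { "field": "stats.avg" } }
      }
    }
  }
}'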

Packaging

The tool is distributed as a Docker image: registry.gitlab.com/spider-analyzer/images/save-http-stats
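
Assuming you have access to the registry (a docker login to registry.gitlab.com may be required), you can pull it directly:

docker pull registry.gitlab.com/spider-analyzer/images/save-http-stats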

Running

Initialisation

Initialisation creates the output index specified in config.json:

docker run \
  -v [path to config]:/app/config.json \
  --network host \
  --rm \
  registry.gitlab.com/spider-analyzer/images/save-http-stats \
  init

This script creates:

  • An ILM policy spider-http-stats-ilm to manage the indices' lifecycle and storage
    • Rollover every week + search optimisation
    • Deletion after 1 year
  • An index template spider-http-stats
  • An alias spider-search-http-stats to use when searching the stats
    • It allows searching across all stats indices
  • A first index spider-http-stats-...
    • With today's date
    • With the alias spider-active-http-stats

The active alias is used for indexing by the extract script.
A new index is created automatically every week to store newly extracted data.
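
To check which weekly index currently carries the active alias, you can for instance list the aliases on the monitoring node (an illustrative curl call; adjust the URL and credentials to your setup):

curl -s 'https://user:pass@node:port/_cat/aliases/spider-*http-stats*?v'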

Extracting stats

Command to run the script:

docker run \
  -v [path to config]:/app/config.json \
  --network host \
  --name save-http-stats \
  --rm \
  --restart no \
  registry.gitlab.com/spider-analyzer/images/save-http-stats

The best approach is to wrap this in a small shell script and add it to cron, as sketched below.
At every scheduled run, the script extracts the configured stats for the corresponding period.
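
For example, a minimal wrapper could look like this (the config path, script location, log file, and schedule are illustrative assumptions; the docker run command is the one above):

#!/bin/sh
# save-http-stats.sh -- illustrative wrapper around the command above
# (adapt the config path to your environment)
docker run \
  -v /opt/spider/save-http-stats/config.json:/app/config.json \
  --network host \
  --name save-http-stats \
  --rm \
  --restart no \
  registry.gitlab.com/spider-analyzer/images/save-http-stats

# Example crontab entry: run 5 minutes past every hour,
# matching the now/h-1h -> now/h period of the sample config:
# 5 * * * * /opt/spider/save-http-stats/save-http-stats.sh >> /var/log/save-http-stats.log 2>&1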