System metrics with Prometheus

Team Edition system performance can be monitored to understand system health, evaluate network traffic, and detect issues. Each of the Team Edition services expose a set of metrics that can be visualized using the built-in Prometheus expression browser. Metrics are provided in OpenMetrics (Prometheus) format.

Accessing Prometheus

To access your system metrics from the UI, click on the My account button in the top right, and then click Metrics. This will open a new browser tab with the root URL of the Team Edition installation appended with /prometheus/. For example, https://yourcompany.com/prometheus/.

../_images/prometheus_access.png

Alternatively, you can add /prometheus to the root URL of the Team Edition installation.

Using the expression browser

Prometheus uses a built-in expression browser for time series visualizations of system metrics.

Follow these steps to create visualizations from the expression browser:

  1. Select a metric from the dropdown by clicking in the cell that reads insert metric at cursor.

  2. Click on the Graph tab.

  3. Select a time period. We recommend two weeks (“2w”).

  4. Select an “until” date, that being the point in time up to which the selected metric will displayed on the graph. The default time is the current date and time.

  5. Click Execute.

A graph will be populated with the selected metric, and a console readout will appear beneath it.

Tip

You can isolate a single resource by clicking it in the legend below the graph.

Setting a file limit alert

Using the two metrics above, process_open_fds and process_max_fds, you can quickly write an alert to warn you when a process hits, say, 80% of the limit:

groups:
- name: example
  rules:
  - alert: ProcessNearFDLimits
    expr: process_open_fds / process_max_fds > 0.8
    for: 10m