Aggregation Functions

Note

Understanding the difference between smart aggregates (.avg()), aggregated metrics (.avg), and special functions is critical for writing correct NQL queries.

Overview

NQL provides multiple ways to aggregate data, and choosing the right one is crucial for accurate results. The most common confusion is between .avg() and .avg - they're different and used in different contexts.

Three function categories:

  1. Smart Aggregates - functions with () - use in compute/summarize
  2. Aggregated Metrics - database fields without () - use in where clauses
  3. Special Functions - count(), countif(), last(), time_elapsed()

Quick Decision Guide

| Context | Use This | Example |
| --- | --- | --- |
| In compute or summarize | Smart aggregates .avg() | compute avg_cpu = cpu_time.avg() |
| In where clause | Aggregated metric .avg | where cpu_usage.avg > 50 |
| For peak values | Chained .avg.max() | summarize peak = cpu_usage.avg.max() |
| Count unique objects | <object>.count() | summarize devices = device.count() |
| Count events (avoid!) | count() | ❌ Returns sample count on events |

The 95% Rule

Use .avg() in compute/summarize 95% of the time - it gives you the true average because it accounts for cardinality.

Only use .avg (without parentheses) in where clauses for filtering.
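
As a quick illustration (a minimal sketch reusing the cpu_usage metric from the examples later on this page), both forms often appear in the same query:

/* Sketch - .avg field for filtering, .avg() smart aggregate for the computed value */
devices during past 7d
| include device_performance.events
| where cpu_usage.avg > 50           # field (no parentheses) in where
| compute avg_cpu = cpu_usage.avg()  # function (with parentheses) in compute
| list device.name, avg_cpu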

Smart Aggregates (Functions with Parentheses)

Use these in compute and summarize clauses.

Available Functions

| Function | Description | Returns | Example |
| --- | --- | --- | --- |
| .avg() | True average (accounting for cardinality) | Number | cpu_time.avg() |
| .sum() | Sum of all values | Number | memory.sum() |
| .count() | Number of aggregated values | Integer | events.count() |
| .min() | Minimum value | Number | memory.min() |
| .max() | Maximum value | Number | memory.max() |

How They Work

Smart aggregates are computed on-the-fly and provide accurate aggregation by accounting for data cardinality.

Equivalent formulas:

  • .avg() = .sum.sum() / .count.sum()
  • .sum() = .sum.sum()
  • .max() = .max.max()
  • .min() = .min.min()
  • .count() = .count.sum()
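
To see the first identity in practice, the two forms can be compared side by side (a minimal sketch assuming cpu_usage on device_performance.events, as used elsewhere on this page):

/* Both columns should return the same value, since .avg() = .sum.sum() / .count.sum() */
device_performance.events during past 7d
| summarize
    smart_avg  = cpu_usage.avg(),
    manual_avg = cpu_usage.sum.sum() / cpu_usage.count.sum()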

When to Use Smart Aggregates

/* ✅ CORRECT - Use in compute */
devices during past 7d
| include device_performance.events
| compute avg_memory = free_memory.avg()  # Smart aggregate
| list device.name, avg_memory

/* ✅ CORRECT - Use in summarize */
execution.events during past 7d
| summarize avg_cpu = cpu_time.avg() by device.name  # Smart aggregate

Aggregated Metrics (Database Fields)

These are pre-computed fields in the database - use them in where clauses for filtering.

Available Fields

| Field | Description | Use In |
| --- | --- | --- |
| .avg | Pre-computed average | where clauses |
| .sum | Pre-computed sum | where clauses |
| .count | Pre-computed count | where clauses |
| .min | Pre-computed minimum | where clauses |
| .max | Pre-computed maximum | where clauses |

When to Use Aggregated Metrics

/* ✅ CORRECT - Use .avg (no parentheses) in where */
devices during past 7d
| include device_performance.events
| where cpu_usage.avg > 50  # Aggregated metric field
| list device.name

The .avg vs .avg() vs .avg.avg() vs .avg.max() Question

This is the most common source of confusion in NQL. Here's the complete breakdown:

.avg - Pre-Computed Average Field

Use in: where clauses for filtering

devices during past 7d
| include device_performance.events
| where cpu_usage.avg > 50  # Field for filtering

.avg() - Smart Aggregate (True Average)

Use in: compute and summarize clauses

This is what you want 95% of the time!

devices during past 7d
| include device_performance.events
| compute avg_cpu = cpu_usage.avg()  # Accurate average

Why it's recommended:

  • Accounts for cardinality (number of samples per device/object)
  • Returns true average across all data points
  • Result is consistent regardless of time selection

.avg.avg() - Average of Pre-Computed Averages

Nexthink does NOT recommend using this!

The problem: .avg.avg() averages the pre-computed .avg values, so the result depends on how many of those values fall inside the time selection:

  • For past 2d: Averages 2 samples (2 days)
  • For past 48h: Averages 192 samples (48 hours × 4 samples/hour)
  • Same time period, different results!

/* ❌ NOT RECOMMENDED - Result varies by time selection */
device_performance.events during past 7d
| summarize sample_avg = cpu_usage.avg.avg()  # Don't use this

/* ✅ CORRECT - Consistent, accurate result */
device_performance.events during past 7d
| summarize true_avg = cpu_usage.avg()  # Use this instead

.avg.max() - Maximum Observed Value

Use in: Finding peak values

/* ✅ CORRECT - Find peak CPU usage */
device_performance.events during past 7d
| summarize
    avg_cpu = cpu_usage.avg(),      # Average CPU
    peak_cpu = cpu_usage.avg.max()  # Peak CPU observed
  by device.name
| where peak_cpu > 80

When to use:

  • Finding maximum observed values
  • Identifying spikes or peaks
  • Alert thresholds based on peak performance

Special Functions

count() - Count Records

Syntax: count() (without a field name)

What it does: Counts events or records

CRITICAL: Never Use count() on Sampled Events!

On event tables like execution.events, count() returns the number of samples, not meaningful business metrics.

/* ❌ BAD - Returns sample count (meaningless!) */
execution.events during past 7d
| summarize total = count()
/* Returns: 1,245,892 samples - not useful! */

/* ✅ GOOD - Count unique objects */
execution.events during past 7d
| summarize
    unique_devices = device.count(),
    unique_apps = application.count()
/* Returns: 142 devices, 38 apps - business value! */

When count() IS useful:

/* ✅ Count non-sampled events (crashes, boots) */
execution.crashes during past 7d
| summarize crash_count = count()

/* ✅ Count after aggregation */
devices during past 7d
| summarize device_count = count()

.count() - Count Unique Objects

Syntax: device.count(), user.count(), application.count()

What it does:

  • In compute: Returns 1 if object has events, 0 otherwise
  • In summarize: Counts unique objects

This is the RIGHT way to count on event tables!

/* ✅ CORRECT - Count unique devices */
execution.events during past 7d
| summarize unique_devices = device.count()

/* ✅ CORRECT - Per-device presence (1 or 0) */
devices during past 7d
| include execution.crashes during past 7d
| compute has_crashes = device.count()  # 1 if crashes exist, 0 otherwise
| summarize devices_with_crashes = has_crashes.sum()

countif() - Conditional Counting

Syntax: countif(<condition>)

What it does: Counts only values that meet the condition

/* Count poor quality calls */
collaboration.sessions during past 24h
| summarize
    poor_calls = countif(session.audio.quality = poor or session.video.quality = poor),
    total_calls = count(),
    poor_ratio = countif(session.audio.quality = poor or session.video.quality = poor) / count()
  by connection_type

.last() - Most Recent Value

Syntax: <field>.last()

What it does: Returns the most recent value from the aggregated set

/* Get current disk space (most recent reading) */
devices during past 7d
| include device_performance.events
| compute
    current_free_space = free_disk_space.last(),  # Latest value
    avg_free_space = free_disk_space.avg()         # Average over period
| list device.name, current_free_space, avg_free_space

Use cases:

  • Current state from time-series data
  • Latest configuration values
  • Most recent status

time_elapsed() - Time Since Timestamp

Syntax: <timestamp_field>.time_elapsed()

What it does: Calculates time elapsed since a timestamp

Chainable: Can be followed by aggregation functions

/* Time since last update */
devices
| compute days_since_update = last_update.time_elapsed()
| where days_since_update > 30d
| list device.name, days_since_update
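
Because .time_elapsed() is chainable, an aggregation can follow it directly - for example (a sketch, assuming the same last_update timestamp field as above):

/* Oldest and most recent update age across all devices */
devices
| summarize
    oldest_update = last_update.time_elapsed().max(),
    newest_update = last_update.time_elapsed().min()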

Real-World Examples

Example: CPU Performance Analysis

Scenario: Find devices with high average CPU and identify peak spikes.

execution.events during past 7d
| summarize
    avg_cpu_percent = ((cpu_time.sum()) * 100) / ((execution_duration.sum()) * (number_of_logical_processors.max())),
    unique_devices = device.count()
  by binary.name
| where avg_cpu_percent > 10
| list binary.name, avg_cpu_percent, unique_devices
| sort avg_cpu_percent desc
| limit 20

Explanation:

  • Uses .sum() smart aggregates for accurate totals
  • Uses .max() for number of logical processors
  • Uses device.count() to count unique devices (NOT count()!)
  • Calculates CPU percentage inline in summarize

Example: Memory Usage Monitoring

Scenario: Monitor memory usage with average and peak values.

device_performance.events during past 7d
| summarize
    avg_memory = real_memory.avg(),      # Average memory usage
    peak_memory = real_memory.avg.max(), # Peak memory observed
    device_count = device.count()
  by binary.name
| where peak_memory > 2GB
| list binary.name,
   avg_memory.as(format = bytes),
   peak_memory.as(format = bytes),
   device_count
| sort peak_memory desc

Explanation:

  • .avg() for average memory over period
  • .avg.max() for peak memory spike
  • device.count() for unique devices
  • Formats output as bytes (GB/MB)

Example: Crash Rate Calculation

Scenario: Calculate percentage of devices with crashes.

devices during past 7d
| include execution.crashes during past 7d
| compute has_crashes = device.count()  # Returns 1 or 0
| summarize
    total_devices = count(),
    devices_with_crashes = has_crashes.sum(),  # Sum the 1s
    crash_percentage = (has_crashes.sum() * 100.0) / count()

Explanation:

  • device.count() in compute returns 1 if crashes exist, 0 otherwise
  • Sum the 1s to count devices with crashes
  • Divide by total for percentage
  • Uses inline calculation in summarize

Example: Production DLP Monitoring (Verified)

Scenario: Monitor Microsoft Purview DLP memory usage (from production examples).

execution.events during past 7d
| where binary.name in ["sensedlpprocessor.exe", "mssense.exe", "mpdlpservice.exe"]
| summarize
    avg_memory_used = real_memory.avg(),    # Smart aggregate
    last_memory_used = real_memory.last(),  # Most recent value
    process_count = binary.name.count()     # Unique processes
  by device.name
| list device.name,
   avg_memory_used.as(format = bytes),
   last_memory_used.as(format = bytes),
   process_count
| sort process_count desc
| limit 100

Common Mistakes & Anti-Patterns

Mistake #1: Using count() on Sampled Events

/* ❌ WRONG - Returns sample count (meaningless) */
execution.events during past 7d
| summarize total_events = count()
/* Returns: 1,245,892 (just the sample count) */

/* ✅ CORRECT - Count unique objects or sum metrics */
execution.events during past 7d
| summarize
    unique_devices = device.count(),
    unique_apps = application.count(),
    total_cpu_time = cpu_time.sum()
/* Returns meaningful business metrics */

Mistake #2: Using .avg.avg() Instead of .avg()

/* ❌ NOT RECOMMENDED - Result varies by time selection */
device_performance.events during past 7d
| summarize sample_avg = cpu_usage.avg.avg()

/* ✅ CORRECT - Consistent, accurate average */
device_performance.events during past 7d
| summarize true_avg = cpu_usage.avg()

Mistake #3: Wrong Context for .avg vs .avg()

/* ❌ WRONG - Function in where clause */
devices during past 7d
| include device_performance.events
| where cpu_usage.avg() > 50  # ERROR - use .avg not .avg()

/* ✅ CORRECT - Field in where, function in compute */
devices during past 7d
| include device_performance.events
| where cpu_usage.avg > 50        # Field for filtering
| compute avg_cpu = cpu_usage.avg()  # Function for calculation

Mistake #4: Counting All Records Instead of Unique Objects

/* ❌ LESS USEFUL - Total records */
execution.events during past 7d
| where binary.name = "outlook.exe"
| summarize total_records = count()

/* ✅ MORE USEFUL - Unique devices running Outlook */
execution.events during past 7d
| where binary.name = "outlook.exe"
| summarize
    unique_devices = device.count(),
    unique_users = user.count()

Summary Table: When to Use Each

| What You Want | Use This | Context | Example |
| --- | --- | --- | --- |
| Filter by average | .avg | where clause | where cpu.avg > 50 |
| Calculate average | .avg() | compute/summarize | compute x = cpu.avg() |
| Find peak value | .avg.max() | compute/summarize | summarize peak = cpu.avg.max() |
| Count unique devices | device.count() | compute/summarize | summarize n = device.count() |
| Check if has events | device.count() | compute | compute has = device.count() |
| Count non-sampled events | count() | summarize | summarize n = count() |
| Conditional count | countif() | summarize | summarize n = countif(x > 10) |
| Latest value | .last() | compute/summarize | compute current = field.last() |
| Sum values | .sum() | compute/summarize | compute total = field.sum() |

Additional Resources