Conducts a "Statistical Jury" audit to measure Decision Noise by invoking multiple independent judgments.
This skill implements the "Statistical Jury" method to quantify Decision Noise. It requires the agent to simulate multiple independent judges evaluating the same material to detect variance.
task sub-agents (type: noise_judge).
task tool handles this by default).import math
def calculate_noise_metrics(scores):
n = len(scores)
if n == 0: return 0, 0
mean = sum(scores) / n
variance = sum([(x - mean) ** 2 for x in scores]) / n
std_dev = math.sqrt(variance)
return {
"mean": mean,
"std_dev": std_dev,
"range": max(scores) - min(scores)
}