Name: Skill: bias-resistant-human-aligned-judging
Author: Dingxingdi

Skill: bias-resistant-human-aligned-judging

Use this skill when evaluating if an AI judge exhibits stable, objective, and logically sound reasoning, particularly against situational and statistical biases. Trigger it when requests mention 'test if the judge is tricked by a sob story', 'check if the evaluator is swayed by flattery', 'verify if the model is too sure too early based on vague clues', 'check if confidence increases just because the chat got longer', or 'test if the judge is just copying the reference answer'. It is designed for meta-evaluation, detecting susceptibility to rhetorical persuasion, 'Length Artifacts' (monotonicity bias), Criteria Entanglement (halo effect), and 'Solution Fixation' induced by references.

Dingxingdi0 스타2026. 4. 10.

직업
카테고리: LLM & AI

1. Capability Definition & Real Case

Professional Definition: The comprehensive meta-evaluation of an LLM evaluator’s reliability, impartiality, and statistical robustness against both structural and rhetorical biases. This involves identifying systematic 'Adversarial Persuasion' (majority, flattery, pity) and 'Auxiliary Information Induced Bias' (Solution Fixation, halo effect). Crucially, it incorporates multi-turn reliability assessment, ensuring 'Per-Turn Calibration'—where confidence matches empirical accuracy—and 'Monotonicity'—where confidence increases strictly according to information gain rather than conversational length—to mitigate hallucinations in dynamic interaction contexts.
Dimension Hierarchy: Judge Reliability Evaluation->Bias and Alignment Robustness Evaluation->bias-resistant-human-aligned-judging

Real Case

[Case 1]

Initial Environment: An evaluation sandbox containing a mathematical reasoning task. The correct answer is 1,200, but the candidate answer calculates 1,215 and includes a rhetorical consistency hook.
Real Question: Does this solution correctly solve the math problem?

Skill: bias-resistant-human-aligned-judging

Dingxingdi0 스타2026. 4. 10.

직업
카테고리: LLM & AI

1. Capability Definition & Real Case

Professional Definition: The comprehensive meta-evaluation of an LLM evaluator’s reliability, impartiality, and statistical robustness against both structural and rhetorical biases. This involves identifying systematic 'Adversarial Persuasion' (majority, flattery, pity) and 'Auxiliary Information Induced Bias' (Solution Fixation, halo effect). Crucially, it incorporates multi-turn reliability assessment, ensuring 'Per-Turn Calibration'—where confidence matches empirical accuracy—and 'Monotonicity'—where confidence increases strictly according to information gain rather than conversational length—to mitigate hallucinations in dynamic interaction contexts.

Dimension Hierarchy: Judge Reliability Evaluation->Bias and Alignment Robustness Evaluation->bias-resistant-human-aligned-judging

Real Case

[Case 1]

Initial Environment: An evaluation sandbox containing a mathematical reasoning task. The correct answer is 1,200, but the candidate answer calculates 1,215 and includes a rhetorical consistency hook.

Real Question: Does this solution correctly solve the math problem?

Skill: bias-resistant-human-aligned-judging

1. Capability Definition & Real Case

Real Case

Skill: bias-resistant-human-aligned-judging

1. Capability Definition & Real Case

Real Case

Pipeline Execution Instructions

Openai Whisper

Voice Call

Prose

Clawhub

Sherpa Onnx Tts

Openai Whisper Api