스킬 파일

Autonomous Integer Sequence Research System

Name: Autonomous Integer Sequence Research System
Author: daedalus

Autonomous research pipeline for discovering, validating, and characterizing integer sequences suitable for OEIS submission. Use this skill whenever the user wants to generate new sequences from arithmetic constructions, test whether a formula is correct, falsify a conjecture about number-theoretic functions, check multiplicativity, profile prime-power behavior, or search for OEIS collisions. Also triggers for requests like "find me a new sequence", "is this formula right", "does f(n) = phi(n) * 2^omega(n) have a closed form", "test this divisor sum for multiplicativity", or "is this OEIS-worthy". The core philosophy: treat every formula as guilty until proven robust. Prefer counterexamples over confirmations. Elegance is suspicious. Survival under adversarial attack is the only meaningful validation.

daedalus0 스타2026. 4. 5.

직업
카테고리: 과학 컴퓨팅

스킬 내용

Adversarial + generative pipeline for OEIS-grade sequence discovery. The system generates candidates, cross-validates them, aggressively tries to break conjectured formulas, and scores survivors on a confidence scale.

Reference implementation: seq_research.py
Run with python seq_research.py (full sweep) or
python seq_research.py --single <NAME> (single sequence).

Pipeline Overview

Generator → Evaluator → Conjecturer → Falsifier → Classifier → Report
                ↑_____________________________|
                     (loop until validated or discarded)

Each stage is described below. Read all sections before starting work; the adversarial falsifier (§4) is the most important stage and the one most commonly skipped too early.

§1 · Sequence Generation

Grammar

Build candidates compositionally from:

Autonomous Integer Sequence Research System

daedalus0 스타2026. 4. 5.

직업
카테고리: 과학 컴퓨팅

스킬 내용

Reference implementation: seq_research.py
Run with python seq_research.py (full sweep) or
python seq_research.py --single <NAME> (single sequence).

Pipeline Overview

Generator → Evaluator → Conjecturer → Falsifier → Classifier → Report
                ↑_____________________________|
                     (loop until validated or discarded)

Each stage is described below. Read all sections before starting work; the adversarial falsifier (§4) is the most important stage and the one most commonly skipped too early.

§1 · Sequence Generation

Grammar

Build candidates compositionally from:

관련 스킬

Category	Elements
Base functions	`n`, `φ(n)`, `σ(n)`, `τ(n)`, `rad(n)`, `ω(n)`, `Ω(n)`, `μ(n)`
Operators	`+`, `×`, `^`, composition
Divisor structures	`Σ_{d\|n}`, `Π_{d\|n}`, unitary filter `gcd(d, n/d)=1`
Bit/hybrid	XOR, popcount, parity

φ(∏_{d|n} φ(d))          — §2 shows this is non-multiplicative, interesting growth
∑_{d|n} d·φ(d)            — multiplicative, near-quadratic growth, strong candidate
∏_{d|n, gcd(d,n/d)=1} φ(d)  — unitary variant; check against dual impl
φ(n) · 2^{ω(n)}           — multiplicative; geometric prime-power profile
(μ ★ φ)(n)                 — Dirichlet convolution; vanishes on many n
∑_{d|n} gcd(d, n/d)        — multiplicative; sub-linear growth
∏_{p^a ∥ n} p^{φ(a)}      — depends only on prime signature

f(p^k) for p ∈ {2, 3, 5, 7, 11}, k = 1…8

Pattern	Interpretation
Constant ratio `f(p^{k+1})/f(p^k)` across all k	f is geometric at p, likely `f(p^k) = p^{αk}`
Ratio → p as k grows	f(p^k) ~ p^k (identity-like)
Ratio = p-1	f(p^k) = φ(p^k) = p^{k-1}(p-1)
Non-constant but p-independent	f(p^k) = g(k) for some g
p-dependent ratio	f encodes the primes, not just exponents

WEAK_ZONES = (
    # Prime powers
    [p**k for p in [2,3,5,7,11,13] for k in range(1,6)] +
    # Products of two primes and their powers
    [p**a * q**b for p,q in [(2,3),(2,5),(3,5)] for a,b in [(1,1),(2,1),(1,2)]] +
    # Highly composite: 12, 24, 60, 120, 360, 720, 840, 2520
    [12, 24, 60, 120, 360, 720, 840, 2520] +
    # Powers of 2 and near-powers
    [2**k for k in range(1,20)] + [2**k - 1 for k in range(2,20)] +
    # Three-prime products
    [2*3*5, 2*3*7, 2*5*7, 3*5*7, 2*3*5*7]
)

n₀ → n₀ · p   (extend by small prime)
n₀ → n₀ / p   (reduce if p² | n₀)
n₀ → n₀²      (square)

alpha = cov(log n, log f(n)) / var(log n)

f(n) = ∏_{p^a ∥ n}  g(p, a)

Criterion	+Points	−Points
Cross-validation passes	+10	−20 if fails
Multiplicative	+15	—
Passes adversarial falsification	+15	−15 if fails
Surviving closed-form conjecture	+10	—
Consistent growth (0.5 ≤ α ≤ 3.0)	+5	—
No errors in first 100 terms	—	−3 per error
OEIS collision detected	—	−20

Score	Level	Action
0–39	Discard	Fix implementation or abandon
40–69	Experimental	More terms, more testing
70–89	Strong candidate	Compute 500 terms, prepare OEIS draft
90–100	OEIS-ready	Validate formula, write b-file

Failure	Cause	Fix
Formula works for n ≤ 30, fails at n=36	Didn't test p²·q	Always include weak zones
Dual implementation disagrees	Off-by-one in unitary filter or wrong prime-power formula	Test against divisors(n) directly; verify f(p^k) for k=1,2,3
Multiplicativity test passes but formula is wrong	Coprime pairs too small	Include pairs with p² and p³
Growth estimate unstable	Dominated by n=1 and n=2	Start regression from n=10
"Novel" sequence is A000010 (Euler phi)	Didn't check base cases	Always check standard functions first
`phi(rad(n))` errors for n=1	sympy's `rad` is symbolic (radians!), not integer radical	Implement manually: `rad(n) = prod(p for p in factorint(n))`
Weak zone test reports "non-numeric" for valid integers	sympy.Integer vs Python int	Check `hasattr(val, 'is_Integer')` or use `int(val)`
`Σ_{d	n} gcd(d,n/d)` formula fails at p^k	Wrong piecewise formula

Candidate	Identity	Proof sketch
id ★ μ	φ(n)	Möbius inversion: id = φ ★ 1, so id ★ μ = φ ★ (1 ★ μ) = φ ★ ε = φ
σ ★ μ	n (identity)	Möbius inversion: σ = id ★ 1, so σ ★ μ = id ★ ε = id
τ ★ μ	1 (all-ones)	τ = 1 ★ 1, so τ ★ μ = 1 ★ (1 ★ μ) = 1 ★ ε = 1
\|{unitary divisors of n}\|	2^{ω(n)}	Direct: unitary divs are ∏_{p\|n} {1, p^a}; count = 2^{ω(n)}
Σ_{d unitary} d · φ(n/d)	φ(n)·2^{ω(n)}	Equals Batch 1 candidate; not novel
∏_{p^a ∥ n} (p^a + 1)	unitary σ(n)	∏(p^a+1) = Σ_{d unitary} d by definition
Σ_{d\|n, d sqfree} φ(d)	rad(n) (A007947)	Each prime p^k contributes φ(1)+φ(p)=1+(p-1)=p; product = rad(n)
Σ_{d\|n, d sqfree} d	σ(rad(n)) = ∏_{p\|n}(1+p)	Squarefree-divisor sum = ∏(1+p) by multiplicativity
J₂(n) = n²·∏(1−1/p²)	A007434	Jordan totient of order 2; classical
Σ_{d	n} φ(d)	n

# Key imports
from sympy import (factorint, totient, divisors, divisor_sigma,
                   divisor_count, isprime, gcd, mobius,
                   primeomega, primenu)
import math

# Core functions (all return Python int, not sympy.Integer)
phi   = lambda n: int(totient(n))
tau   = lambda n: int(divisor_count(n))
sigma = lambda n: int(divisor_sigma(n))
omega = lambda n: int(primenu(n))       # distinct prime factors
Omega = lambda n: int(primeomega(n))    # total with multiplicity
mu    = lambda n: int(mobius(n))

# rad(n) - product of distinct prime factors (NOT sympy.rad which is radians!)
def rad(n):
    if n <= 1: return 1
    result = 1
    for p in factorint(n):
        result *= p
    return result

# Unitary divisors
def unitary_divisors(n):
    return [d for d in divisors(n) if gcd(d, n // d) == 1]

# Multiplicativity test
MULT_PAIRS = [(2,3),(2,5),(3,5),(4,9),(2,7),(5,9),(4,25),(8,27),
              (9,25),(16,9),(4,49),(8,125),(27,25),(2,15),(4,21)]

def is_multiplicative(fn):
    for a, b in MULT_PAIRS:
        if gcd(a, b) == 1:
            fa, fb, fab = fn(a), fn(b), fn(a*b)
            if fa * fb != fab:
                return False, (a, b, fa, fb, fab)
    return True, None

# Weak zone test (handles sympy.Integer)
WEAK_ZONES = sorted(set(
    [p**k for p in [2,3,5,7,11,13] for k in range(1,6)] +
    [p**a * q**b for p,q in [(2,3),(2,5),(3,5)] for a,b in [(1,1),(2,1),(1,2)]] +
    [12, 24, 60, 120, 360, 720, 840, 2520] +
    [2**k for k in range(1,20)] + [2**k - 1 for k in range(2,15)] +
    [2*3*5, 2*3*7, 2*5*7, 3*5*7, 2*3*5*7]
))

def test_weak_zones(fn):
    failures = []
    for n in WEAK_ZONES:
        try:
            val = fn(n)
            if hasattr(val, 'is_Integer'):  # sympy.Integer
                val = int(val)
            if not isinstance(val, (int, float)) or (isinstance(val, float) and math.isnan(val)):
                failures.append((n, 'non-numeric'))
        except Exception as e:
            failures.append((n, str(e)))
    return failures

# Dirichlet convolution
def dirichlet(f, g, n):
    return sum(f(d) * g(n // d) for d in divisors(n))

# Prime-power profile
def pp_profile(f, primes=(2,3,5,7,11), kmax=7):
    for p in primes:
        vals = [f(p**k) for k in range(1, kmax+1)]
        ratios = [round(vals[i+1]/vals[i], 5) if vals[i] != 0 else None
                  for i in range(len(vals)-1)]
        print(f"p={p}: {vals}")
        print(f"      ratios: {ratios}")

# Growth exponent (log-space regression)
def growth_alpha(f, lo=10, hi=200):
    pts = [(n, f(n)) for n in range(lo, hi+1) if f(n) > 0]
    if len(pts) < 10:
        return None
    xs = [math.log(n) for n, _ in pts]
    ys = [math.log(v) for _, v in pts]
    xm = sum(xs) / len(xs)
    ym = sum(ys) / len(ys)
    cov = sum((xs[i] - xm) * (ys[i] - ym) for i in range(len(xs)))
    var = sum((x - xm)**2 for x in xs)
    return round(cov / var, 4) if var > 0 else None

# OEIS string format
def oeis_str(f, count=30):
    return ", ".join(str(f(n)) for n in range(1, count + 1))

g(a)	Status	α	f(p^k)
a	Known (A001222 Ω(n))	0	k
a²	STRONG	0.11	k²
a(a+1)/2	STRONG	0.10	k(k+1)/2
2^a	STRONG	0.11	2^k
a!	Untested	—	—
F(a) (Fibonacci)	Untested	—	—

k	Status	α	f(p^a)
1	Known = n (A000027)	1.0	p^a
2	OEIS-READY	2.0	1+(p-1)²(p^{2a}-1)/(p²-1)

k	Status	α	f(p^a)
1	Known = n	1.0	p^a
2	OEIS-READY	2.0	p^{2a} + p^{a-1}(p^a - 1)

Autonomous Integer Sequence Research System

Pipeline Overview

§1 · Sequence Generation

Grammar

Autonomous Integer Sequence Research System

Pipeline Overview

§1 · Sequence Generation

Grammar

Bias Rules

Example Constructions Worth Exploring

§2 · Multi-Implementation Evaluation

Implementation Types

Differential Validation

§3 · Multiplicativity Detection

3.1 · Prime-Power Profiling

§4 · Adversarial Falsification (Most Important Stage)

4.1 · Weak Zones (Test These First)

4.2 · Mutation Attack

4.3 · Delta Debugging

4.4 · Red Team Checklist

§5 · Conjecture Engine

5.1 · Template Hierarchy

5.2 · Log-Space Regression

5.3 · Multiplicative Conjecture Construction

§6 · OEIS Collision Filter

§7 · Confidence Scoring

§8 · Output Standard (OEIS-Grade)

§9 · Known Failure Modes and Mitigations

§10 · Meta-Insight Tracking

§11 · Confirmed Classical Identities (Do Not Resubmit)

§12 · Parameterized Families

Appendix: Quick Reference

Appendix 2: Various analysis

Deep Research

Data Analyst

Academic Researcher

Data Scientist

Biopython

Binary Analysis Patterns