搵技能.../

Langfuse Cloud (EU) Account:
- Project URL: https://cloud.langfuse.com/project/cmhuwhcfe006yad06cqfub107
- API keys available (public + secret)
- EU data residency confirmed

Environment Variables:

export LANGFUSE_PUBLIC_KEY="pk-lf-..."
export LANGFUSE_SECRET_KEY="sk-lf-..."
export LANGFUSE_HOST="https://cloud.langfuse.com"

Dependencies:
- langfuse Python package (will be installed if missing)
- llama-index-core>=0.12.0 (for callback handler)
- Existing Phoenix instrumentation code identified

Locate Phoenix Configuration:

# Search for Phoenix setup
grep -r "phoenix" main/src/monitoring/ --include="*.py"
grep -r "from phoenix" main/src/ --include="*.py"
grep -r "import phoenix" main/src/ --include="*.py"

Identify Instrumentation Points:
- Read main/src/core/unified_workflow.py - identify workflow entry points
- Read main/src/agents/ - identify agent methods needing tracing
- Look for existing OpenTelemetry span creation
- Document all files importing Phoenix
Analyze Compliance Attributes:
- Check if GAMP-5 attributes are set (category, confidence)
- Check if ALCOA+ attributes are set (user_id, session_id, timestamps)
- Verify 21 CFR Part 11 metadata if applicable

Generate Assessment Report:

# Phoenix → Langfuse Migration Assessment

## Current Phoenix Instrumentation
- Configuration file: <path>
- Instrumented files: <count>
- Span count per workflow: <number>
- Compliance attributes: <present/missing>

## Migration Scope
- Files requiring decorator addition: <list>
- Phoenix imports to remove: <count>
- Callback handlers to replace: <list>
- Estimated migration time: <minutes>

## Risk Assessment
- Breaking changes: <yes/no>
- Test coverage: <percentage>
- Rollback complexity: <low/medium/high>

Install Langfuse SDK:

# Add to pyproject.toml
uv add langfuse

# For LlamaIndex integration
uv add llama-index-instrumentation-langfuse

Create Langfuse Configuration Module:
- File: main/src/monitoring/langfuse_config.py
- Content: See reference/decorator-patterns.md for template
- Key functions:
  - setup_langfuse(): Initialize client with EU cloud config
  - get_langfuse_client(): Singleton accessor
  - get_langfuse_callback_handler(): LlamaIndex integration
  - add_compliance_attributes(): GAMP-5/ALCOA+ attribute helper

Verify Cloud Connectivity:

# Test script (temporary)
from main.src.monitoring.langfuse_config import setup_langfuse

client = setup_langfuse()
client.trace(name="connectivity-test", input={"test": True})
client.flush()

# Verify trace appears at:
# https://cloud.langfuse.com/project/cmhuwhcfe006yad06cqfub107/traces

Update Environment Configuration:
- Add Langfuse environment variables to .env.example
- Update main/src/config.py to load Langfuse settings
- Add Langfuse to ObservabilityConfig dataclass

Add Decorators to Workflow Entry Points:

Use the automated script for systematic instrumentation:

python .claude/skills/langfuse-integration/scripts/add_instrumentation.py \
  --target main/src/core/unified_workflow.py \
  --dry-run  # Preview changes first

Manual pattern (if script unavailable):

# main/src/core/unified_workflow.py
from langfuse import observe

class UnifiedWorkflow(Workflow):
    @observe(name="unified-workflow-run", as_type="span")
    async def run(self, ctx: Context, ev: StartEvent) -> StopEvent:
        # Existing code unchanged
        ...

Instrument Agent Methods:

Target key agent operations:

# main/src/agents/categorizer.py
from langfuse import observe

@observe(name="gamp5-categorization", as_type="span")
async def categorize_urs(self, urs_content: str) -> dict:
    # Add compliance attributes
    from langfuse import get_current_observation
    obs = get_current_observation()
    if obs:
        obs.update(metadata={
            "compliance.gamp5.applicable": True,
            "compliance.alcoa_plus.attributable": True
        })

    # Existing categorization logic
    result = await self._categorize(urs_content)

    # Tag with category
    if obs:
        obs.update(metadata={
            "compliance.gamp5.category": result["category"]
        })

    return result

Replace LlamaIndex Callback Handler:

# main/src/core/unified_workflow.py or main/main.py
# OLD (Phoenix):
# from phoenix.otel import register
# tracer_provider = register()

# NEW (Langfuse):
from langfuse.llama_index import LlamaIndexCallbackHandler

langfuse_handler = LlamaIndexCallbackHandler(
    public_key=os.getenv("LANGFUSE_PUBLIC_KEY"),
    secret_key=os.getenv("LANGFUSE_SECRET_KEY"),
    host=os.getenv("LANGFUSE_HOST")
)

# Register with workflow
workflow = UnifiedWorkflow(
    callbacks=[langfuse_handler],
    timeout=600
)

Propagate User/Session Attributes:

Remove Phoenix Configuration File:

# Backup first (optional)
cp main/src/monitoring/phoenix_config.py main/src/monitoring/phoenix_config.py.bak

# Remove
rm main/src/monitoring/phoenix_config.py

Update Imports:

Use automated script:

python .claude/skills/langfuse-integration/scripts/remove_phoenix.py \
  --target main/src/ \
  --dry-run  # Preview changes

Manual pattern:

# Remove all instances of:
# - from phoenix.otel import register
# - from phoenix import ...
# - import phoenix
# - Any calls to phoenix.trace(), register(), etc.

Remove Phoenix from Dependencies:

# Remove from pyproject.toml
uv remove arize-phoenix arize-phoenix-otel

Update Monitoring Module Init:

# main/src/monitoring/__init__.py
# OLD:
# from .phoenix_config import setup_phoenix, PhoenixManager

# NEW:
from .langfuse_config import setup_langfuse, get_langfuse_client

__all__ = ["setup_langfuse", "get_langfuse_client"]

Remove Phoenix Server Command (if applicable):

# Check if phoenix serve is in any scripts
grep -r "phoenix serve" . --include="*.sh" --include="*.py" --include="*.md"

# Remove or comment out

Run Integration Health Check:

python .claude/skills/langfuse-integration/scripts/validate_integration.py

Expected output:

✅ Langfuse SDK installed
✅ API keys configured
✅ Cloud connectivity successful
✅ Test trace created: trace_id=xxx
✅ @observe decorators found: 15
✅ Callback handler configured
❌ No Phoenix imports found (expected)

Run End-to-End Workflow:

# Execute test workflow with real URS
uv run python main/main.py --urs examples/test_urs_001.md

Verify Trace in Dashboard:
- Navigate to: https://cloud.langfuse.com/project/cmhuwhcfe006yad06cqfub107/traces
- Find most recent trace by timestamp
- Check:
  - ✅ Trace appears (not 404)
  - ✅ Span count matches expected (compare to Phoenix baseline)
  - ✅ User ID populated
  - ✅ Session ID populated
  - ✅ Tags include "pharmaceutical", "gamp5"
  - ✅ GAMP-5 category metadata present
  - ✅ No errors in observations

Compare Span Structure:

# If Phoenix baseline available, compare span counts
echo "Phoenix baseline: 131 spans/workflow"
echo "Langfuse actual: <count from dashboard>"
# Acceptable range: 120-140 (some variation expected)

Test Compliance Attributes:
- Click on categorization span in dashboard
- Verify metadata contains:
  - compliance.gamp5.category: 1-5
  - compliance.alcoa_plus.attributable: true
  - user.clerk_id: <actual user ID>
  - job.id: <actual job ID>
Run Existing Tests:

Update Quick Start Guide:
- Edit main/docs/guides/QUICK_START_GUIDE.md
- Replace Phoenix setup instructions with Langfuse
- Update environment variable examples
- Add Langfuse dashboard URL
Update README:
- Replace Phoenix badge/link with Langfuse
- Update observability section
- Add Langfuse Cloud (EU) data residency note

Create Migration Notes:

# Phoenix → Langfuse Migration Summary

**Date**: <YYYY-MM-DD>
**Scope**: Complete Phoenix replacement

## Changes Made
- Removed: phoenix_config.py, Phoenix dependencies
- Added: langfuse_config.py, Langfuse SDK
- Instrumented: 15 functions with @observe decorators
- Replaced: LlamaIndex callback handler

## Verification
- Trace count: 131 spans/workflow (matches Phoenix baseline)
- Dashboard URL: https://cloud.langfuse.com/project/cmhuwhcfe006yad06cqfub107
- Compliance: GAMP-5 + ALCOA+ attributes preserved

## Rollback (if needed)
- Restore phoenix_config.py.bak
- Run: uv add arize-phoenix arize-phoenix-otel
- Remove @observe decorators

Update CLAUDE.md:
- Replace Phoenix references in "Technology Stack" section
- Update observability commands
- Add Langfuse skill invocation instructions

Commit Changes:

git add -A
git status  # Review changes

# Commit with detailed message
git commit -m "$(cat <<'EOF'
feat: Replace Phoenix with Langfuse Cloud (EU) observability

- Add Langfuse SDK and LlamaIndex instrumentation
- Add @observe decorators to 15 workflow/agent functions
- Configure Langfuse Cloud (EU) with GAMP-5 compliance attributes
- Remove Phoenix dependencies and configuration
- Verify trace parity: 131 spans/workflow maintained
- Update documentation (Quick Start, README, CLAUDE.md)

Task: PRP 2.3 (LangFuse Integration and Dashboard)
Validation: All tests passing, traces visible in dashboard

🤖 Generated with Claude Code

Co-Authored-By: Claude <[email protected]>
EOF
)"

ModuleNotFoundError: No module named 'langfuse'

uv add langfuse llama-index-instrumentation-langfuse
uv sync

Check API keys:

import os
print(f"Public key: {os.getenv('LANGFUSE_PUBLIC_KEY')[:10]}...")
print(f"Secret key configured: {bool(os.getenv('LANGFUSE_SECRET_KEY'))}")

Check flush call:

from langfuse import get_client
client = get_client()
client.flush()  # CRITICAL: Must flush before exit

Check network connectivity:
```
curl -I https://cloud.langfuse.com
```

# Ensure get_current_observation() is called inside decorated function
from langfuse import observe, get_current_observation

@observe()
def my_function():
    obs = get_current_observation()
    if obs:  # CRITICAL: Check if obs exists
        obs.update(metadata={"compliance.gamp5.category": 5})

# Find missing decorators
grep -r "async def" main/src/agents/ --include="*.py" | \
  grep -v "@observe"

# Tune batch settings
from langfuse import Langfuse

client = Langfuse(
    flush_interval=5,  # Flush every 5 seconds instead of 1
    flush_at=50,       # Batch 50 events before flushing
)

from langfuse import get_client

langfuse = get_client()

def complex_workflow():
    with langfuse.start_as_current_span(
        name="complex-workflow",
        as_type="span"
    ) as span:
        span.update(input={"mode": "batch"})

        # Manual sub-span creation
        with langfuse.start_as_current_span(
            name="data-validation",
            as_type="span"
        ) as sub_span:
            validate_data()
            sub_span.update(output={"valid": True})

        # Main logic
        result = process_data()

        span.update(output=result)

from langfuse import get_current_observation

obs = get_current_observation()
if obs:
    obs.event(
        name="gamp5-category-assigned",
        metadata={
            "category": 5,
            "confidence": 0.95,
            "timestamp": datetime.now().isoformat()
        }
    )

from langfuse import observe, get_current_trace

@observe()
async def multi_tenant_workflow(org_id: str, user_id: str):
    trace = get_current_trace()
    if trace:
        trace.update(
            user_id=user_id,
            tags=[f"org:{org_id}", "gamp5"],
            metadata={
                "organization.id": org_id,
                "organization.name": get_org_name(org_id),
                "compliance.data_residency": "EU"
            }
        )

    # Workflow logic
    ...

# In API endpoint or workflow entry point
from langfuse import observe, get_current_trace

@observe()
async def generate_test_suite(user_id: str, urs_file: str, job_id: str):
    # Set trace-level attributes
    trace = get_current_trace()
    if trace:
        trace.update(
            user_id=user_id,
            session_id=job_id,
            tags=["pharmaceutical", "gamp5"],
            metadata={
                "compliance.alcoa_plus.attributable": True,
                "user.clerk_id": user_id,
                "job.id": job_id
            }
        )

    # All nested operations inherit these attributes
    result = await unified_workflow.run(urs_file)
    return result

# Ensure no regressions
pytest main/tests/ -v

# Check for import errors
mypy main/src/

# Check for Phoenix references
ruff check main/src/

Langfuse Integration Skill | Skills Pool

Langfuse Integration Skill

Langfuse Integration Skill

When to Use This Skill

Prerequisites

Workflow Phases

Phase 1: Assessment and Analysis (5-10 minutes)

Phase 2: Langfuse Configuration Setup (10-15 minutes)

Phase 3: Code Instrumentation (20-30 minutes)

Phase 4: Phoenix Removal (10-15 minutes)

Phase 5: Validation and Testing (15-20 minutes)

Phase 6: Documentation and Finalization (5-10 minutes)

Success Criteria

Functional Requirements

Observability Requirements

Compliance Requirements

Quality Requirements

Documentation Requirements

Troubleshooting

Issue: Langfuse SDK Import Error

Issue: Traces Not Appearing in Dashboard

Issue: Missing Compliance Attributes

Issue: Span Count Mismatch

Issue: High Latency After Migration

Reference Materials

Decorator Patterns

Phoenix Migration Guide

Compliance Attributes

Advanced Usage

Context Manager Pattern (Fine-Grained Control)

Custom Event Tracking

Multi-Tenant Attribution

Skill Completion Checklist

Post-Migration: Next Steps

Bluebubbles

Add Tracing

Analytics Events

Add Expert

Arthas

Arthas Eagleeye Traceid