スキルを検索.../

Azure Resource Health Diagnose | Skills Pool

Find Monitoring Sources:
- Use azmcp-monitor-workspace-list to identify Log Analytics workspaces
- Locate Application Insights instances associated with the resource
- Identify relevant log tables using azmcp-monitor-table-list

Execute Diagnostic Queries: Use azmcp-monitor-log-query with targeted KQL queries based on resource type:

General Error Analysis:

// Recent errors and exceptions
union isfuzzy=true 
    AzureDiagnostics,
    AppServiceHTTPLogs,
    AppServiceAppLogs,
    AzureActivity
| where TimeGenerated > ago(24h)
| where Level == "Error" or ResultType != "Success"
| summarize ErrorCount=count() by Resource, ResultType, bin(TimeGenerated, 1h)
| order by TimeGenerated desc

Performance Analysis:

// Performance degradation patterns
Perf
| where TimeGenerated > ago(7d)
| where ObjectName == "Processor" and CounterName == "% Processor Time"
| summarize avg(CounterValue) by Computer, bin(TimeGenerated, 1h)
| where avg_CounterValue > 80

Application-Specific Queries:

// Application Insights - Failed requests
requests
| where timestamp > ago(24h)
| where success == false
| summarize FailureCount=count() by resultCode, bin(timestamp, 1h)
| order by timestamp desc

// Database - Connection failures
AzureDiagnostics
| where ResourceProvider == "MICROSOFT.SQL"
| where Category == "SQLSecurityAuditEvents"
| where action_name_s == "CONNECTION_FAILED"
| summarize ConnectionFailures=count() by bin(TimeGenerated, 1h)

Pattern Recognition:
- Identify recurring error patterns or anomalies
- Correlate errors with deployment times or configuration changes
- Analyze performance trends and degradation patterns
- Look for dependency failures or external service issues

Display Health Assessment Summary:

🏥 Azure Resource Health Assessment

📊 Resource Overview:
• Resource: [Name] ([Type])
• Status: [Healthy/Warning/Critical]
• Location: [Region]
• Last Analyzed: [Timestamp]

🚨 Issues Identified:
• Critical: X issues requiring immediate attention
• High: Y issues affecting performance/reliability  
• Medium: Z issues for optimization
• Low: N informational items

🔍 Top Issues:
1. [Issue Type]: [Description] - Impact: [High/Medium/Low]
2. [Issue Type]: [Description] - Impact: [High/Medium/Low]
3. [Issue Type]: [Description] - Impact: [High/Medium/Low]

🛠️ Remediation Plan:
• Immediate Actions: X items
• Short-term Fixes: Y items  
• Long-term Improvements: Z items
• Estimated Resolution Time: [Timeline]

❓ Proceed with detailed remediation plan? (y/n)

Generate Detailed Report:

# Azure Resource Health Report: [Resource Name]

**Generated**: [Timestamp]  
**Resource**: [Full Resource ID]  
**Overall Health**: [Status with color indicator]

## 🔍 Executive Summary
[Brief overview of health status and key findings]

## 📊 Health Metrics
- **Availability**: X% over last 24h
- **Performance**: [Average response time/throughput]
- **Error Rate**: X% over last 24h
- **Resource Utilization**: [CPU/Memory/Storage percentages]

## 🚨 Issues Identified

### Critical Issues
- **[Issue 1]**: [Description]
  - **Root Cause**: [Analysis]
  - **Impact**: [Business impact]
  - **Immediate Action**: [Required steps]

### High Priority Issues  
- **[Issue 2]**: [Description]
  - **Root Cause**: [Analysis]
  - **Impact**: [Performance/reliability impact]
  - **Recommended Fix**: [Solution steps]

## 🛠️ Remediation Plan

### Phase 1: Immediate Actions (0-2 hours)
```bash
# Critical fixes to restore service
[Azure CLI commands with explanations]

Phase 2: Short-term Fixes (2-24 hours)

# Performance and reliability improvements
[Azure CLI commands with explanations]

Phase 3: Long-term Improvements (1-4 weeks)

# Architectural and preventive measures
[Azure CLI commands and configuration changes]

📈 Monitoring Recommendations

Alerts to Configure: [List of recommended alerts]
Dashboards to Create: [Monitoring dashboard suggestions]
Regular Health Checks: [Recommended frequency and scope]

✅ Validation Steps

Verify issue resolution through logs
Confirm performance improvements
Test application functionality
Update monitoring and alerting
Document lessons learned

📝 Prevention Measures

[Recommendations to prevent similar issues]
[Process improvements]
[Monitoring enhancements]

Azure Resource Health Diagnose

Azure Resource Health & Issue Diagnosis

Prerequisites

Workflow Steps

Step 1: Get Azure Best Practices

Azure Resource Health Diagnose

Azure Resource Health & Issue Diagnosis

Prerequisites

Workflow Steps

Step 1: Get Azure Best Practices

Step 2: Resource Discovery & Identification

Step 3: Health Status Assessment

Step 4: Log & Telemetry Analysis

Step 5: Issue Classification & Root Cause Analysis

Step 6: Generate Remediation Plan

Step 7: User Confirmation & Report Generation

Phase 2: Short-term Fixes (2-24 hours)

Phase 3: Long-term Improvements (1-4 weeks)

📈 Monitoring Recommendations

✅ Validation Steps

📝 Prevention Measures

Error Handling

Success Criteria

Session Logs

OpenClaw Test Heap Leaks

Node Connect

Openclaw Qa Testing

Openclaw Secret Scanning Maintainer

Flags