Name: Aws Ec2
Author: jacobscunn07

Component	What You Configure
Instance type	CPU, memory, network, storage
AMI	OS, pre-installed software, root volume snapshot
Storage	EBS volumes and/or instance store
Networking	VPC, subnet, security groups, Elastic IP
IAM role	Permissions for the instance to call AWS APIs
User data	Bootstrap script run on first launch

m  6  i  d  .  2xlarge
│  │  │  │      └── Size
│  │  │  └── Additional capability (d = local NVMe storage, n = network optimized, b = block storage optimized)
│  │  └── Processor (i = Intel, a = AMD, g = Graviton, no letter = varies)
│  └── Generation (higher = newer)
└── Family (m = general purpose, c = compute, r = memory, i = storage, g = GPU, p = ML training)
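The naming breakdown above can be sketched as a small parser. This is a simplified illustration rather than an AWS API: `parse_instance_type`, the `InstanceType` tuple, and the regular expression are names assumed for this example, and the pattern deliberately rejects irregular names such as the `u-*` high-memory types.

```python
import re
from typing import NamedTuple, Optional

# Processor letters taken from the naming diagram above.
PROCESSORS = {"i": "Intel", "a": "AMD", "g": "Graviton"}

# family letters, generation digits, optional attribute letters, ".", size
_PATTERN = re.compile(r"^([a-z]+)(\d+)([a-z]*)\.([a-z0-9]+)$")


class InstanceType(NamedTuple):
    family: str
    generation: int
    processor: Optional[str]  # None when the name carries no processor letter
    capabilities: str         # remaining attribute letters, e.g. "d" or "n"
    size: str


def parse_instance_type(name: str) -> InstanceType:
    """Split a name such as 'm6id.2xlarge' into its documented parts."""
    match = _PATTERN.match(name)
    if match is None:
        raise ValueError(f"unsupported instance type name: {name!r}")
    family, generation, attrs, size = match.groups()
    processor = None
    # Only the first attribute letter is treated as a processor marker.
    if attrs and attrs[0] in PROCESSORS:
        processor = PROCESSORS[attrs[0]]
        attrs = attrs[1:]
    return InstanceType(family, int(generation), processor, attrs, size)
```

For example, `parse_instance_type("m6id.2xlarge")` yields family `m`, generation `6`, processor `Intel`, capability `d`, and size `2xlarge`.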

Family	Examples	Optimized For	Choose When
General Purpose	m7i, m7g, t3	Balanced CPU/memory	Web servers, app servers, small DBs
Compute Optimized	c7i, c7g, c6a	High CPU:memory ratio	Batch, HPC, gaming, video encoding
Memory Optimized	r7i, r7g, x2idn, u-*	High memory:CPU ratio	In-memory DBs, Redis, SAP HANA
Storage Optimized	i4i, i3en, d3	High disk I/O	NoSQL DBs, data warehouses, Hadoop
Accelerated Computing	p5, g5, trn1, inf2	GPU / custom silicon	ML training/inference, graphics
HPC Optimized	hpc7g, hpc6id	High-bandwidth networking	Tightly coupled HPC simulations

Option	Discount vs On-Demand	Commitment	Interruption	Best For
On-Demand	0%	None	Never	Dev/test, unpredictable workloads, baseline
Compute Savings Plans	Up to 66%	1 or 3 yr	Never	Flexible steady-state compute (any family/region)
EC2 Instance Savings Plans	Up to 72%	1 or 3 yr	Never	Steady-state, single instance family per region
Reserved Instances	Up to 72%	1 or 3 yr	Never	Stable workloads with predictable configuration
Spot Instances	Up to 90%	None	Yes (2 min warning)	Fault-tolerant batch, stateless, flexible jobs
Dedicated Hosts	On-Demand or RI rates	Optional	Never	BYOL (license compliance), regulatory isolation
Capacity Reservations	On-Demand rates	None	Never	Guarantee capacity in a specific AZ
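One way to read the discount column: a commitment bills every hour at the discounted rate, while On-Demand bills only the hours actually used. Under that simplified model the commitment wins once utilization exceeds `1 - discount`. The sketch below assumes the full "up to" discount is achieved and ignores upfront-payment effects; the function names are illustrative.

```python
def break_even_utilization(discount: float) -> float:
    """Fraction of hours an instance must run for a commitment to pay off.

    Commitment cost per hour of the term: (1 - discount) * price
    On-Demand cost per hour of the term:  utilization * price
    The two are equal when utilization == 1 - discount.
    """
    if not 0.0 <= discount < 1.0:
        raise ValueError("discount must be in [0, 1)")
    return 1.0 - discount


def cheaper_option(discount: float, expected_utilization: float) -> str:
    """Return 'commitment' or 'on-demand' under the simplified model above."""
    if expected_utilization > break_even_utilization(discount):
        return "commitment"
    return "on-demand"
```

With a 72% discount the break-even point is roughly 28% utilization, so a workload that runs half the time already favors the commitment; with a 30% discount it does not.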

# Check for a Spot interruption notice from inside the instance (IMDSv2)
TOKEN=$(curl -s -X PUT "http://169.254.169.254/latest/api/token" \
  -H "X-aws-ec2-metadata-token-ttl-seconds: 21600")
curl -s -H "X-aws-ec2-metadata-token: $TOKEN" \
  http://169.254.169.254/latest/meta-data/spot/termination-time
# Returns 404 if no interruption is scheduled; returns a timestamp if one is imminent
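The same check can be done from application code so a worker can react to the notice. The following is a minimal sketch using only the standard library; it assumes the endpoint returns a UTC timestamp in the form `2015-01-05T18:02:00Z`, and the function names are illustrative rather than part of any AWS SDK.

```python
import urllib.error
import urllib.request
from datetime import datetime, timezone
from typing import Optional

IMDS_BASE = "http://169.254.169.254/latest"


def fetch_termination_time(timeout: float = 2.0) -> Optional[str]:
    """Return the Spot termination timestamp, or None if no notice is pending."""
    # IMDSv2: obtain a session token first, then present it on the read.
    token_request = urllib.request.Request(
        f"{IMDS_BASE}/api/token",
        method="PUT",
        headers={"X-aws-ec2-metadata-token-ttl-seconds": "21600"},
    )
    with urllib.request.urlopen(token_request, timeout=timeout) as response:
        token = response.read().decode()

    notice_request = urllib.request.Request(
        f"{IMDS_BASE}/meta-data/spot/termination-time",
        headers={"X-aws-ec2-metadata-token": token},
    )
    try:
        with urllib.request.urlopen(notice_request, timeout=timeout) as response:
            return response.read().decode().strip()
    except urllib.error.HTTPError as error:
        if error.code == 404:  # no interruption scheduled
            return None
        raise


def seconds_until(termination_time: str, now: datetime) -> float:
    """Seconds remaining before the reported termination time (assumed UTC)."""
    deadline = datetime.strptime(termination_time, "%Y-%m-%dT%H:%M:%SZ")
    return (deadline.replace(tzinfo=timezone.utc) - now).total_seconds()
```

A worker would typically poll `fetch_termination_time()` every few seconds and begin draining as soon as it returns a value.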

Source	Use When
AWS Quick Start AMIs	Default; well-maintained Amazon Linux, Ubuntu, Windows
AWS Marketplace AMIs	Pre-configured commercial software (NAT appliances, security tools)
Community AMIs	Third-party; evaluate carefully before use in production
Your own custom AMI	Standardize a golden image with your software baked in

Type	Max IOPS	Max Throughput	Min Durability	Use Case
gp3	16,000	1,000 MiB/s	99.8%–99.9%	Default for almost everything
gp2	16,000	250 MiB/s	99.8%–99.9%	Legacy; migrate to gp3
io2 Block Express	256,000	4,000 MiB/s	99.999%	High-performance DBs, <500μs latency
io1	64,000	1,000 MiB/s	99.8%–99.9%	I/O intensive DBs (prefer io2 for new volumes)
st1	500	500 MiB/s	99.8%–99.9%	Big data, log processing, streaming reads
sc1	250	250 MiB/s	99.8%–99.9%	Infrequently accessed cold data

Type       Protocol   Port Range   Source/Destination
SSH        TCP        22           10.0.0.0/8
HTTPS      TCP        443          0.0.0.0/0
Custom TCP TCP        8080         sg-0abc123 (another SG ID)

Internet → ALB SG (allow 80, 443 from 0.0.0.0/0)
ALB SG   → App SG (allow 8080 from ALB SG ID)
App SG   → DB SG  (allow 5432 from App SG ID)
App SG   → 0.0.0.0/0 (allow 443 outbound for AWS API calls)
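The chained pattern above can be modeled in a few lines to check which hops are permitted. This is a toy model for reasoning about rules, not how security groups are evaluated by AWS; the group names (`alb-sg`, `app-sg`, `db-sg`) and the `is_allowed` helper are hypothetical.

```python
import ipaddress
from typing import Optional

# Inbound rules per group as (port, source). A source is either a CIDR block
# or the name of another security group, mirroring the pattern above.
INBOUND = {
    "alb-sg": [(80, "0.0.0.0/0"), (443, "0.0.0.0/0")],
    "app-sg": [(8080, "alb-sg")],
    "db-sg": [(5432, "app-sg")],
}


def is_allowed(
    target_sg: str,
    port: int,
    source_sg: Optional[str] = None,
    source_ip: Optional[str] = None,
) -> bool:
    """Return True if any inbound rule on target_sg admits the traffic."""
    for rule_port, source in INBOUND.get(target_sg, []):
        if rule_port != port:
            continue
        if "/" in source:
            # CIDR rule: match on the caller's IP address.
            if source_ip is not None and (
                ipaddress.ip_address(source_ip) in ipaddress.ip_network(source)
            ):
                return True
        elif source == source_sg:
            # Group-reference rule: match on the caller's security group.
            return True
    return False
```

In this model the internet can reach the ALB on 443, the ALB can reach the app tier on 8080, and nothing except the app tier can reach the database on 5432.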

pending → running → stopping → stopped → pending (restart)
                 ↘              ↓
                  shutting-down → terminated

State	Billing	EBS Root Volume	Instance Store
pending	Not charged	Attached	Attached
running	Charged	Attached	Data persists
stopping	Not charged	Retained	Data preserved until stopped
stopped	EBS storage only	Retained	Data lost
shutting-down	Not charged	Being deleted (default)	Data lost
terminated	Not charged	Deleted (default)	Data lost
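The lifecycle diagram can be expressed as a transition table, which is handy when validating state sequences in tests or tooling. The sketch below encodes only the arrows drawn in the diagram above; `TRANSITIONS`, `can_transition`, and `is_valid_path` are illustrative names.

```python
from typing import Dict, Sequence, Set

# Allowed next states, following the arrows in the lifecycle diagram.
TRANSITIONS: Dict[str, Set[str]] = {
    "pending": {"running"},
    "running": {"stopping", "shutting-down"},
    "stopping": {"stopped"},
    "stopped": {"pending", "shutting-down"},
    "shutting-down": {"terminated"},
    "terminated": set(),  # terminal state
}


def can_transition(current: str, target: str) -> bool:
    """True if the diagram has an arrow from current to target."""
    return target in TRANSITIONS.get(current, set())


def is_valid_path(states: Sequence[str]) -> bool:
    """True if every consecutive pair of states is an allowed transition."""
    return all(can_transition(a, b) for a, b in zip(states, states[1:]))
```

A stop/start cycle (`pending`, `running`, `stopping`, `stopped`, `pending`, `running`) is a valid path, while nothing leads out of `terminated`.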

{
  "LaunchTemplateName": "my-app-lt",
  "LaunchTemplateData": {
    "ImageId": "ami-0abcdef1234567890",
    "InstanceType": "m7g.large",
    "IamInstanceProfile": { "Name": "my-app-instance-profile" },
    "SecurityGroupIds": ["sg-0abc123"],
    "UserData": "IyEvYmluL2Jhc2g...",
    "MetadataOptions": {
      "HttpTokens": "required",
      "HttpPutResponseHopLimit": 1
    },
    "BlockDeviceMappings": [{
      "DeviceName": "/dev/xvda",
      "Ebs": {
        "VolumeSize": 30,
        "VolumeType": "gp3",
        "Encrypted": true,
        "DeleteOnTermination": true
      }
    }]
  }
}

#!/bin/bash
yum update -y
yum install -y amazon-cloudwatch-agent
/opt/aws/amazon-cloudwatch-agent/bin/amazon-cloudwatch-agent-ctl \
  -a fetch-config -m ec2 -s \
  -c ssm:/my-app/cloudwatch-config
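When a script like this is placed in a launch template, the `UserData` field carries it base64-encoded, which is why the template value begins with `IyEvYmluL2Jhc2g` (the encoding of `#!/bin/bash`). A minimal sketch of the round trip, with helper names chosen for illustration:

```python
import base64


def encode_user_data(script: str) -> str:
    """Base64-encode a user data script for a launch template's UserData field."""
    return base64.b64encode(script.encode("utf-8")).decode("ascii")


def decode_user_data(encoded: str) -> str:
    """Decode a UserData value back to the original script text."""
    return base64.b64decode(encoded).decode("utf-8")
```

Decoding an existing template's `UserData` with `decode_user_data` is a quick way to confirm what a fleet actually runs at boot.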

Component	Purpose
Auto Scaling Group (ASG)	Defines min/desired/max capacity and where to run
Launch Template	Defines what to run (AMI, instance type, etc.)
Scaling Policy	Rules for when to add or remove capacity
Health Checks	Determines when an instance is unhealthy and must be replaced

Type	How It Works	Best For
Target Tracking	Maintain a CloudWatch metric at a target value	Default choice — simplest to configure
Step Scaling	Add/remove N instances at defined alarm thresholds	Faster reaction to burst traffic
Scheduled	Scale at specific times	Predictable traffic patterns (business hours, batch windows)
Predictive	ML forecast of future load	Daily/weekly cycles

{
  "TargetValue": 70.0,
  "PredefinedMetricSpecification": {
    "PredefinedMetricType": "ASGAverageCPUUtilization"
  },
  "DisableScaleIn": false
}
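To build intuition for what a target value of 70 implies, target tracking can be approximated as proportional scaling: capacity grows or shrinks by the ratio of the observed metric to the target. AWS does not publish the exact algorithm, so the sketch below is an approximation for capacity planning only; the function name and parameters are assumptions.

```python
import math


def estimate_desired_capacity(
    current_capacity: int,
    metric_value: float,
    target_value: float,
    min_size: int,
    max_size: int,
) -> int:
    """Approximate the capacity that would bring the metric back to target."""
    if target_value <= 0:
        raise ValueError("target_value must be positive")
    # Proportional estimate, rounded up so the metric lands at or below target.
    raw = math.ceil(current_capacity * metric_value / target_value)
    # The group never leaves its configured bounds.
    return max(min_size, min(max_size, raw))
```

For example, four instances averaging 90% CPU against a 70% target suggest about six instances, while the same four at 35% suggest two, subject to the group's minimum and maximum.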

"MixedInstancesPolicy": {
  "InstancesDistribution": {
    "OnDemandBaseCapacity": 2,
    "OnDemandPercentageAboveBaseCapacity": 20,
    "SpotAllocationStrategy": "capacity-optimized"
  },
  "LaunchTemplate": {
    "LaunchTemplateSpecification": { "LaunchTemplateName": "my-app-lt" },
    "Overrides": [
      { "InstanceType": "m7g.large" },
      { "InstanceType": "m6g.large" },
      { "InstanceType": "m7i.large" },
      { "InstanceType": "m6i.large" }
    ]
  }
}
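The two On-Demand settings combine as "a fixed base, then a percentage of everything above it". The sketch below works through that arithmetic; rounding the On-Demand share up is an assumption made here, and `split_capacity` is an illustrative helper, not an AWS API.

```python
import math
from typing import Tuple


def split_capacity(
    desired: int, on_demand_base: int, on_demand_pct_above_base: int
) -> Tuple[int, int]:
    """Return (on_demand, spot) instance counts for a desired capacity."""
    # The base is filled with On-Demand first, up to the desired capacity.
    base = min(desired, on_demand_base)
    above_base = desired - base
    # Assumption: the On-Demand share above the base is rounded up.
    on_demand_above = math.ceil(above_base * on_demand_pct_above_base / 100)
    on_demand = base + on_demand_above
    return on_demand, desired - on_demand
```

With a base of 2 and 20% above base, a desired capacity of 12 works out to 4 On-Demand and 8 Spot instances.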

Launch: Pending → Pending:Wait → (your hook runs) → Pending:Proceed → InService
Terminate: Terminating → Terminating:Wait → (your hook runs) → Terminating:Proceed → Terminated

Internet
    │
    ALB (public subnets, multi-AZ)
    │   Listener: 443 → target group
    │
ASG (private subnets, min=2, desired=4, max=20)
    Instance type: m7g.large
    Scaling: Target tracking on ALBRequestCountPerTarget
    Health check: ELB
    Mixed policy: 2 On-Demand base + rest Spot (capacity-optimized)

S3 event → SQS queue → ASG (Spot only)
    Min=0, Max=100
    Scaling: Target tracking on SQS ApproximateNumberOfMessagesVisible
    Mixed: c7g.xlarge, c6g.xlarge, c7i.xlarge (diversified; arm64 and x86_64 types need separate AMIs via per-override launch templates)
    User data: Poll SQS, process, delete message, drain on SIGTERM
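The "poll, process, delete, drain on SIGTERM" loop can be sketched independently of any SDK. In the example below the queue operations are injected as plain callables, so `receive`, `process`, and `delete` are hypothetical stand-ins for real SQS calls, and `max_empty_polls` is an added convenience for exiting when the queue stays empty.

```python
import signal
from typing import Callable, Optional


class QueueWorker:
    """Processes one message at a time and stops cleanly when asked."""

    def __init__(
        self,
        receive: Callable[[], Optional[str]],
        process: Callable[[str], None],
        delete: Callable[[str], None],
    ) -> None:
        self._receive = receive
        self._process = process
        self._delete = delete
        self.stopping = False

    def request_stop(self, signum=None, frame=None) -> None:
        # Signature matches a signal handler; only sets a flag.
        self.stopping = True

    def run(self, max_empty_polls: Optional[int] = None) -> int:
        """Run until stopped; return the number of messages handled."""
        handled = 0
        empty_polls = 0
        # The flag is checked between messages, so an in-flight message is
        # always processed and deleted before the loop exits.
        while not self.stopping:
            message = self._receive()
            if message is None:
                empty_polls += 1
                if max_empty_polls is not None and empty_polls >= max_empty_polls:
                    break
                continue
            empty_polls = 0
            self._process(message)
            self._delete(message)
            handled += 1
        return handled


def install_sigterm_handler(worker: QueueWorker) -> None:
    """Drain on SIGTERM instead of dying mid-message."""
    signal.signal(signal.SIGTERM, worker.request_stop)
```

Deleting only after processing succeeds means an interrupted message becomes visible again and is retried by another worker, which is what makes the pattern safe on Spot.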

EC2 Image Builder pipeline (weekly)
    Base: Amazon Linux 2023
    Components: CloudWatch agent, SSM agent, security hardening
    Test: Launch instance, run integration tests
    Output: New AMI version → SSM Parameter /my-org/golden-ami/latest

ASG launch templates reference SSM parameter (not hardcoded AMI ID)
Instance refresh on new parameter value → rolling replacement
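Referencing the parameter instead of an AMI ID relies on the `resolve:ssm:` prefix that launch templates accept in the `ImageId` field. A small sketch of building that fragment and deciding when a refresh is due; both helper names are illustrative, and the parameter path is the one used in the pipeline above.

```python
from typing import Any, Dict


def launch_template_data(ami_parameter: str, instance_type: str) -> Dict[str, Any]:
    """Build launch template data that resolves the AMI from an SSM parameter."""
    return {
        # Resolved to a concrete AMI ID when instances are launched.
        "ImageId": f"resolve:ssm:{ami_parameter}",
        "InstanceType": instance_type,
    }


def needs_refresh(running_ami_id: str, latest_ami_id: str) -> bool:
    """True when instances run an AMI other than the latest published one."""
    return running_ami_id != latest_ami_id
```

For example, `launch_template_data("/my-org/golden-ami/latest", "m7g.large")` produces an `ImageId` of `resolve:ssm:/my-org/golden-ami/latest`.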

Symptom	Likely Cause
`InsufficientInstanceCapacity`	No On-Demand capacity in the AZ — try a different AZ or instance type
`InstanceLimitExceeded`	Reached vCPU quota — request a limit increase via Service Quotas
Instance stuck in `pending`	Check system log (`get-console-output`) for OS boot errors
`InvalidAMIID.NotFound`	AMI doesn't exist in this region — copy the AMI first
Instance launched but unreachable	Check security group inbound rules, subnet route table, NACL
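The advice for `InsufficientInstanceCapacity` (try another AZ or instance type) can be automated as an ordered fallback. The sketch below is SDK-agnostic: `LaunchError`, `launch_with_fallback`, and the fake launcher are hypothetical names for illustration, and a real `launch` callable would wrap the actual API call and translate its error code.

```python
from typing import Callable, Iterable, Optional, Set, Tuple

Candidate = Tuple[str, str]  # (instance_type, availability_zone)


class LaunchError(Exception):
    """Hypothetical wrapper carrying the error code of a failed launch."""

    def __init__(self, code: str) -> None:
        super().__init__(code)
        self.code = code


def launch_with_fallback(
    launch: Callable[[str, str], None], candidates: Iterable[Candidate]
) -> Optional[Candidate]:
    """Try candidates in order; return the first that launches, else None."""
    for instance_type, zone in candidates:
        try:
            launch(instance_type, zone)
            return instance_type, zone
        except LaunchError as error:
            # Only capacity errors are worth retrying elsewhere.
            if error.code != "InsufficientInstanceCapacity":
                raise
    return None


def make_fake_launcher(unavailable: Set[Candidate]) -> Callable[[str, str], None]:
    """Test double: raises a capacity error for the given combinations."""

    def launch(instance_type: str, zone: str) -> None:
        if (instance_type, zone) in unavailable:
            raise LaunchError("InsufficientInstanceCapacity")

    return launch
```

Other errors, such as quota or AMI problems, are re-raised immediately because retrying in a different AZ would not fix them.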

Symptom	Likely Cause
SSH timeout	Security group doesn't allow port 22 from your IP; or instance in private subnet with no bastion/VPN
`Permission denied (publickey)`	Wrong key pair or wrong user (`ec2-user` for Amazon Linux, `ubuntu` for Ubuntu)
`Host key verification failed`	Instance replaced but known_hosts has old key — remove old entry
No response via SSM Session Manager	SSM agent not running; instance IAM role missing `AmazonSSMManagedInstanceCore` policy; or no SSM endpoint/NAT

Symptom	Likely Cause
ASG not scaling out	Scaling policy cooldown active; or instance launch failing and causing `Failed` activity
Instances launched but immediately terminated	ELB health check failing — check app health endpoint, security group allows health check port from ALB
Desired count keeps oscillating	Scale-in cooldown too short; or CloudWatch metric is noisy — add a buffer to the target value
Instance refresh stuck	Minimum healthy percentage too high to replace all instances — reduce or wait

Symptom	Likely Cause
High disk latency on gp2	Volume out of burst credits — migrate to gp3 with explicit IOPS provisioned
IOPS lower than provisioned on io2	Instance not EBS-optimized, or not on Nitro hypervisor
Throughput bottleneck	Per-instance EBS bandwidth limit reached; compare the `EBSReadBytes` and `EBSWriteBytes` CloudWatch metrics against the instance type's EBS bandwidth
`No space left on device`	Volume full — expand EBS volume online (`modify-volume`) then extend filesystem

Attribute	gp2	gp3
Baseline IOPS	3 IOPS/GiB (min 100, max 16,000)	3,000 (flat baseline)
Throughput	Up to 250 MiB/s	Up to 1,000 MiB/s
IOPS provisioning	Tied to size	Independent (up to 16,000)
Cost	Higher	~20% cheaper
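The gp2 column of the comparison can be turned into a quick calculation. The baseline formula comes from the table; the burst figures (3,000 IOPS burst ceiling and a 5.4 million I/O credit bucket) are taken from AWS's gp2 documentation rather than from the table, and the function names are illustrative.

```python
import math

GP2_BURST_IOPS = 3_000         # burst ceiling for small gp2 volumes
GP2_CREDIT_BUCKET = 5_400_000  # I/O credits in a full burst bucket
GP3_BASELINE_IOPS = 3_000      # flat baseline regardless of size


def gp2_baseline_iops(size_gib: int) -> int:
    """3 IOPS per GiB, floored at 100 and capped at 16,000."""
    return max(100, min(16_000, 3 * size_gib))


def gp2_burst_seconds(size_gib: int) -> float:
    """How long a full credit bucket sustains the burst rate.

    Credits drain at (burst - baseline) per second; volumes whose baseline
    already meets the burst rate never run out.
    """
    baseline = gp2_baseline_iops(size_gib)
    if baseline >= GP2_BURST_IOPS:
        return math.inf
    return GP2_CREDIT_BUCKET / (GP2_BURST_IOPS - baseline)
```

A 100 GiB gp2 volume has a 300 IOPS baseline and can burst for about 2,000 seconds before falling back to it, whereas gp3 provides 3,000 IOPS at any size, which is the usual reason small gp2 volumes show sudden latency spikes.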

Aws Ec2

AWS EC2 Expert Skill

When to Use This Skill

Core Concepts

Instance Types

Naming Convention

Instance Families

T-Series Burstable Instances

Graviton (AWS Custom ARM)

Sizing Guidance

Purchasing Options

Savings Plans vs. Reserved Instances

Spot Instance Strategy

Dedicated Hosts

AMIs (Amazon Machine Images)

AMI Sources

Creating a Golden AMI

AMI Regions

Root Device Types

EBS Storage

Volume Types

gp3 vs gp2

Instance Store

Security Groups

Key Behaviors

Rule Structure

Common Security Group Patterns

Instance Lifecycle

Launch Templates

Minimal Launch Template

User Data

Auto Scaling

Key Components

Scaling Policy Types

Mixed Instance Types (Spot + On-Demand)

Health Checks

Lifecycle Hooks

Architecture Patterns

Standard Auto-Scaled Web Tier

Spot Batch Processing

Golden AMI Pipeline

Security Best Practices

Common Troubleshooting

Instance Won't Launch

Can't Connect to Instance

Auto Scaling Issues

EBS Performance Issues
