Amazon EC2 (Elastic Compute Cloud)

EC2 = Elastic Compute Cloud → AWS’s most popular IaaS (Infrastructure as a Service) offering.
With EC2, you can:
- Rent virtual machines (instances).
- Attach storage with EBS (Elastic Block Store) or EFS.
- Distribute traffic with ELB (Elastic Load Balancing).
- Scale automatically with Auto Scaling Groups (ASG).
Why it matters: Understanding EC2 is fundamental to understanding the AWS Cloud.

EC2 Sizing & Configuration Options

When launching an EC2 instance, you configure:

Operating System (OS): Linux, Windows, or macOS.
CPU: Number of vCPUs and compute power.
Memory (RAM): Amount of memory allocated.
Storage Options:
- Network-attached storage: EBS or EFS.
- Instance Store (local hardware): Temporary but very fast.
Networking: Network card bandwidth, public IP address.
Firewall: Security Groups control inbound/outbound traffic.
Bootstrap Scripts: EC2 User Data can configure the instance at first launch (e.g., install packages, run setup commands).

💡 Exam Tip:

For machine learning workloads, AWS offers specialized EC2 families and custom silicon:

Custom ML training chip designed for deep learning models with 100B+ parameters.
Trn1 instances have up to 16 Trainium accelerators.
50% lower training cost compared to GPU-based instances.

👉 Exam Tip:

EC2 = Virtual servers in the cloud.
Storage: EBS/EFS for persistent storage, Instance Store for temporary high-speed storage.
ELB & ASG: Scaling and load balancing.
Security Group: Acts as firewall.
User Data: Automates setup at instance launch.
AI Workloads:
- GPU Instances (P, G series) for ML/DL.
- Trainium (Trn1) for training.
- Inferentia (Inf1/Inf2) for inference.

💡 Common Exam Question Patterns:

“Which instance type is best for large-scale ML model training?” → Trn1 (Trainium).
“Which service provides cost-effective inference at scale?” → Inferentia.
“Where do you configure startup scripts for EC2?” → EC2 User Data.