What is Serverless Computing?
Serverless computing is a cloud execution model where the cloud provider dynamically manages the allocation of machine resources. The term "serverless" does not mean that servers are absent — it means that developers no longer need to think about them. You write code, deploy it, and the provider handles provisioning, scaling, patching, and capacity planning.
The Problem Serverless Solves
Traditionally, running a web application or backend API required you to:
- Provision servers — choose instance types, configure operating systems, install runtimes
- Manage capacity — monitor traffic and manually (or via auto-scaling rules) add or remove instances
- Patch and maintain — apply security updates, rotate certificates, manage OS-level dependencies
- Pay for idle time — a server running at 5 % utilisation still costs the same as one running at 90 %
Serverless eliminates these concerns. You deploy a function or container, define a trigger, and the cloud provider does the rest.
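In practice, "a function" here is just a handler that receives an event. A minimal sketch in Python — the event shape follows API Gateway's proxy format, and the greeting logic is purely illustrative:

```python
import json

def lambda_handler(event, context):
    # Extract an optional query-string parameter from an API Gateway
    # proxy-style event; default to "world" if absent.
    params = event.get("queryStringParameters") or {}
    name = params.get("name", "world")
    return {
        "statusCode": 200,
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps({"message": f"Hello, {name}!"}),
    }
```

Deploy this, attach an HTTP trigger, and the provider handles everything else — there is no server process for you to start or stop.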
Key Characteristics of Serverless
| Characteristic | Description |
|---|---|
| No server management | You never provision, patch, or maintain any servers |
| Event-driven execution | Functions run in response to events — HTTP requests, queue messages, file uploads, schedules |
| Automatic scaling | The platform scales from zero to thousands of concurrent executions with no configuration |
| Pay-per-use billing | You are charged only for the compute time your code actually consumes, measured in milliseconds |
| Ephemeral compute | Each invocation is stateless — the execution environment may be reused but is never guaranteed |
Serverless vs Traditional Architectures
Understanding where serverless fits requires comparing it with other hosting models.
Always-On Servers (EC2 / VMs)
+-------------------------------+
| Your Application |
|-------------------------------|
| Runtime (Node, Python) |
|-------------------------------|
| Operating System |
|-------------------------------|
| Virtual Machine / EC2 |
|-------------------------------|
| Physical Hardware |
+-------------------------------+
You manage: Everything above the hardware
Provider manages: Physical hardware
Containers (ECS / EKS / Fargate)
+-------------------------------+
| Your Application |
|-------------------------------|
| Container Image (Dockerfile) |
|-------------------------------|
| Container Orchestrator |
|-------------------------------|
| Physical Hardware |
+-------------------------------+
You manage: Application + Container definition
Provider manages: Orchestration, OS, hardware
Serverless (Lambda)
+-------------------------------+
| Your Function Code |
|-------------------------------|
| Managed Runtime + Platform |
|-------------------------------|
| Physical Hardware |
+-------------------------------+
You manage: Function code only
Provider manages: Everything else
As you move along this spectrum, you give up control but gain operational simplicity. Serverless sits at the far end — maximum abstraction, minimum operations.
The Serverless Spectrum
Serverless is not a single technology; it is a design philosophy. AWS offers a range of serverless services:
| Category | Serverless Service | Purpose |
|---|---|---|
| Compute | AWS Lambda | Run code without provisioning servers |
| API | Amazon API Gateway | Create, publish, and manage APIs |
| Storage | Amazon S3 | Object storage with event notifications |
| Database | Amazon DynamoDB | Fully managed NoSQL database |
| Messaging | Amazon SQS, SNS | Message queues and pub/sub notifications |
| Orchestration | AWS Step Functions | Coordinate multi-step workflows |
| Event Bus | Amazon EventBridge | Serverless event routing |
| Streaming | Amazon Kinesis Data Firehose | Near-real-time streaming data delivery |
When you combine these services, you build entire applications without managing a single server.
How Serverless Billing Works
Serverless pricing follows a consumption-based model. For AWS Lambda specifically:
Monthly cost = (Number of requests x Price per request)
+ (GB-seconds consumed x Price per GB-second)
Example calculation:
| Metric | Value |
|---|---|
| Requests per month | 10,000,000 |
| Average duration | 200 ms |
| Memory allocated | 256 MB |
| Price per request | $0.20 per million |
| Price per GB-second | $0.0000166667 |
Request charges: 10,000,000 / 1,000,000 x $0.20 = $2.00
Compute charges: 10,000,000 x 0.2s x 0.25 GB x $0.0000166667 = $8.33
Total: $10.33/month
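The arithmetic above can be reproduced in a few lines of Python (prices taken from the table; actual rates vary by region):

```python
requests = 10_000_000
avg_duration_s = 0.200            # 200 ms average duration
memory_gb = 256 / 1024            # 256 MB allocated
price_per_million_requests = 0.20
price_per_gb_second = 0.0000166667

request_charges = (requests / 1_000_000) * price_per_million_requests
gb_seconds = requests * avg_duration_s * memory_gb   # 500,000 GB-seconds
compute_charges = gb_seconds * price_per_gb_second

total = request_charges + compute_charges
print(f"Requests: ${request_charges:.2f}")   # $2.00
print(f"Compute:  ${compute_charges:.2f}")   # $8.33
print(f"Total:    ${total:.2f}/month")       # $10.33/month
```

Note that the cost scales linearly with requests, duration, and memory — halving any one of them roughly halves the compute charge.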
Compare this with an always-on t3.medium EC2 instance at ~$30/month that sits idle most of the time.
Free tier: AWS Lambda includes 1 million free requests and 400,000 GB-seconds per month — enough for many development and low-traffic production workloads.
When to Use Serverless
Serverless is an excellent fit for:
- APIs and microservices — handling HTTP requests with variable traffic
- Data processing pipelines — transforming files as they arrive in S3
- Scheduled tasks — cron-like jobs (e.g., nightly report generation)
- Event-driven workflows — reacting to database changes, queue messages, IoT events
- Prototyping and MVPs — ship fast without infrastructure overhead
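As an example of the data-processing case, a handler reacting to S3 "object created" notifications might look like this — the event shape follows the standard S3 event notification format, and the transform step is left as a placeholder:

```python
def lambda_handler(event, context):
    # A single S3 event notification can carry multiple records.
    processed = []
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        key = record["s3"]["object"]["key"]
        # A real pipeline would fetch and transform the object here
        # (e.g. with boto3); this sketch just notes what arrived.
        processed.append(f"s3://{bucket}/{key}")
    return {"processed": processed}
```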
When Serverless May Not Be Ideal
Not every workload belongs on serverless:
| Scenario | Why Serverless Struggles |
|---|---|
| Long-running processes (> 15 minutes) | Lambda has a maximum timeout of 15 minutes |
| High-performance computing | Limited CPU and memory ceilings |
| Consistent high-throughput | Always-on containers may be cheaper at sustained scale |
| Stateful applications | Each invocation is stateless; external state stores are required |
| Vendor lock-in concerns | Serverless services are tightly coupled to the provider |
The decision is not binary. Many production systems use a hybrid approach — serverless for event-driven components, containers or VMs for long-running or stateful workloads.
Cold Starts Explained
A cold start occurs when the serverless platform must create a new execution environment for your function. This involves:
- Downloading your deployment package
- Initialising the runtime (e.g., starting the Node.js or Python interpreter)
- Running your initialisation code (module imports, database connections)
Cold start timeline:
|--- Platform init ---|--- Runtime init ---|--- Handler execution ---|
~100-500 ms ~50-200 ms Your code runs
After the first invocation, the execution environment is reused for subsequent requests — this is called a warm start, and it skips all three of these initialisation steps.
Factors that affect cold start duration:
- Runtime choice — compiled languages (Go, Rust) typically start faster than VM-based runtimes (Java, .NET)
- Package size — smaller deployment packages initialise faster
- Memory allocation — higher memory settings also provide more CPU, speeding up initialisation
- VPC configuration — functions inside a VPC used to suffer longer cold starts (largely resolved with Hyperplane ENI)
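A common pattern that exploits warm starts is performing expensive initialisation at module scope, so it is paid for once per execution environment rather than on every invocation. A sketch — the client below is a stand-in for something like a database connection:

```python
def _create_client():
    # Stand-in for expensive one-time setup, e.g. opening a
    # database connection or constructing an SDK client.
    return {"status": "connected"}

# Module scope runs once per execution environment — during the cold start.
CLIENT = _create_client()
invocation_count = 0

def lambda_handler(event, context):
    global invocation_count
    invocation_count += 1
    # Warm invocations in the same environment reuse CLIENT as-is.
    return {"client": CLIENT["status"], "invocation": invocation_count}
```

Because the environment may be recycled at any time, treat this as a cache, never as durable state.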
The Shared Responsibility Model for Serverless
Security responsibilities shift significantly in a serverless model:
| Responsibility | Traditional (EC2) | Serverless (Lambda) |
|---|---|---|
| Physical security | Provider | Provider |
| OS patching | You | Provider |
| Runtime updates | You | Provider |
| Application code | You | You |
| IAM permissions | You | You |
| Data encryption | You | You |
| Input validation | You | You |
You still own application-level security — input validation, least-privilege IAM roles, and encryption at rest and in transit.
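Input validation, for instance, stays in your handler regardless of platform. A minimal sketch — the email check is deliberately simplistic and purely illustrative:

```python
import json

def lambda_handler(event, context):
    # Never trust the incoming payload: parse and validate before acting.
    try:
        payload = json.loads(event.get("body") or "{}")
    except json.JSONDecodeError:
        return {"statusCode": 400,
                "body": json.dumps({"error": "body is not valid JSON"})}

    email = payload.get("email")
    if not isinstance(email, str) or "@" not in email:
        return {"statusCode": 400,
                "body": json.dumps({"error": "missing or invalid email"})}

    return {"statusCode": 200, "body": json.dumps({"accepted": email})}
```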
Summary
- Serverless means the provider manages all infrastructure — you focus purely on code.
- Functions are event-driven, stateless, and billed on a pay-per-use basis.
- AWS Lambda is the core serverless compute service, supported by API Gateway, DynamoDB, S3, SQS, SNS, Step Functions, and EventBridge.
- Cold starts add latency on the first invocation but are mitigated by warm execution environments.
- Serverless is ideal for APIs, event processing, scheduled tasks, and prototyping but may not suit long-running, high-compute, or stateful workloads.
- The shared responsibility model shifts more operational burden to the provider, but application security remains yours.