<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[Blog | Mon Villarin]]></title><description><![CDATA[Showcasing practical insights into cloud technologies, including serverless architecture, full-stack development, and infrastructure as code.]]></description><link>https://blog.monvillarin.com</link><generator>RSS for Node</generator><lastBuildDate>Sat, 18 Apr 2026 03:20:23 GMT</lastBuildDate><atom:link href="https://blog.monvillarin.com/rss.xml" rel="self" type="application/rss+xml"/><language><![CDATA[en]]></language><ttl>60</ttl><item><title><![CDATA[Serverless Video-on-Demand Platform on AWS]]></title><description><![CDATA[Delivering high-quality on-demand video at scale is both a technical and operational challenge. This project — a Terraform-backed AWS Video-on-Demand (VoD) pipeline — demonstrates a pragmatic serverless architecture that balances cost, performance, a...]]></description><link>https://blog.monvillarin.com/serverless-video-on-demand-platform-on-aws</link><guid isPermaLink="true">https://blog.monvillarin.com/serverless-video-on-demand-platform-on-aws</guid><dc:creator><![CDATA[Mon Villarin]]></dc:creator><pubDate>Fri, 17 Oct 2025 09:18:19 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1760682916229/95302ddc-4505-41e2-874c-c7ab9c57362f.png" length="0" type="image/png"/><content:encoded><![CDATA[<p>Delivering high-quality on-demand video at scale is both a technical and operational challenge. This project — a Terraform-backed AWS Video-on-Demand (VoD) pipeline — demonstrates a pragmatic serverless architecture that balances cost, performance, and manageability. Below I walk through the key design choices, how components interact, and why this approach works well for many media workloads.</p>
<h2 id="heading-overview">Overview</h2>
<p>The solution is organized into three workflow stages: Ingest, Process, and Publish. Each stage is implemented with a small set of Lambda functions and wired together by Step Functions. S3 is used for source and output storage; MediaConvert performs encoding; MediaPackage handles packaging for adaptive streaming. A single DynamoDB table tracks workflow state for each video (keyed by a GUID). The IaC is written in Terraform and split into modules for storage, compute, orchestration, messaging and custom resources.</p>
<h2 id="heading-why-this-pattern">Why this pattern?</h2>
<p>Modern VoD pipelines must do three things reliably: (1) accept and validate source assets, (2) transcode and produce multiple adaptive bitrates, and (3) package and publish outputs for clients. Serverless pieces (Lambda + Step Functions) let you represent each logical task as an independently versioned function, which simplifies testing, rollback and incremental improvements. MediaConvert is a managed encoder with native integration to other AWS media services and handles many of the complex codec details for you.</p>
<h3 id="heading-ingest-catching-the-right-event-and-normalizing-input">Ingest: catching the right event and normalizing input</h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1760692481550/e852ea31-444b-41fe-8134-d24b0f675cfe.png" alt class="image--center mx-auto" /></p>
<p>The workflow starts when an object is uploaded to the source S3 bucket. The upload event invokes the <code>step-functions</code> Lambda, which determines whether the event came from a video file or a metadata JSON file. This repo supports two modes:</p>
<ul>
<li><p>Video-triggered — upload a video and the ingest pipeline starts automatically</p>
</li>
<li><p>Metadata-triggered — upload a JSON file referencing a pre-uploaded video for per-video overrides (custom template, frame-capture, archiving, etc.)</p>
</li>
</ul>
<p>This dual-mode design is practical both for operations teams that want manual control (metadata overrides) and for fully automated ingestion (S3-triggered). The <code>input-validate</code> Lambda standardizes variables and sets sensible defaults via environment variables defined by Terraform.</p>
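<p>As a minimal sketch of that normalization step (the function, setting names, and environment variable names here are illustrative, not taken from the repo), Terraform-defined defaults can be merged with per-event overrides:</p>

```python
import os

# Illustrative sketch: merge per-event settings over environment-defined
# defaults, as the input-validate step does. All names are hypothetical.
def build_workflow_settings(event: dict) -> dict:
    defaults = {
        "AcceleratedTranscoding": os.environ.get("ACCELERATED_TRANSCODING", "PREFERRED"),
        "FrameCapture": os.environ.get("FRAME_CAPTURE", "false") == "true",
        "ArchiveSource": os.environ.get("ARCHIVE_SOURCE", "DISABLED"),
    }
    # Per-video metadata (metadata-triggered mode) overrides the defaults.
    defaults.update({k: v for k, v in event.items() if v is not None})
    return defaults
```

<p>Video-triggered uploads pass an empty override set and get pure defaults; metadata-triggered uploads override only the keys present in the JSON file.</p>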
<h3 id="heading-process-profile-select-template-and-encode">Process: profile, select template, and encode</h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1760692507961/046047e6-aece-4cd2-9ebd-29e2b8a83438.png" alt class="image--center mx-auto" /></p>
<p>The <code>mediainfo</code> Lambda extracts technical metadata (frame size, codecs, duration) and stores it on the workflow object. <code>profiler</code> chooses the best output template. Key design choice: the profiler avoids upscaling — it selects the highest template that does not exceed the source resolution. This preserves output quality and reduces unnecessary encoding cost.</p>
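<p>The no-upscaling rule can be sketched in a few lines of Python (the profile heights below are assumed for illustration, not read from the repo's templates):</p>

```python
# Hypothetical profile ladder, expressed as template output heights.
PROFILES = [2160, 1440, 1080, 720, 540, 360, 270]

def select_profile(source_height: int) -> int:
    """Pick the largest profile that does not exceed the source height."""
    eligible = [p for p in PROFILES if p <= source_height]
    # If the source is smaller than every profile, fall back to the smallest.
    return max(eligible) if eligible else min(PROFILES)
```

<p>A 900-pixel-tall source would therefore encode at 720p rather than being upscaled to 1080p, which is exactly the quality/cost trade-off described above.</p>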
<p>Encoding is handled by <code>encode</code>, which uses MediaConvert. Notable features in the implementation:</p>
<ul>
<li><p>Endpoint discovery: code calls MediaConvert DescribeEndpoints to get account-specific endpoints before submitting jobs</p>
</li>
<li><p>Template fallback: if a named job template is missing, the encode function tries <code>_fixed</code> variants and alternate template families (mvod vs qvbr) before failing, improving robustness</p>
</li>
<li><p>Output destination mapping: output groups are copied from the template and their S3 destination paths are adapted per-job (hls/, dash/, cmaf/ folders)</p>
</li>
<li><p>Frame capture (thumbnail generation) can be enabled per-job and writes to a thumbnails/ folder</p>
</li>
</ul>
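<p>The template-fallback behavior can be sketched as a pure function that yields candidate names in order (the naming pattern shown is illustrative, not the repo's actual scheme):</p>

```python
# Assumed sketch of the fallback order described above: try the named
# template, then a "_fixed" variant, then the alternate family (mvod <-> qvbr).
def template_candidates(name: str) -> list:
    candidates = [name, name + "_fixed"]
    if "_mvod_" in name:
        alt = name.replace("_mvod_", "_qvbr_")
    elif "_qvbr_" in name:
        alt = name.replace("_qvbr_", "_mvod_")
    else:
        alt = None
    if alt:
        candidates += [alt, alt + "_fixed"]
    return candidates
```

<p>The encode function would then call <code>GetJobTemplate</code> for each candidate in turn and submit the job with the first one that exists, failing only after the list is exhausted.</p>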
<p>This produces CMAF/HLS/DASH output sets suitable for broad device compatibility. Using a single universal CMAF template simplifies operations while QVBR provides quality/cost trade-offs tuned per-resolution.</p>
<h3 id="heading-publish-validate-archive-and-package">Publish: validate, archive and package</h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1760692519099/be8f68d6-b2cc-4ccf-ad2d-5ff0a10be79d.png" alt class="image--center mx-auto" /></p>
<p>Once MediaConvert completes, the <code>output-validate</code> Lambda verifies the output files and records the result in DynamoDB. If enabled, <code>archive-source</code> tags the original file for lifecycle transition to Glacier (or Deep Archive). When MediaPackage is enabled, <code>media-package-assets</code> ingests the job outputs into MediaPackage VOD and sets up packaging groups (HLS/DASH/CMAF), returning playback endpoints that can be distributed through CloudFront.</p>
<h3 id="heading-operational-considerations">Operational considerations</h3>
<p>State tracking: A single DynamoDB table keyed by <code>guid</code> stores the lifecycle state, metadata, and timestamps for each video. This simplifies querying, retries, and postmortem audits.</p>
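<p>As an illustrative sketch (the attribute names are hypothetical, not the repo's schema), each stage's write to that table can be expressed as a DynamoDB <code>update_item</code> payload built from a dict of fields:</p>

```python
# Build an UpdateExpression payload for the single workflow table keyed by guid.
# Placeholder names (#k / :k) avoid collisions with DynamoDB reserved words.
def build_update(guid: str, fields: dict) -> dict:
    expr = "SET " + ", ".join(f"#{k} = :{k}" for k in fields)
    return {
        "Key": {"guid": guid},
        "UpdateExpression": expr,
        "ExpressionAttributeNames": {f"#{k}": k for k in fields},
        "ExpressionAttributeValues": {f":{k}": v for k, v in fields.items()},
    }
```

<p>Because every stage updates the same item, a single <code>GetItem</code> on the guid returns the full lifecycle for audits.</p>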
<p>Error handling: The repo includes an <code>error-handler</code> Lambda used by other functions; it can update DynamoDB, publish SNS messages and centralize retry or alerting logic.</p>
<h3 id="heading-security-and-least-privilege">Security and least privilege</h3>
<p>IAM roles are scoped for each Lambda with the minimal actions they require: S3 Get/Put, DynamoDB Update, MediaConvert CreateJob/GetJobTemplate, and MediaPackage ingest. Terraform modules consistently apply solution tags (SolutionId = SO0021) and default resource naming patterns to make policy scoping and cost allocation straightforward.</p>
<h3 id="heading-deployment-and-infrastructure">Deployment and infrastructure</h3>
<p>This implementation uses Terraform &gt;= 1.5 and modularized configuration in <code>IaC/modules/</code>. Lambda code lives in <code>IaC/lambda_functions/</code>; the Terraform <code>archive_file</code> data sources build ZIP archives from those folders so deploys can be done with <code>terraform apply</code> directly from the cloned repository (after dependencies are installed).</p>
<p>PowerShell helper scripts in <code>IaC/</code> create MediaConvert templates and MediaPackage resources as needed. The workflow trigger (VideoFile vs MetadataFile) is a deployment-time choice and controls how S3 notifications are wired.</p>
<h3 id="heading-key-operational-knobs">Key operational knobs</h3>
<ul>
<li><p>Accelerated transcoding: options are ENABLED, PREFERRED, and DISABLED; PREFERRED offers a good cost/latency balance</p>
</li>
<li><p>Frame capture: enables thumbnail output during the encode job</p>
</li>
<li><p>Archive policy: integrates with lifecycle to Glacier/Deep Archive for long-term retention and cost savings</p>
</li>
</ul>
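<p>For illustration, here is how the acceleration knob appears in a MediaConvert <code>CreateJob</code> request body (the role ARN is a placeholder and all other fields are elided; this is a sketch of the API shape, not the repo's actual job payload):</p>

```python
# Fragment of a MediaConvert CreateJob request showing the acceleration
# setting; Mode is one of ENABLED, PREFERRED, or DISABLED.
job_request = {
    "Role": "arn:aws:iam::123456789012:role/MediaConvertRole",  # placeholder
    "AccelerationSettings": {"Mode": "PREFERRED"},
    "Settings": {"Inputs": [], "OutputGroups": []},  # elided
}
```

<p>With PREFERRED, MediaConvert accelerates the job when the input is eligible and silently falls back to standard transcoding when it is not.</p>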
<h3 id="heading-testing-and-observability">Testing and observability</h3>
<p>Lambda functions log structured events to CloudWatch. Step Functions give a visual trace for each workflow execution. Metrics (job counts, failures, encoding time) and CloudWatch alarms should be added for production readiness. The project already supports SNS/SQS for downstream notifications, which can feed CI or monitoring pipelines.</p>
<h3 id="heading-business-benefits">Business benefits</h3>
<p>This architecture abstracts the complexity of encoding and packaging behind a reproducible, IaC-managed pipeline. It reduces operational overhead by using managed services (MediaConvert, MediaPackage) while giving engineering teams deterministic control over the workflow via small Lambda components. Cost controls like QVBR, archiving, and optional accelerated transcoding allow you to tune spend against SLA needs.</p>
<h3 id="heading-next-steps-and-improvements">Next steps and improvements</h3>
<ul>
<li><p>CI/CD: add a GitHub Actions pipeline to run Terraform plan/apply with environment-specific backend configs</p>
</li>
<li><p>Unit/integration tests: provide unit tests for Lambdas and integration tests that submit a small job to a sandbox account</p>
</li>
<li><p>Monitoring: add CloudWatch dashboards, metrics, and alerting for failed jobs or encoding backlogs</p>
</li>
<li><p>Security hardening: optional VPC-enabled Lambdas and tighter IAM resource ARNs for MediaPackage/MediaConvert</p>
</li>
</ul>
<h3 id="heading-conclusion">Conclusion</h3>
<p>This repository is a practical, modular starting point for building a scalable VoD pipeline on AWS. The combination of simple Lambdas, careful template selection, and managed media services delivers a reliable, cost-conscious platform suitable for most VOD workloads.</p>
<p>I created a simple frontend to demonstrate how the encoded media plays with adaptive bitrate (ABR) streaming. The player dynamically switches between multiple video resolutions depending on the viewer’s available bandwidth, providing an adaptive streaming experience.</p>
<p>See it in action: <a target="_blank" href="https://vod.monvillarin.com/">https://vod.monvillarin.com</a></p>
<p>(No copyright infringement intended. For educational purposes only.)</p>
<hr />
<p>LinkedIn: <a target="_blank" href="https://www.linkedin.com/in/ramon-villarin/"><strong>linkedin.com/in/ramon-villarin</strong></a></p>
<p>Portfolio Site: <a target="_blank" href="https://monvillarin.com/"><strong>MonVillarin.com</strong></a></p>
<p>GitHub Project Repo: <a target="_blank" href="https://github.com/kurokood/aws-video-on-demand">https://github.com/kurokood/aws-video-on-demand</a></p>
]]></content:encoded></item><item><title><![CDATA[Traditional 3-Tier Website Deployment on AWS: A Real-World Case Study]]></title><description><![CDATA[Introduction
In the world of web application architecture, few patterns have stood the test of time like the traditional 3-tier architecture. This time-tested approach separates applications into three distinct layers: presentation, application, and ...]]></description><link>https://blog.monvillarin.com/traditional-3-tier-website-deployment-on-aws-a-real-world-case-study</link><guid isPermaLink="true">https://blog.monvillarin.com/traditional-3-tier-website-deployment-on-aws-a-real-world-case-study</guid><dc:creator><![CDATA[Mon Villarin]]></dc:creator><pubDate>Wed, 27 Aug 2025 14:08:44 GMT</pubDate><content:encoded><![CDATA[<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1756477354471/91b64e8f-95b4-4249-b315-7b7fe4f3f0c9.png" alt class="image--center mx-auto" /></p>
<h2 id="heading-introduction">Introduction</h2>
<p>In the world of web application architecture, few patterns have stood the test of time like the traditional 3-tier architecture. This time-tested approach separates applications into three distinct layers: presentation, application, and database. Each layer serves a specific purpose and can be scaled, secured, and maintained independently.</p>
<h3 id="heading-what-is-3-tier-architecture">What is 3-Tier Architecture?</h3>
<p>The 3-tier architecture consists of:</p>
<ul>
<li><p><strong>Presentation Layer (Tier 1)</strong>: The user interface and user experience components</p>
</li>
<li><p><strong>Application Layer (Tier 2)</strong>: The business logic and application processing</p>
</li>
<li><p><strong>Database Layer (Tier 3)</strong>: Data storage and management</p>
</li>
</ul>
<p>This separation of concerns provides several key benefits:</p>
<ul>
<li><p><strong>Scalability</strong>: Each tier can be scaled independently based on demand</p>
</li>
<li><p><strong>Security</strong>: Layers can be isolated with different security controls</p>
</li>
<li><p><strong>Maintainability</strong>: Changes to one layer don't necessarily impact others</p>
</li>
<li><p><strong>Flexibility</strong>: Different technologies can be used for each layer</p>
</li>
</ul>
<h3 id="heading-why-aws-for-3-tier-architecture">Why AWS for 3-Tier Architecture?</h3>
<p>Amazon Web Services (AWS) provides an ideal platform for deploying 3-tier architectures due to its:</p>
<ul>
<li><p><strong>Comprehensive Service Portfolio</strong>: From compute (EC2) to managed databases (RDS) to load balancing (ALB)</p>
</li>
<li><p><strong>Global Infrastructure</strong>: Multiple regions and availability zones for high availability</p>
</li>
<li><p><strong>Security Features</strong>: VPCs, security groups, and IAM for granular access control</p>
</li>
<li><p><strong>Managed Services</strong>: Reduce operational overhead with services like RDS and EFS</p>
</li>
<li><p><strong>Cost Optimization</strong>: Pay-as-you-go pricing with various instance types and storage options</p>
</li>
</ul>
<p>In this case study, I'll walk you through how I built a production-ready WordPress hosting infrastructure on AWS using Terraform, implementing a traditional 3-tier architecture that's both secure and scalable.</p>
<hr />
<h2 id="heading-project-overview">Project Overview</h2>
<h3 id="heading-the-challenge">The Challenge</h3>
<p>I needed to create a robust, scalable WordPress hosting solution that could:</p>
<ul>
<li><p><strong>Handle Variable Traffic</strong>: Support both low and high traffic periods</p>
</li>
<li><p><strong>Ensure High Availability</strong>: Minimize downtime through redundancy</p>
</li>
<li><p><strong>Maintain Security</strong>: Protect against common web vulnerabilities</p>
</li>
<li><p><strong>Enable Easy Management</strong>: Allow for straightforward updates and maintenance</p>
</li>
<li><p><strong>Support Growth</strong>: Scale resources as the website grows</p>
</li>
</ul>
<h3 id="heading-project-goals">Project Goals</h3>
<p>The primary objectives for this infrastructure project were:</p>
<ol>
<li><p><strong>Scalability</strong>: Design an architecture that can grow with demand</p>
</li>
<li><p><strong>Security</strong>: Implement defense-in-depth security principles</p>
</li>
<li><p><strong>High Availability</strong>: Deploy across multiple Availability Zones</p>
</li>
<li><p><strong>Cost Efficiency</strong>: Use appropriate instance sizes and managed services</p>
</li>
<li><p><strong>Maintainability</strong>: Create clean, modular infrastructure code</p>
</li>
<li><p><strong>Production-Ready</strong>: Include monitoring, backup, and disaster recovery considerations</p>
</li>
</ol>
<h3 id="heading-technology-stack">Technology Stack</h3>
<ul>
<li><p><strong>Infrastructure as Code</strong>: Terraform for reproducible deployments</p>
</li>
<li><p><strong>Cloud Platform</strong>: Amazon Web Services (AWS)</p>
</li>
<li><p><strong>Application</strong>: WordPress (PHP-based CMS)</p>
</li>
<li><p><strong>Database</strong>: MySQL via Amazon RDS</p>
</li>
<li><p><strong>Web Server</strong>: Apache/Nginx on Amazon Linux 2</p>
</li>
<li><p><strong>Storage</strong>: Amazon EFS for shared file storage</p>
</li>
</ul>
<hr />
<h2 id="heading-architecture-details">Architecture Details</h2>
<p>Let me break down each layer of the architecture and explain the design decisions behind each component.</p>
<h3 id="heading-networking-layer-the-foundation">Networking Layer: The Foundation</h3>
<p>The networking layer forms the foundation of our 3-tier architecture, providing the secure, isolated environment where our application will run.</p>
<h4 id="heading-virtual-private-cloud-vpc">Virtual Private Cloud (VPC)</h4>
<pre><code class="lang-yaml"><span class="hljs-attr">VPC CIDR:</span> <span class="hljs-number">10.0</span><span class="hljs-number">.0</span><span class="hljs-number">.0</span><span class="hljs-string">/16</span>
<span class="hljs-bullet">-</span> <span class="hljs-string">Provides</span> <span class="hljs-string">isolated</span> <span class="hljs-string">network</span> <span class="hljs-string">environment</span>
<span class="hljs-bullet">-</span> <span class="hljs-string">Enables</span> <span class="hljs-string">custom</span> <span class="hljs-string">routing</span> <span class="hljs-string">and</span> <span class="hljs-string">security</span> <span class="hljs-string">policies</span>
<span class="hljs-bullet">-</span> <span class="hljs-string">Supports</span> <span class="hljs-string">both</span> <span class="hljs-string">IPv4</span> <span class="hljs-string">and</span> <span class="hljs-string">IPv6</span> <span class="hljs-string">(if</span> <span class="hljs-string">needed)</span>
</code></pre>
<h4 id="heading-subnet-strategy">Subnet Strategy</h4>
<p>I implemented a multi-AZ subnet strategy for high availability:</p>
<p><strong>Public Subnets (2 AZs)</strong>:</p>
<ul>
<li><p><code>10.0.1.0/24</code> (us-east-1a)</p>
</li>
<li><p><code>10.0.2.0/24</code> (us-east-1b)</p>
</li>
<li><p>Host: Application Load Balancer, NAT Gateways</p>
</li>
<li><p>Direct internet access via Internet Gateway</p>
</li>
</ul>
<p><strong>Private Application Subnets (2 AZs)</strong>:</p>
<ul>
<li><p><code>10.0.11.0/24</code> (us-east-1a)</p>
</li>
<li><p><code>10.0.12.0/24</code> (us-east-1b)</p>
</li>
<li><p>Host: EC2 web servers, EFS mount targets</p>
</li>
<li><p>Internet access via NAT Gateways</p>
</li>
</ul>
<p><strong>Private Database Subnets (2 AZs)</strong>:</p>
<ul>
<li><p><code>10.0.21.0/24</code> (us-east-1a)</p>
</li>
<li><p><code>10.0.22.0/24</code> (us-east-1b)</p>
</li>
<li><p>Host: RDS database instances</p>
</li>
<li><p>No direct internet access</p>
</li>
</ul>
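<p>This subnet plan can be sanity-checked with Python's standard <code>ipaddress</code> module; the sketch below simply verifies that each listed /24 sits inside the VPC's /16 and counts its addresses:</p>

```python
import ipaddress

vpc = ipaddress.ip_network("10.0.0.0/16")
plan = {
    "public": ["10.0.1.0/24", "10.0.2.0/24"],
    "app":    ["10.0.11.0/24", "10.0.12.0/24"],
    "db":     ["10.0.21.0/24", "10.0.22.0/24"],
}
for tier, cidrs in plan.items():
    for cidr in cidrs:
        subnet = ipaddress.ip_network(cidr)
        assert subnet.subnet_of(vpc)  # every /24 sits inside the VPC /16
        # 256 addresses per /24; AWS reserves 5, leaving 251 usable.
        print(tier, cidr, subnet.num_addresses)
```

<p>Keeping the tiers in distinct /24 ranges (1.x, 11.x, 21.x) also makes security group and NACL rules easy to read at a glance.</p>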
<h4 id="heading-internet-connectivity-components">Internet Connectivity Components</h4>
<ul>
<li><p><strong>Internet Gateway</strong>: Provides internet access to public subnets</p>
</li>
<li><p><strong>NAT Gateways</strong>: Enable outbound internet access for private subnets (for updates, patches)</p>
</li>
<li><p><strong>Elastic IPs</strong>: Static IP addresses for NAT Gateways</p>
</li>
<li><p><strong>Route Tables</strong>: Direct traffic flow between subnets and gateways</p>
</li>
</ul>
<p>This networking design ensures that:</p>
<ul>
<li><p>Web servers can receive updates but aren't directly accessible from the internet</p>
</li>
<li><p>Database servers are completely isolated from internet access</p>
</li>
<li><p>Load balancers can distribute traffic from the internet to private web servers</p>
</li>
</ul>
<h3 id="heading-application-layer-the-processing-engine">Application Layer: The Processing Engine</h3>
<p>The application layer handles all business logic and serves as the bridge between users and data.</p>
<h4 id="heading-ec2-instances">EC2 Instances</h4>
<p>I deployed two EC2 instances across different Availability Zones:</p>
<pre><code class="lang-yaml"><span class="hljs-attr">Instance Configuration:</span>
<span class="hljs-bullet">-</span> <span class="hljs-attr">Type:</span> <span class="hljs-string">t3.medium</span> <span class="hljs-string">(2</span> <span class="hljs-string">vCPU,</span> <span class="hljs-number">4</span> <span class="hljs-string">GB</span> <span class="hljs-string">RAM)</span>
<span class="hljs-bullet">-</span> <span class="hljs-attr">AMI:</span> <span class="hljs-string">Amazon</span> <span class="hljs-string">Linux</span> <span class="hljs-number">2</span>
<span class="hljs-bullet">-</span> <span class="hljs-attr">Storage:</span> <span class="hljs-number">20</span> <span class="hljs-string">GB</span> <span class="hljs-string">GP3</span> <span class="hljs-string">EBS</span> <span class="hljs-string">volumes</span>
<span class="hljs-bullet">-</span> <span class="hljs-attr">Placement:</span> <span class="hljs-string">Private</span> <span class="hljs-string">application</span> <span class="hljs-string">subnets</span>
</code></pre>
<p><strong>Why t3.medium?</strong></p>
<ul>
<li><p>Burstable performance for variable WordPress workloads</p>
</li>
<li><p>Cost-effective for small to medium websites</p>
</li>
<li><p>Sufficient resources for WordPress + MySQL client + web server</p>
</li>
</ul>
<h4 id="heading-application-load-balancer-alb">Application Load Balancer (ALB)</h4>
<p>The ALB serves as the entry point for all web traffic:</p>
<p><strong>Features Implemented</strong>:</p>
<ul>
<li><p><strong>Health Checks</strong>: Monitors the target group's health check path (<code>/</code>) on each web server</p>
</li>
<li><p><strong>Cross-AZ Load Balancing</strong>: Distributes traffic across both availability zones</p>
</li>
<li><p><strong>Sticky Sessions</strong>: Can be enabled for applications requiring session affinity</p>
</li>
<li><p><strong>SSL Termination</strong>: Ready for HTTPS certificate attachment</p>
</li>
</ul>
<p><strong>Target Groups</strong>:</p>
<ul>
<li><p>Health check path: <code>/</code></p>
</li>
<li><p>Health check interval: 30 seconds</p>
</li>
<li><p>Healthy threshold: 2 consecutive successful checks</p>
</li>
<li><p>Unhealthy threshold: 5 consecutive failed checks</p>
</li>
</ul>
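<p>The thresholds above translate directly into detection times: roughly check interval times threshold count of consecutive results. A quick arithmetic check:</p>

```python
# Health check timing derived from the target group settings listed above.
interval = 30            # seconds between health checks
healthy_threshold = 2    # consecutive passes to mark healthy
unhealthy_threshold = 5  # consecutive failures to mark unhealthy

time_to_healthy = interval * healthy_threshold      # 60 s of consecutive passes
time_to_unhealthy = interval * unhealthy_threshold  # 150 s of consecutive failures
```

<p>So a recovering instance rejoins rotation in about a minute, while a flapping one is drained only after sustained failure, which avoids premature target removal.</p>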
<h4 id="heading-security-groups-network-level-firewalls">Security Groups: Network-Level Firewalls</h4>
<p>I implemented five distinct security groups following the principle of least privilege:</p>
<p><strong>ALB Security Group</strong>:</p>
<pre><code class="lang-yaml"><span class="hljs-attr">Inbound:</span> <span class="hljs-string">HTTP</span> <span class="hljs-string">(80),</span> <span class="hljs-string">HTTPS</span> <span class="hljs-string">(443)</span> <span class="hljs-string">from</span> <span class="hljs-number">0.0</span><span class="hljs-number">.0</span><span class="hljs-number">.0</span><span class="hljs-string">/0</span>
<span class="hljs-attr">Outbound:</span> <span class="hljs-string">All</span> <span class="hljs-string">traffic</span> <span class="hljs-string">to</span> <span class="hljs-number">0.0</span><span class="hljs-number">.0</span><span class="hljs-number">.0</span><span class="hljs-string">/0</span>
</code></pre>
<p><strong>WebServer Security Group</strong>:</p>
<pre><code class="lang-yaml"><span class="hljs-attr">Inbound:</span> <span class="hljs-string">HTTP</span> <span class="hljs-string">(80),</span> <span class="hljs-string">HTTPS</span> <span class="hljs-string">(443)</span> <span class="hljs-string">from</span> <span class="hljs-string">ALB</span> <span class="hljs-string">Security</span> <span class="hljs-string">Group</span>
<span class="hljs-attr">Inbound:</span> <span class="hljs-string">SSH</span> <span class="hljs-string">(22)</span> <span class="hljs-string">from</span> <span class="hljs-string">SSH</span> <span class="hljs-string">Security</span> <span class="hljs-string">Group</span>
<span class="hljs-attr">Outbound:</span> <span class="hljs-string">All</span> <span class="hljs-string">traffic</span> <span class="hljs-string">to</span> <span class="hljs-number">0.0</span><span class="hljs-number">.0</span><span class="hljs-number">.0</span><span class="hljs-string">/0</span>
</code></pre>
<p><strong>Database Security Group</strong>:</p>
<pre><code class="lang-yaml"><span class="hljs-attr">Inbound:</span> <span class="hljs-string">MySQL</span> <span class="hljs-string">(3306)</span> <span class="hljs-string">from</span> <span class="hljs-string">WebServer</span> <span class="hljs-string">Security</span> <span class="hljs-string">Group</span> <span class="hljs-string">only</span>
<span class="hljs-attr">Outbound:</span> <span class="hljs-string">All</span> <span class="hljs-string">traffic</span> <span class="hljs-string">to</span> <span class="hljs-number">0.0</span><span class="hljs-number">.0</span><span class="hljs-number">.0</span><span class="hljs-string">/0</span>
</code></pre>
<p><strong>EFS Security Group</strong>:</p>
<pre><code class="lang-yaml"><span class="hljs-attr">Inbound:</span> <span class="hljs-string">NFS</span> <span class="hljs-string">(2049)</span> <span class="hljs-string">from</span> <span class="hljs-string">WebServer</span> <span class="hljs-string">Security</span> <span class="hljs-string">Group</span>
<span class="hljs-attr">Inbound:</span> <span class="hljs-string">NFS</span> <span class="hljs-string">(2049)</span> <span class="hljs-string">from</span> <span class="hljs-string">self</span> <span class="hljs-string">(for</span> <span class="hljs-string">mount</span> <span class="hljs-string">targets)</span>
<span class="hljs-attr">Outbound:</span> <span class="hljs-string">All</span> <span class="hljs-string">traffic</span> <span class="hljs-string">to</span> <span class="hljs-number">0.0</span><span class="hljs-number">.0</span><span class="hljs-number">.0</span><span class="hljs-string">/0</span>
</code></pre>
<p><strong>SSH Security Group</strong>:</p>
<pre><code class="lang-yaml"><span class="hljs-attr">Inbound:</span> <span class="hljs-string">SSH</span> <span class="hljs-string">(22)</span> <span class="hljs-string">from</span> <span class="hljs-number">0.0</span><span class="hljs-number">.0</span><span class="hljs-number">.0</span><span class="hljs-string">/0</span> <span class="hljs-string">(restrict</span> <span class="hljs-string">in</span> <span class="hljs-string">production)</span>
<span class="hljs-attr">Outbound:</span> <span class="hljs-string">All</span> <span class="hljs-string">traffic</span> <span class="hljs-string">to</span> <span class="hljs-number">0.0</span><span class="hljs-number">.0</span><span class="hljs-number">.0</span><span class="hljs-string">/0</span>
</code></pre>
<h3 id="heading-database-layer-the-data-foundation">Database Layer: The Data Foundation</h3>
<p>The database layer provides persistent, reliable data storage for the WordPress application.</p>
<h4 id="heading-amazon-rds-mysql">Amazon RDS MySQL</h4>
<p>I chose RDS over self-managed MySQL for several reasons:</p>
<p><strong>Configuration</strong>:</p>
<pre><code class="lang-yaml"><span class="hljs-attr">Engine:</span> <span class="hljs-string">MySQL</span> <span class="hljs-number">8.0</span>
<span class="hljs-attr">Instance Class:</span> <span class="hljs-string">db.t3.micro</span>
<span class="hljs-attr">Storage:</span> <span class="hljs-number">20</span> <span class="hljs-string">GB</span> <span class="hljs-string">GP2</span> <span class="hljs-string">(expandable)</span>
<span class="hljs-attr">Multi-AZ:</span> <span class="hljs-string">Enabled</span> <span class="hljs-string">for</span> <span class="hljs-string">production</span>
<span class="hljs-attr">Backup Retention:</span> <span class="hljs-number">7</span> <span class="hljs-string">days</span>
<span class="hljs-attr">Maintenance Window:</span> <span class="hljs-string">Sunday</span> <span class="hljs-number">3</span><span class="hljs-string">:00-4:00</span> <span class="hljs-string">AM</span> <span class="hljs-string">UTC</span>
</code></pre>
<p><strong>Benefits of RDS</strong>:</p>
<ul>
<li><p><strong>Automated Backups</strong>: Point-in-time recovery up to 35 days</p>
</li>
<li><p><strong>Multi-AZ Deployment</strong>: Automatic failover for high availability</p>
</li>
<li><p><strong>Automated Patching</strong>: OS and database patches applied automatically</p>
</li>
<li><p><strong>Monitoring</strong>: CloudWatch metrics and Performance Insights</p>
</li>
<li><p><strong>Security</strong>: Encryption at rest and in transit options</p>
</li>
</ul>
<h4 id="heading-amazon-efs-shared-file-storage">Amazon EFS: Shared File Storage</h4>
<p>WordPress requires shared storage for themes, plugins, and media uploads when running multiple instances.</p>
<p><strong>EFS Configuration</strong>:</p>
<pre><code class="lang-yaml"><span class="hljs-attr">Performance Mode:</span> <span class="hljs-string">General</span> <span class="hljs-string">Purpose</span>
<span class="hljs-attr">Throughput Mode:</span> <span class="hljs-string">Provisioned</span> <span class="hljs-string">(if</span> <span class="hljs-string">needed)</span>
<span class="hljs-attr">Storage Class:</span> <span class="hljs-string">Standard</span>
<span class="hljs-attr">Encryption:</span> <span class="hljs-string">At</span> <span class="hljs-string">rest</span> <span class="hljs-string">and</span> <span class="hljs-string">in</span> <span class="hljs-string">transit</span>
<span class="hljs-attr">Mount Targets:</span> <span class="hljs-string">One</span> <span class="hljs-string">per</span> <span class="hljs-string">AZ</span> <span class="hljs-string">in</span> <span class="hljs-string">private</span> <span class="hljs-string">subnets</span>
</code></pre>
<p><strong>Why EFS over EBS</strong>:</p>
<ul>
<li><p><strong>Shared Access</strong>: Multiple EC2 instances can mount simultaneously</p>
</li>
<li><p><strong>Automatic Scaling</strong>: Storage grows and shrinks automatically</p>
</li>
<li><p><strong>High Availability</strong>: Built-in redundancy across AZs</p>
</li>
<li><p><strong>POSIX Compliance</strong>: Works seamlessly with WordPress file operations</p>
</li>
</ul>
<hr />
<h2 id="heading-terraform-implementation">Terraform Implementation</h2>
<p>One of the key decisions in this project was organizing the Terraform code into reusable modules rather than creating all resources in a single configuration file.</p>
<h3 id="heading-module-structure-strategy">Module Structure Strategy</h3>
<p>I organized the infrastructure into five distinct modules:</p>
<pre><code class="lang-yaml"><span class="hljs-string">modules/</span>
<span class="hljs-string">├──</span> <span class="hljs-string">networking/</span>     <span class="hljs-comment"># VPC, subnets, gateways, routing</span>
<span class="hljs-string">├──</span> <span class="hljs-string">security/</span>       <span class="hljs-comment"># Security groups and network ACLs</span>
<span class="hljs-string">├──</span> <span class="hljs-string">database/</span>       <span class="hljs-comment"># RDS instance and subnet groups</span>
<span class="hljs-string">├──</span> <span class="hljs-string">storage/</span>        <span class="hljs-comment"># EFS file system and mount targets</span>
<span class="hljs-string">└──</span> <span class="hljs-string">compute/</span>        <span class="hljs-comment"># EC2 instances, ALB, target groups</span>
</code></pre>
<h3 id="heading-benefits-of-modular-approach">Benefits of Modular Approach</h3>
<p><strong>1. Reusability</strong></p>
<pre><code class="lang-hcl"># Can be reused across environments
module "networking" {
  source = "./modules/networking"

  environment = "production"  # or "staging", "dev"
  vpc_cidr    = "10.0.0.0/16"
  region      = "us-east-1"
}
</code></pre>
<p><strong>2. Maintainability</strong></p>
<ul>
<li><p>Each module has a single responsibility</p>
</li>
<li><p>Changes to networking don't affect database configuration</p>
</li>
<li><p>Easier to troubleshoot and debug issues</p>
</li>
</ul>
<p><strong>3. Testing</strong></p>
<ul>
<li><p>Individual modules can be tested in isolation</p>
</li>
<li><p>Faster development cycles</p>
</li>
<li><p>Reduced blast radius for changes</p>
</li>
</ul>
<p><strong>4. Team Collaboration</strong></p>
<ul>
<li><p>Different team members can work on different modules</p>
</li>
<li><p>Clear ownership boundaries</p>
</li>
<li><p>Easier code reviews</p>
</li>
</ul>
<h3 id="heading-variable-management-strategy">Variable Management Strategy</h3>
<p>Each module validates its input variables so bad values fail fast at plan time. For example:</p>
<pre><code class="lang-hcl">variable "vpc_cidr" {
  description = "CIDR block for VPC"
  type        = string
  validation {
    condition     = can(cidrhost(var.vpc_cidr, 0))
    error_message = "VPC CIDR must be a valid IPv4 CIDR block."
  }
}
</code></pre>
<h3 id="heading-resource-naming-convention">Resource Naming Convention</h3>
<p>I implemented a consistent naming strategy across all resources:</p>
<pre><code class="lang-yaml"><span class="hljs-attr">Format:</span> {<span class="hljs-string">environment</span>}<span class="hljs-string">-{project}-{resource-type}</span>
<span class="hljs-attr">Examples:</span>
<span class="hljs-bullet">-</span> <span class="hljs-string">dev-wordpress-vpc</span>
<span class="hljs-bullet">-</span> <span class="hljs-string">prod-wordpress-alb-sg</span>
<span class="hljs-bullet">-</span> <span class="hljs-string">staging-wordpress-rds</span>
</code></pre>
<p>This naming convention provides:</p>
<ul>
<li><p><strong>Environment Identification</strong>: Clear separation between dev/staging/prod</p>
</li>
<li><p><strong>Resource Grouping</strong>: Easy filtering in AWS console</p>
</li>
<li><p><strong>Cost Tracking</strong>: Simplified cost allocation by environment</p>
</li>
</ul>
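A convention like this is easiest to keep consistent when the prefix is built in one place rather than repeated per resource. The sketch below is illustrative only; the `name_prefix` local and the variable names are assumptions, not code from the repository:

```hcl
# Illustrative sketch: derive every resource name from one {environment}-{project} prefix
locals {
  name_prefix = "${var.environment}-${var.project}"
}

resource "aws_vpc" "main" {
  cidr_block = var.vpc_cidr

  tags = {
    Name = "${local.name_prefix}-vpc" # e.g. "dev-wordpress-vpc"
  }
}
```

Deriving names from a single local means a renamed project or a new environment touches one line instead of every resource block.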
<hr />
<h2 id="heading-deployment-process">Deployment Process</h2>
<p>The deployment process is designed to be straightforward and repeatable across different environments.</p>
<h3 id="heading-prerequisites-setup">Prerequisites Setup</h3>
<p>Before deploying, ensure you have:</p>
<pre><code class="lang-bash"><span class="hljs-comment"># Install Terraform</span>
curl -fsSL https://apt.releases.hashicorp.com/gpg | sudo apt-key add -
sudo apt-add-repository <span class="hljs-string">"deb [arch=amd64] https://apt.releases.hashicorp.com <span class="hljs-subst">$(lsb_release -cs)</span> main"</span>
sudo apt-get update &amp;&amp; sudo apt-get install terraform

<span class="hljs-comment"># Configure AWS CLI</span>
aws configure
<span class="hljs-comment"># Enter your Access Key ID, Secret Access Key, Region, and Output format</span>
</code></pre>
<h3 id="heading-step-by-step-deployment">Step-by-Step Deployment</h3>
<p><strong>1. Clone and Initialize</strong></p>
<pre><code class="lang-bash">git <span class="hljs-built_in">clone</span> &lt;repository-url&gt;
<span class="hljs-built_in">cd</span> wordpress-aws-infrastructure
terraform init
</code></pre>
<p>The <code>terraform init</code> command:</p>
<ul>
<li><p>Downloads required provider plugins (AWS)</p>
</li>
<li><p>Initializes the backend for state storage</p>
</li>
<li><p>Prepares the working directory</p>
</li>
</ul>
<p><strong>2. Plan the Deployment</strong></p>
<pre><code class="lang-bash">terraform plan -out=tfplan
</code></pre>
<p>This command:</p>
<ul>
<li><p>Shows exactly what resources will be created</p>
</li>
<li><p>Validates the configuration syntax</p>
</li>
<li><p>Checks for potential issues before applying</p>
</li>
</ul>
<p><strong>3. Apply the Infrastructure</strong></p>
<pre><code class="lang-bash">terraform apply tfplan
</code></pre>
<p>The apply process typically takes 10-15 minutes and creates approximately 25-30 AWS resources.</p>
<p><strong>4. Verify Deployment</strong></p>
<pre><code class="lang-bash"><span class="hljs-comment"># Check ALB DNS name</span>
terraform output alb_dns_name

<span class="hljs-comment"># Test connectivity</span>
curl http://$(terraform output -raw alb_dns_name)
</code></pre>
<h3 id="heading-environment-specific-deployments">Environment-Specific Deployments</h3>
<p>For different environments, modify the local variables in <code>main.tf</code>:</p>
<pre><code class="lang-hcl"># Development Environment
locals {
  environment = "dev"
  project     = "wordpress"
}

# Production Environment
locals {
  environment = "prod"
  project     = "wordpress"
}
</code></pre>
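Editing tracked code per environment works, but a variable file per environment avoids modifying version-controlled files on every switch. This is a hedged alternative; the file names below are assumptions, not files from the repository:

```hcl
# prod.tfvars (hypothetical file name)
environment = "prod"
project     = "wordpress"

# Then select it at plan time:
#   terraform plan -var-file=prod.tfvars -out=tfplan
```

The same root configuration then serves every environment, with the differences isolated to one small, reviewable file.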
<hr />
<h2 id="heading-challenges-amp-lessons-learned">Challenges &amp; Lessons Learned</h2>
<p>Building this infrastructure taught me several valuable lessons about AWS networking, security, and Terraform best practices.</p>
<h3 id="heading-challenge-1-nat-gateway-vs-internet-gateway-routing">Challenge 1: NAT Gateway vs Internet Gateway Routing</h3>
<p><strong>The Problem</strong>: Initially, I struggled with understanding when to use NAT Gateways versus Internet Gateways and how to properly configure route tables.</p>
<p><strong>The Solution</strong>:</p>
<ul>
<li><p><strong>Internet Gateway</strong>: Provides bidirectional internet access for public subnets</p>
</li>
<li><p><strong>NAT Gateway</strong>: Provides outbound-only internet access for private subnets</p>
</li>
</ul>
<p><strong>Route Table Configuration</strong>:</p>
<pre><code class="lang-hcl"># Public subnet route table
resource "aws_route" "public_internet_access" {
  route_table_id         = aws_route_table.public.id
  destination_cidr_block = "0.0.0.0/0"
  gateway_id             = aws_internet_gateway.main.id
}

# Private subnet route table
resource "aws_route" "private_internet_access" {
  route_table_id         = aws_route_table.private.id
  destination_cidr_block = "0.0.0.0/0"
  nat_gateway_id         = aws_nat_gateway.main.id
}
</code></pre>
<p><strong>Lesson Learned</strong>: Draw network diagrams before implementing. Understanding traffic flow is crucial for proper routing configuration.</p>
<h3 id="heading-challenge-2-database-connectivity-from-private-subnets">Challenge 2: Database Connectivity from Private Subnets</h3>
<p><strong>The Problem</strong>: EC2 instances in private subnets couldn't connect to the RDS database, even though both tiers sat in private subnets inside the same VPC.</p>
<p><strong>The Root Cause</strong>: Security group rules weren't properly configured to allow MySQL traffic between the web servers and database.</p>
<p><strong>The Solution</strong>:</p>
<pre><code class="lang-hcl"># Database security group allows MySQL from web servers
resource "aws_security_group_rule" "database_mysql_from_webserver" {
  type                     = "ingress"
  from_port                = 3306
  to_port                  = 3306
  protocol                 = "tcp"
  source_security_group_id = aws_security_group.webserver.id
  security_group_id        = aws_security_group.database.id
}
</code></pre>
<p><strong>Lesson Learned</strong>: Security groups act as virtual firewalls. Always test connectivity between tiers and use security group references instead of CIDR blocks for internal communication.</p>
<h3 id="heading-challenge-3-efs-mount-target-placement">Challenge 3: EFS Mount Target Placement</h3>
<p><strong>The Problem</strong>: EFS mount targets were initially created in public subnets, causing connectivity issues from EC2 instances in private subnets.</p>
<p><strong>The Solution</strong>: Mount targets must be in the same subnets as the EC2 instances that will access them:</p>
<pre><code class="lang-hcl">resource "aws_efs_mount_target" "main" {
  count           = length(var.private_app_subnet_ids)
  file_system_id  = aws_efs_file_system.main.id
  subnet_id       = var.private_app_subnet_ids[count.index]
  security_groups = [var.efs_security_group_id]
}
</code></pre>
<p><strong>Lesson Learned</strong>: Understand AWS service networking requirements. Not all services work the same way across subnets.</p>
<h3 id="heading-challenge-4-managing-terraform-state">Challenge 4: Managing Terraform State</h3>
<p><strong>The Problem</strong>: Initially stored Terraform state locally, which caused issues when working from different machines and made collaboration difficult.</p>
<p><strong>The Solution</strong>: Implemented remote state storage with S3 backend:</p>
<pre><code class="lang-hcl">terraform {
  backend "s3" {
    bucket = "my-terraform-state-bucket"
    key    = "wordpress/terraform.tfstate"
    region = "us-east-1"
  }
}
</code></pre>
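When several people (or a CI pipeline) share that state, adding DynamoDB-based state locking to the same backend prevents two applies from running concurrently. A sketch, assuming a pre-created lock table named `terraform-state-locks` with a `LockID` string hash key:

```hcl
terraform {
  backend "s3" {
    bucket         = "my-terraform-state-bucket"
    key            = "wordpress/terraform.tfstate"
    region         = "us-east-1"
    encrypt        = true
    dynamodb_table = "terraform-state-locks" # assumed name; the table must exist already
  }
}
```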
<p><strong>Lesson Learned</strong>: Always use remote state storage for any infrastructure that will be maintained long-term or by multiple people.</p>
<h3 id="heading-challenge-5-resource-dependencies">Challenge 5: Resource Dependencies</h3>
<p><strong>The Problem</strong>: Terraform sometimes tried to create resources before their dependencies were ready, causing deployment failures.</p>
<p><strong>The Solution</strong>: Explicit dependency management:</p>
<pre><code class="lang-hcl">module "compute" {
  source = "./modules/compute"

  # ... other variables ...

  depends_on = [
    module.networking,
    module.security,
    module.database,
    module.storage
  ]
}
</code></pre>
<p><strong>Lesson Learned</strong>: While Terraform is good at inferring dependencies from resource references, explicit <code>depends_on</code> declarations (supported on module blocks since Terraform 0.13) prevent race conditions in complex deployments. Prefer implicit references where possible, since a module-wide <code>depends_on</code> makes every resource in the module wait on the entire dependency.</p>
<hr />
<h2 id="heading-conclusion">Conclusion</h2>
<p>Building this traditional 3-tier WordPress infrastructure on AWS using Terraform has been an invaluable learning experience that demonstrates the power and flexibility of cloud-native architectures.</p>
<h3 id="heading-key-benefits-of-this-approach">Key Benefits of This Approach</h3>
<p><strong>1. Proven Architecture Pattern</strong> The 3-tier architecture has been battle-tested in enterprise environments for decades. It provides:</p>
<ul>
<li><p>Clear separation of concerns</p>
</li>
<li><p>Independent scaling capabilities</p>
</li>
<li><p>Well-understood security boundaries</p>
</li>
<li><p>Straightforward troubleshooting paths</p>
</li>
</ul>
<p><strong>2. AWS Managed Services Integration</strong> By leveraging AWS managed services like RDS and EFS, we achieved:</p>
<ul>
<li><p>Reduced operational overhead</p>
</li>
<li><p>Built-in high availability and backup capabilities</p>
</li>
<li><p>Automatic security patching</p>
</li>
<li><p>Cost optimization through right-sizing</p>
</li>
</ul>
<p><strong>3. Infrastructure as Code Benefits</strong> Using Terraform provided:</p>
<ul>
<li><p>Reproducible deployments across environments</p>
</li>
<li><p>Version-controlled infrastructure changes</p>
</li>
<li><p>Automated resource provisioning</p>
</li>
<li><p>Consistent configuration management</p>
</li>
</ul>
<p><strong>4. Security Best Practices</strong> The implementation follows AWS security best practices:</p>
<ul>
<li><p>Defense in depth with multiple security layers</p>
</li>
<li><p>Principle of least privilege for access controls</p>
</li>
<li><p>Network isolation between tiers</p>
</li>
<li><p>Encrypted data storage and transmission</p>
</li>
</ul>
<h3 id="heading-when-to-use-this-architecture">When to Use This Architecture</h3>
<p>This traditional 3-tier approach is ideal for:</p>
<ul>
<li><p><strong>Legacy Application Migrations</strong>: Moving existing applications to the cloud</p>
</li>
<li><p><strong>Predictable Workloads</strong>: Applications with consistent traffic patterns</p>
</li>
<li><p><strong>Compliance Requirements</strong>: Environments requiring specific security controls</p>
</li>
<li><p><strong>Team Familiarity</strong>: Organizations with traditional infrastructure expertise</p>
</li>
<li><p><strong>Cost Predictability</strong>: Workloads where reserved instances provide cost benefits</p>
</li>
</ul>
<h3 id="heading-limitations-and-considerations">Limitations and Considerations</h3>
<p>However, this approach may not be optimal for:</p>
<ul>
<li><p><strong>Highly Variable Traffic</strong>: Serverless might be more cost-effective</p>
</li>
<li><p><strong>Microservices</strong>: Container orchestration platforms like EKS might be better</p>
</li>
<li><p><strong>Global Applications</strong>: CDN and edge computing solutions should be considered</p>
</li>
<li><p><strong>Event-Driven Workloads</strong>: Lambda and event-driven architectures might be more suitable</p>
</li>
</ul>
<h3 id="heading-next-steps-and-evolution">Next Steps and Evolution</h3>
<p>While this 3-tier architecture serves as an excellent foundation, there are several directions for future enhancement:</p>
<p><strong>1. Containerization</strong></p>
<pre><code class="lang-yaml"><span class="hljs-attr">Current:</span> <span class="hljs-string">EC2</span> <span class="hljs-string">instances</span> <span class="hljs-string">with</span> <span class="hljs-string">traditional</span> <span class="hljs-string">deployment</span>
<span class="hljs-attr">Future:</span> <span class="hljs-string">ECS</span> <span class="hljs-string">or</span> <span class="hljs-string">EKS</span> <span class="hljs-string">with</span> <span class="hljs-string">containerized</span> <span class="hljs-string">WordPress</span>
<span class="hljs-attr">Benefits:</span> <span class="hljs-string">Better</span> <span class="hljs-string">resource</span> <span class="hljs-string">utilization,</span> <span class="hljs-string">easier</span> <span class="hljs-string">scaling,</span> <span class="hljs-string">improved</span> <span class="hljs-string">deployment</span> <span class="hljs-string">processes</span>
</code></pre>
<p><strong>2. Serverless Components</strong></p>
<pre><code class="lang-yaml"><span class="hljs-attr">Current:</span> <span class="hljs-string">Always-on</span> <span class="hljs-string">EC2</span> <span class="hljs-string">instances</span>
<span class="hljs-attr">Future:</span> <span class="hljs-string">Lambda</span> <span class="hljs-string">functions</span> <span class="hljs-string">for</span> <span class="hljs-string">specific</span> <span class="hljs-string">tasks</span> <span class="hljs-string">(image</span> <span class="hljs-string">processing,</span> <span class="hljs-string">backups)</span>
<span class="hljs-attr">Benefits:</span> <span class="hljs-string">Pay-per-use</span> <span class="hljs-string">pricing,</span> <span class="hljs-string">automatic</span> <span class="hljs-string">scaling,</span> <span class="hljs-string">reduced</span> <span class="hljs-string">operational</span> <span class="hljs-string">overhead</span>
</code></pre>
<p><strong>3. Advanced Monitoring and Observability</strong></p>
<pre><code class="lang-yaml"><span class="hljs-attr">Current:</span> <span class="hljs-string">Basic</span> <span class="hljs-string">CloudWatch</span> <span class="hljs-string">metrics</span>
<span class="hljs-attr">Future:</span> <span class="hljs-string">Comprehensive</span> <span class="hljs-string">monitoring</span> <span class="hljs-string">with</span> <span class="hljs-string">CloudWatch,</span> <span class="hljs-string">X-Ray,</span> <span class="hljs-string">and</span> <span class="hljs-string">custom</span> <span class="hljs-string">dashboards</span>
<span class="hljs-attr">Benefits:</span> <span class="hljs-string">Better</span> <span class="hljs-string">performance</span> <span class="hljs-string">insights,</span> <span class="hljs-string">proactive</span> <span class="hljs-string">issue</span> <span class="hljs-string">detection,</span> <span class="hljs-string">improved</span> <span class="hljs-string">troubleshooting</span>
</code></pre>
<p><strong>4. CI/CD Pipeline Integration</strong></p>
<pre><code class="lang-yaml"><span class="hljs-attr">Current:</span> <span class="hljs-string">Manual</span> <span class="hljs-string">Terraform</span> <span class="hljs-string">deployments</span>
<span class="hljs-attr">Future:</span> <span class="hljs-string">Automated</span> <span class="hljs-string">deployments</span> <span class="hljs-string">with</span> <span class="hljs-string">GitHub</span> <span class="hljs-string">Actions</span> <span class="hljs-string">or</span> <span class="hljs-string">AWS</span> <span class="hljs-string">CodePipeline</span>
<span class="hljs-attr">Benefits:</span> <span class="hljs-string">Faster</span> <span class="hljs-string">deployment</span> <span class="hljs-string">cycles,</span> <span class="hljs-string">reduced</span> <span class="hljs-string">human</span> <span class="hljs-string">error,</span> <span class="hljs-string">consistent</span> <span class="hljs-string">environments</span>
</code></pre>
<p><strong>5. Multi-Region Deployment</strong></p>
<pre><code class="lang-yaml"><span class="hljs-attr">Current:</span> <span class="hljs-string">Single</span> <span class="hljs-string">region</span> <span class="hljs-string">deployment</span>
<span class="hljs-attr">Future:</span> <span class="hljs-string">Multi-region</span> <span class="hljs-string">setup</span> <span class="hljs-string">with</span> <span class="hljs-string">Route</span> <span class="hljs-number">53</span> <span class="hljs-string">health</span> <span class="hljs-string">checks</span>
<span class="hljs-attr">Benefits:</span> <span class="hljs-string">Disaster</span> <span class="hljs-string">recovery,</span> <span class="hljs-string">improved</span> <span class="hljs-string">global</span> <span class="hljs-string">performance,</span> <span class="hljs-string">higher</span> <span class="hljs-string">availability</span>
</code></pre>
<h3 id="heading-final-thoughts">Final Thoughts</h3>
<p>The traditional 3-tier architecture remains a solid choice for many web applications, especially when implemented with modern cloud services and infrastructure as code practices. This project demonstrates that you don't always need the latest serverless or microservices architecture to build robust, scalable applications.</p>
<p>The key is understanding your requirements, constraints, and team capabilities, then choosing the architecture that best fits your specific situation. Sometimes, the tried-and-true approach is exactly what you need.</p>
<p>Whether you're migrating legacy applications to the cloud, building new traditional web applications, or simply learning cloud architecture patterns, the 3-tier approach provides a solid foundation that can evolve with your needs over time.</p>
<p>The infrastructure code and deployment process I've shared here can serve as a starting point for your own projects, and the lessons learned can help you avoid common pitfalls when building similar architectures.</p>
<p>Remember: great architecture isn't about using the newest technology—it's about solving real problems with reliable, maintainable, and cost-effective solutions.</p>
<hr />
<p><em>Have you built similar architectures? What challenges did you face? I'd love to hear about your experiences in the comments below.</em></p>
<p><strong>Tags</strong>: #AWS #Terraform #3TierArchitecture #WordPress #CloudInfrastructure #InfrastructureAsCode #DevOps</p>
<hr />
<p>LinkedIn: <a target="_blank" href="https://www.linkedin.com/in/ramon-villarin/"><strong>linkedin.com/in/ramon-villarin</strong></a></p>
<p>Portfolio Site: <a target="_blank" href="https://monvillarin.com/"><strong>MonVillarin.com</strong></a></p>
<p>GitHub Project Repo: <a target="_blank" href="https://github.com/kurokood/traditional_3_tier_website_deployment_on_aws">https://github.com/kurokood/traditional_3_tier_website_deployment_on_aws</a></p>
]]></content:encoded></item><item><title><![CDATA[Real-Time Stock Market Data Analytics Pipeline on AWS with Terraform]]></title><description><![CDATA[Modern businesses succeed when they can turn fresh data into action. Markets move quickly, and the sooner you can detect a pattern, the faster you can respond. This project demonstrates a lean, production-friendly approach to real-time analytics on A...]]></description><link>https://blog.monvillarin.com/real-time-stock-market-data-analytics-pipeline-on-aws-with-terraform</link><guid isPermaLink="true">https://blog.monvillarin.com/real-time-stock-market-data-analytics-pipeline-on-aws-with-terraform</guid><dc:creator><![CDATA[Mon Villarin]]></dc:creator><pubDate>Thu, 21 Aug 2025 09:47:50 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1755844280770/987b20c6-9083-49db-b682-6c0da8698327.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Modern businesses succeed when they can turn fresh data into action. Markets move quickly, and the sooner you can detect a pattern, the faster you can respond. This project demonstrates a lean, production-friendly approach to real-time analytics on AWS: ingest stock ticks, process them immediately, archive raw events for historical analysis, compute trends, and make the results queryable with SQL. Everything is defined as code with Terraform modules, so it is easy to deploy, reason about, and evolve.</p>
<p>This post explains how the project is built, how each component works and interacts with others, why the architecture is cost efficient, and how organizations can benefit from it.</p>
<h2 id="heading-what-we-built">What We Built</h2>
<p>At a high level, the pipeline consists of:</p>
<ul>
<li><p>A lightweight producer script that writes stock ticks to Amazon Kinesis Data Streams.</p>
</li>
<li><p>A Lambda consumer that validates and transforms records, saves curated data in DynamoDB, and archives raw JSON to Amazon S3.</p>
</li>
<li><p>A trend-analysis Lambda that listens to DynamoDB Streams, computes simple moving averages (SMAs), and publishes alerts via Amazon SNS.</p>
</li>
<li><p>An AWS Glue Catalog database and table that make raw data in S3 discoverable and queryable by Amazon Athena.</p>
</li>
<li><p>Small Terraform modules for each AWS component, assembled in a clear, hardcoded root configuration.</p>
</li>
</ul>
<p>The result is an end-to-end, serverless analytics stack that scales with traffic, keeps costs tied to usage, and provides both real-time and historical paths for analysis.</p>
<h2 id="heading-components-and-how-they-work-together">Components and How They Work Together</h2>
<h3 id="heading-ingestion-amazon-kinesis-data-streams">Ingestion: Amazon Kinesis Data Streams</h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1755769178309/773ba758-d363-40e1-aac8-a8a01ba94e3e.png" alt class="image--center mx-auto" /></p>
<p>Data ingestion is handled by Amazon Kinesis Data Streams. Kinesis provides a durable, scalable, ordered log for events. In this project we use a single shard, which supports up to 1,000 records per second or 1 MB per second of writes. If your throughput grows, you can scale horizontally by adding shards.</p>
<p>A small Python program, <code>producer_data_function.py</code>, fetches data for a symbol (AAPL by default) using the yfinance library. When real market data is unavailable, it generates realistic mock data so the pipeline can be demonstrated offline. The producer publishes a compact JSON document including fields like symbol, open, high, low, price, previous_close, volume, a source flag, and an ISO 8601 timestamp. It sends a new record every 30 seconds.</p>
<p>The producer reads the stream name from an environment variable <code>KINESIS_STREAM_NAME</code> (defaulting to <code>stock-market-stream</code>). That makes it simple to point the producer to different streams without changing code.</p>
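Putting the two paragraphs above together, the core of the producer can be sketched in Python. Everything below is an illustrative reconstruction from the description (the function and field names are assumptions; the actual `producer_data_function.py` may differ):

```python
import json
import os
from datetime import datetime, timezone

# Stream name comes from the environment, as described above
STREAM_NAME = os.environ.get("KINESIS_STREAM_NAME", "stock-market-stream")


def build_record(symbol, open_, high, low, price, previous_close, volume, source="mock"):
    """Assemble the compact JSON document described above (field names assumed)."""
    return {
        "symbol": symbol,
        "open": open_,
        "high": high,
        "low": low,
        "price": price,
        "previous_close": previous_close,
        "volume": volume,
        "source": source,  # flags real vs. mock data
        "timestamp": datetime.now(timezone.utc).isoformat(),  # ISO 8601
    }


def publish(kinesis_client, record):
    """One PutRecord call; partitioning by symbol keeps a symbol's ticks ordered."""
    return kinesis_client.put_record(
        StreamName=STREAM_NAME,
        Data=json.dumps(record).encode("utf-8"),
        PartitionKey=record["symbol"],
    )
```

A symbol-based partition key keeps each ticker's events ordered within a shard, which matters once the stream scales past one shard.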
<h3 id="heading-real-time-processing-lambda-consumer-for-kinesis">Real-Time Processing: Lambda Consumer for Kinesis</h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1755769302009/b1a257da-5349-4dba-9cea-25b0d07986e7.png" alt class="image--center mx-auto" /></p>
<p>The first AWS Lambda function, <code>ConsumerStockData</code>, is connected to the Kinesis stream via an event source mapping. When new records arrive, Kinesis batches them (batch size is configurable, 2 in this example) and invokes the function. The function:</p>
<ol>
<li><p>Decodes and validates each JSON payload, ensuring required fields like symbol, price, and timestamp are present and well typed.</p>
</li>
<li><p>Archives the raw event in S3 under a logical, time-based path: <code>raw/YYYY/MM/DD/HH/...</code>. This provides a natural partitioning scheme for later analytics.</p>
</li>
<li><p>Writes a curated item to DynamoDB containing the symbol, timestamp, and price, plus optional attributes such as volume or exchange.</p>
</li>
</ol>
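The validation and the time-partitioned S3 key from steps 1 and 2 can be sketched like this; it is a reconstruction under assumed field names, not the deployed <code>ConsumerStockData</code> code:

```python
import uuid
from datetime import datetime

# Required fields and their expected types (assumed from the description)
REQUIRED_FIELDS = {"symbol": str, "price": (int, float), "timestamp": str}


def validate(payload: dict) -> dict:
    """Ensure required fields are present and well typed, as in step 1."""
    for field, expected in REQUIRED_FIELDS.items():
        if field not in payload:
            raise ValueError(f"missing field: {field}")
        if not isinstance(payload[field], expected):
            raise ValueError(f"bad type for field: {field}")
    return payload


def raw_s3_key(timestamp_iso: str) -> str:
    """Build the raw/YYYY/MM/DD/HH/... archive key from the event timestamp."""
    ts = datetime.fromisoformat(timestamp_iso)
    return f"raw/{ts:%Y/%m/%d/%H}/{uuid.uuid4()}.json"
```

Deriving the archive key from the event's own timestamp (rather than processing time) keeps the S3 partitions aligned with when the tick actually occurred.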
<p>Why write to both S3 and DynamoDB? DynamoDB is optimized for fast key-value and range queries and is perfect for real-time lookups and dashboards. S3 is the long-term system of record and data lake. By archiving every raw record in S3, you can run backfills, ad-hoc analytics, and train ML models using complete history, without touching production tables.</p>
<h3 id="heading-insights-trend-analysis-with-dynamodb-streams-and-lambda">Insights: Trend Analysis with DynamoDB Streams and Lambda</h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1755769229831/e1e363c2-6066-4e61-a984-0792239ab0c6.png" alt class="image--center mx-auto" /></p>
<p>The second Lambda function, <code>StockTrendAnalysis</code>, is triggered by DynamoDB Streams. Whenever the <code>stock-market-data</code> table changes, DynamoDB emits a stream record. The function queries recent items for a symbol (for example the last few minutes), computes short and long simple moving averages (such as SMA-5 and SMA-20), and detects crossovers that may indicate an uptrend or downtrend.</p>
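The crossover logic described above can be sketched in a few lines of Python; this is a minimal illustration of the SMA-crossover idea, not the actual <code>StockTrendAnalysis</code> source:

```python
def sma(prices, window):
    """Simple moving average of the last `window` prices (None if not enough data)."""
    if len(prices) < window:
        return None
    return sum(prices[-window:]) / window


def detect_crossover(prices, short=5, long=20):
    """Compare short/long SMAs before and after the newest tick.

    Returns "uptrend" when the short SMA crosses above the long SMA,
    "downtrend" when it crosses below, and None otherwise.
    """
    prev_s, prev_l = sma(prices[:-1], short), sma(prices[:-1], long)
    cur_s, cur_l = sma(prices, short), sma(prices, long)
    if None in (prev_s, prev_l, cur_s, cur_l):
        return None
    if prev_s <= prev_l and cur_s > cur_l:
        return "uptrend"
    if prev_s >= prev_l and cur_s < cur_l:
        return "downtrend"
    return None
```

Because the function queries the last few minutes per symbol, it only needs the newest tick to decide whether a crossover just happened, which keeps each invocation cheap.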
<p>If a signal is detected, the function publishes a message to an Amazon SNS topic named <code>stock-trend-alerts</code>. For ease of testing, the project creates a standard email subscription; you confirm the subscription by clicking a link in an AWS email. In production, you could send alerts to SMS, HTTPS webhooks, Slack, or event buses, all via SNS.</p>
<p>Both Lambda functions use environment variables for configuration. For example, the consumer reads the DynamoDB table name and S3 bucket name from its environment, and the trend function reads the table name and SNS topic ARN. This approach lets you move across environments without code changes.</p>
<h3 id="heading-archival-and-query-s3-glue-catalog-and-athena">Archival and Query: S3, Glue Catalog, and Athena</h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1755769279518/9b2806f9-4b32-4e9b-a1f8-5d08a1f7eb90.png" alt class="image--center mx-auto" /></p>
<p>All raw events are archived in S3. The project creates two buckets: one for raw data (<code>stock-market-data-bucket-121485</code>) and another for query results (<code>athena-query-results-121485</code>). Raw data is stored as JSON. An AWS Glue Catalog database and table define the schema over the S3 prefix so Amazon Athena can run SQL queries against the JSON files.</p>
<p>Athena itself needs almost no provisioning, since it is a serverless query service (only optional pieces such as workgroups and named queries can be managed in Terraform). Still, the project fully prepares the environment for Athena by creating the Glue Catalog and a results bucket. You can immediately explore the data from the Athena console using standard SQL and save or share queries as needed.</p>
<h3 id="heading-access-and-security-iam">Access and Security: IAM</h3>
<p>Two IAM roles are created using a reusable module. One role is for the Kinesis consumer Lambda; the other is for the trend-analysis Lambda. Managed policies grant access to the required services: DynamoDB, Kinesis (for the consumer), S3, SNS (for trend alerts), and CloudWatch Logs. As you harden the solution, you can replace broad managed policies with narrow, resource-level policies.</p>
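As a concrete direction for that hardening, a broad managed policy can be replaced with a resource-scoped policy document. The sketch below is hypothetical; the module output names are assumptions, not outputs the repository is known to expose:

```hcl
# Hypothetical least-privilege policy for the consumer Lambda
data "aws_iam_policy_document" "consumer" {
  statement {
    actions   = ["dynamodb:PutItem"]
    resources = [module.dynamodb.table_arn] # assumed module output
  }

  statement {
    actions   = ["s3:PutObject"]
    resources = ["${module.s3_bucket.bucket_arn}/raw/*"] # assumed module output
  }
}
```

Scoping each action to the specific table, stream, or prefix it touches limits the blast radius if a function is ever compromised.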
<h2 id="heading-implementation-approach-terraform-modules">Implementation Approach: Terraform Modules</h2>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1755769501702/c4c2a3ce-11dc-4aa2-8b94-b8e664786b15.png" alt class="image--center mx-auto" /></p>
<p>The project is intentionally simple to make the design easy to understand and extend. Each AWS service is represented by a small Terraform module. The root configuration calls those modules and passes explicit values. This makes resource relationships and data flow very clear.</p>
<ul>
<li><p><code>modules/kinesis</code>: Kinesis stream with name, shard count, retention, and encryption settings.</p>
</li>
<li><p><code>modules/lambda_function</code>: Creates a Lambda function and an event source mapping. It supports event sources from Kinesis streams and DynamoDB Streams via a generic <code>event_source_arn</code> variable.</p>
</li>
<li><p><code>modules/dynamodb</code>: Creates the <code>stock-market-data</code> table with on-demand billing, server-side encryption, point-in-time recovery, and a DynamoDB Stream.</p>
</li>
<li><p><code>modules/s3_bucket</code>: Creates S3 buckets with versioning, encryption, public access blocks, and <code>force_destroy</code> enabled so <code>terraform destroy</code> does not fail on non-empty buckets.</p>
</li>
<li><p><code>modules/glue_catalog</code>: Defines a Glue database and table using a JSON SerDe for the archived S3 data.</p>
</li>
<li><p><code>modules/iam_role</code>: Creates IAM roles and attaches managed policies passed as variables.</p>
</li>
<li><p><code>modules/sns</code>: Creates an SNS topic and a subscription (email protocol).</p>
</li>
</ul>
<p>The root <code>main.tf</code> wires these modules together, passing identifiers like ARNs where needed. The Lambda module references local ZIP artifacts for the functions and uses <code>source_code_hash</code> so updates are deployed when the package changes.</p>
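<p>For illustration, Terraform's <code>filebase64sha256()</code> — the function typically used to populate <code>source_code_hash</code> — can be reproduced in a few lines of Python. This is a sketch of the mechanism, not project code:</p>
<pre><code class="lang-python">import base64
import hashlib

def source_code_hash_bytes(data):
    """Base64-encoded SHA-256 digest, as produced by Terraform's filebase64sha256()."""
    return base64.b64encode(hashlib.sha256(data).digest()).decode("ascii")

def source_code_hash(zip_path):
    """Hash a local deployment package the way Terraform does for source_code_hash."""
    with open(zip_path, "rb") as f:
        return source_code_hash_bytes(f.read())
</code></pre>
<p>When the computed hash differs from what Lambda has on record, Terraform plans an in-place function update; an identical hash means no redeploy is needed.</p>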
<h2 id="heading-cost-efficiency-why-this-architecture-is-affordable">Cost Efficiency: Why This Architecture Is Affordable</h2>
<p>This design keeps costs tied to usage and eliminates idle infrastructure:</p>
<ul>
<li><p>Kinesis costs are dominated by the number of shards and PUT payload units. Starting with a single shard keeps the baseline low; scaling is linear and intentional.</p>
</li>
<li><p>Lambda is billed per request and compute duration. Choosing sensible batch sizes, memory, and timeouts lets you trade latency for cost. For example, a small batch size minimizes per-batch latency for near-immediate processing while keeping compute bursts small.</p>
</li>
<li><p>DynamoDB PAY_PER_REQUEST pricing removes the need to forecast capacity. You only pay for read and write units you actually consume. Point-in-time recovery adds a small storage cost but provides significant safety.</p>
</li>
<li><p>S3 is extremely cheap for storage, and you can optionally enable lifecycle rules to transition older data to cheaper tiers.</p>
</li>
<li><p>Glue Catalog has negligible cost for metadata storage.</p>
</li>
<li><p>Athena costs are per TB scanned. Because the raw data is organized by time and schema-defined, it is straightforward to add partitioning or switch to Parquet later to reduce scan costs substantially.</p>
</li>
</ul>
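<p>As a back-of-envelope check, the scan-cost math is simple. The $5-per-TB figure and the data volumes below are illustrative assumptions, not numbers from this project; verify current Athena pricing for your region:</p>
<pre><code class="lang-python"># Athena bills per data scanned; $5/TB is the commonly cited list price.
PRICE_PER_TB = 5.00

def athena_query_cost(scanned_gb):
    return scanned_gb / 1024 * PRICE_PER_TB

raw_json = athena_query_cost(90 * 10)         # 90 days at ~10 GB/day: about $4.39 per full scan
as_parquet = athena_query_cost(90 * 10 / 10)  # ~10x less scanned with Parquet: about $0.44
</code></pre>
<p>Partition pruning compounds the savings: a query restricted to one day touches only that day's prefix instead of the whole table.</p>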
<p>All of this means you can run a real-time analytics stack for a small team or pilot project at very low cost and scale it progressively as value is proven and requirements grow.</p>
<h2 id="heading-operational-flow">Operational Flow</h2>
<ol>
<li><p>Deploy the infrastructure with Terraform: <code>terraform init</code>, <code>terraform validate</code>, <code>terraform plan</code>, and <code>terraform apply</code>.</p>
</li>
<li><p>Confirm the SNS email subscription for the <code>stock-trend-alerts</code> topic by clicking the link in the confirmation email.</p>
</li>
<li><p>Start the producer: optionally set <code>KINESIS_STREAM_NAME</code>, then run <code>python producer_data_function.py</code>. Records will begin flowing into Kinesis.</p>
</li>
<li><p>Observe processing: the consumer Lambda archives raw JSON to S3 and writes curated items to DynamoDB. The trend Lambda listens to DynamoDB Streams and publishes alerts when it detects SMA crossovers.</p>
</li>
<li><p>Query archived data in Athena: open the Athena console, select the Glue database and table, and run SQL against your archived JSON. Results will appear in the query results bucket.</p>
</li>
<li><p>Tear down if needed: because the S3 module uses <code>force_destroy = true</code>, <code>terraform destroy</code> will delete the buckets even if objects remain. If you just enabled force-destroy, apply that change first, then destroy.</p>
</li>
</ol>
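<p>The crossover check in step 4 can be sketched in a few lines of Python. The window sizes and return values here are illustrative assumptions; the project's actual trend function is not reproduced in this article:</p>
<pre><code class="lang-python">def sma(prices, window):
    """Simple moving average of the most recent `window` prices."""
    return sum(prices[-window:]) / window

def detect_crossover(prices, short=5, long=20):
    """Return 'bullish', 'bearish', or None for the latest price tick."""
    if len(prices) > long:  # need at least `long` prices both before and after the tick
        prev = prices[:-1]
        prev_short, prev_long = sma(prev, short), sma(prev, long)
        curr_short, curr_long = sma(prices, short), sma(prices, long)
        if curr_short > curr_long and prev_long >= prev_short:
            return "bullish"
        if curr_long > curr_short and prev_short >= prev_long:
            return "bearish"
    return None
</code></pre>
<p>A "bullish" signal means the short-window average moved above the long-window average on the latest tick; "bearish" is the reverse. Only a change of state fires, so a sustained uptrend does not re-alert on every record.</p>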
<h2 id="heading-business-benefits">Business Benefits</h2>
<ul>
<li><p>Faster insights: Real-time ingestion and processing allow your teams to detect market shifts or operational anomalies as they happen. Alerts can route directly to people or systems.</p>
</li>
<li><p>Lower total cost of ownership: There are no servers to size or patch. Costs scale with usage and can remain near zero during quiet periods.</p>
</li>
<li><p>Durable data lake: By archiving raw events to S3, you keep a complete record for backtesting, trend discovery, and machine learning. Glue plus Athena provide ad-hoc SQL without building a warehouse on day one.</p>
</li>
<li><p>Operational simplicity: The system is composed of a few highly available, fully managed services. Terraform modules make the infrastructure explicit, consistent, and repeatable across environments.</p>
</li>
<li><p>Extensibility: Swap in a real market data feed, track more symbols, compute additional indicators (RSI, Bollinger Bands, VWAP), add APIs or dashboards, or stream cleaned data to other systems. The architecture is flexible by design.</p>
</li>
</ul>
<h2 id="heading-hardening-and-future-enhancements">Hardening and Future Enhancements</h2>
<ul>
<li><p>Least-privilege IAM: Replace broad managed policies with minimal, resource-scoped permissions.</p>
</li>
<li><p>Data format and partitioning: Store archived data as Parquet and add year/month/day/hour partitions for significant Athena cost and speed gains.</p>
</li>
<li><p>Observability: Add CloudWatch alarms for Kinesis iterator age, Lambda error rates and throttles, and DynamoDB activity. Consider tracing with AWS X-Ray for end-to-end visibility.</p>
</li>
<li><p>Resilience: Configure dead-letter queues on event source mappings and make Lambda writes idempotent to handle retries safely.</p>
</li>
<li><p>Multi-environment strategy: Externalize names and ARNs into variables, use remote Terraform state (S3 backend with DynamoDB locking), and adopt a consistent tagging scheme for cost allocation.</p>
</li>
</ul>
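<p>The idempotency suggestion can be sketched with a conditional DynamoDB write. The key attribute names (<code>symbol</code>, <code>event_time</code>) are assumptions about the table schema, and the error handling is duck-typed to mirror botocore's <code>ClientError</code>:</p>
<pre><code class="lang-python">def put_event_once(table, event):
    """Insert the event only if its key is not already present (idempotent)."""
    try:
        table.put_item(
            Item=event,
            ConditionExpression="attribute_not_exists(symbol) AND attribute_not_exists(event_time)",
        )
        return True
    except Exception as err:  # botocore ClientError in practice; duck-typed here
        code = getattr(err, "response", {}).get("Error", {}).get("Code", "")
        if code == "ConditionalCheckFailedException":
            return False  # retried delivery of the same record: safely ignored
        raise
</code></pre>
<p>Because the condition fails on a duplicate key, a retried batch simply reports <code>False</code> instead of overwriting or double-counting the record.</p>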
<h2 id="heading-conclusion">Conclusion</h2>
<p>This project is a concise blueprint for real-time analytics on AWS. Kinesis streams events in; Lambda transforms and stores; DynamoDB enables instantaneous lookups; S3 plus Glue make history queryable with Athena; and SNS turns analytics into action. Expressed as Terraform modules, the stack is simple to deploy, easy to understand, and inexpensive to run.</p>
<p>Whether you are prototyping trading signals, monitoring application events, or analyzing IoT telemetry, this architecture gives you a pragmatic, cost-efficient foundation that scales as your needs grow.</p>
<hr />
<p>LinkedIn: <a target="_blank" href="https://www.linkedin.com/in/ramon-villarin/"><strong>linkedin.com/in/ramon-villarin</strong></a></p>
<p>Portfolio Site: <a target="_blank" href="https://monvillarin.com/"><strong>MonVillarin.com</strong></a></p>
<p>GitHub Project Repo: <a target="_blank" href="https://github.com/kurokood/stock-market-data-analytics-pipeline">https://github.com/kurokood/stock-market-data-analytics-pipeline</a></p>
]]></content:encoded></item><item><title><![CDATA[Serverless Approach with AWS CI/CD: Transforming Operations and Reducing Costs]]></title><description><![CDATA[A deep dive into implementing a fully automated deployment pipeline using AWS services, and why this architecture is revolutionizing how businesses approach software delivery

In today's fast-paced digital landscape, the ability to deploy software qu...]]></description><link>https://blog.monvillarin.com/serverless-approach-with-aws-cicd-transforming-operations-and-reducing-costs</link><guid isPermaLink="true">https://blog.monvillarin.com/serverless-approach-with-aws-cicd-transforming-operations-and-reducing-costs</guid><dc:creator><![CDATA[Mon Villarin]]></dc:creator><pubDate>Thu, 14 Aug 2025 07:38:12 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1755156956007/66b6afdd-e7e0-43a6-a55f-771706f56557.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>A deep dive into implementing a fully automated deployment pipeline using AWS services, and why this architecture is revolutionizing how businesses approach software delivery</em></p>
<hr />
<p>In today's fast-paced digital landscape, the ability to deploy software quickly, reliably, and cost-effectively can make or break a business. Traditional deployment methods often involve complex server management, lengthy deployment cycles, and unpredictable costs that scale poorly with business growth. This article explores how modern serverless CI/CD architectures on AWS are solving these challenges, using a practical 2048 game deployment as a case study.</p>
<h2 id="heading-the-business-challenge-traditional-deployment-pain-points">The Business Challenge: Traditional Deployment Pain Points</h2>
<p>Before diving into the solution, let's examine the typical challenges businesses face with traditional deployment approaches:</p>
<h3 id="heading-1-infrastructure-management-overhead">1. <strong>Infrastructure Management Overhead</strong></h3>
<p>Traditional deployments require dedicated DevOps teams to manage servers, handle security patches, monitor system health, and scale infrastructure manually. This translates to significant operational costs and diverted focus from core business objectives.</p>
<h3 id="heading-2-unpredictable-scaling-costs">2. <strong>Unpredictable Scaling Costs</strong></h3>
<p>Maintaining always-on servers for variable workloads leads to either over-provisioning (wasted money) or under-provisioning (poor user experience). Businesses often struggle to find the sweet spot between cost and performance.</p>
<h3 id="heading-3-deployment-risk-and-downtime">3. <strong>Deployment Risk and Downtime</strong></h3>
<p>Manual deployment processes are error-prone and often require maintenance windows, resulting in lost revenue and poor user experience. The fear of deployment failures often leads to infrequent releases, slowing innovation.</p>
<h3 id="heading-4-security-and-compliance-complexity">4. <strong>Security and Compliance Complexity</strong></h3>
<p>Managing security across multiple servers, ensuring proper access controls, and maintaining compliance standards requires specialized expertise and constant vigilance.</p>
<h2 id="heading-the-modern-solution-serverless-cicd-architecture">The Modern Solution: Serverless CI/CD Architecture</h2>
<p>Our 2048 game deployment project demonstrates how modern AWS services can address these challenges through a fully automated, serverless CI/CD pipeline. Let's break down the architecture and its business benefits:</p>
<h3 id="heading-architecture-overview">Architecture Overview</h3>
<pre><code class="lang-yaml"><span class="hljs-string">GitHub</span> <span class="hljs-string">→</span> <span class="hljs-string">CodePipeline</span> <span class="hljs-string">→</span> <span class="hljs-string">CodeBuild</span> <span class="hljs-string">→</span> <span class="hljs-string">ECR</span> <span class="hljs-string">→</span> <span class="hljs-string">ECS</span> <span class="hljs-string">Fargate</span>
</code></pre>
<p>This seemingly simple flow represents a sophisticated system that eliminates most traditional deployment pain points while providing enterprise-grade reliability and security.</p>
<h2 id="heading-business-value-analysis-roi-and-cost-benefits">Business Value Analysis: ROI and Cost Benefits</h2>
<h3 id="heading-1-dramatic-reduction-in-operational-overhead">1. <strong>Dramatic Reduction in Operational Overhead</strong></h3>
<p><strong>Traditional Approach:</strong></p>
<ul>
<li><p>2-3 DevOps engineers ($150K-$200K annually each)</p>
</li>
<li><p>Server maintenance and monitoring tools ($50K-$100K annually)</p>
</li>
<li><p>Security management and compliance auditing ($75K-$150K annually)</p>
</li>
<li><p><strong>Total Annual Cost: $375K-$650K</strong></p>
</li>
</ul>
<p><strong>Serverless CI/CD Approach:</strong></p>
<ul>
<li><p>AWS services costs (detailed below)</p>
</li>
<li><p>0.5-1 DevOps engineer for initial setup and maintenance ($75K-$100K annually)</p>
</li>
<li><p>Automated security and compliance through AWS services</p>
</li>
<li><p><strong>Total Annual Cost: $80K-$120K + AWS usage</strong></p>
</li>
</ul>
<p><strong>Savings: 60-80% reduction in operational costs</strong></p>
<h3 id="heading-2-precise-cost-control-with-pay-per-use-model">2. <strong>Precise Cost Control with Pay-Per-Use Model</strong></h3>
<p>Let's break down the actual AWS costs for our architecture:</p>
<h4 id="heading-aws-fargate-costs"><strong>AWS Fargate Costs</strong></h4>
<ul>
<li><p><strong>Small Application (256 CPU, 512MB RAM)</strong>: ~$12-15/month for continuous operation</p>
</li>
<li><p><strong>Medium Application (512 CPU, 1GB RAM)</strong>: ~$25-30/month for continuous operation</p>
</li>
<li><p><strong>Auto-scaling</strong>: Costs scale linearly with actual usage, not provisioned capacity</p>
</li>
</ul>
<h4 id="heading-cicd-pipeline-costs"><strong>CI/CD Pipeline Costs</strong></h4>
<ul>
<li><p><strong>CodePipeline</strong>: $1/month per active pipeline</p>
</li>
<li><p><strong>CodeBuild</strong>: $0.005/minute of build time (typical build: 3-5 minutes)</p>
</li>
<li><p><strong>ECR</strong>: $0.10/GB/month for image storage</p>
</li>
<li><p><strong>S3 Artifacts</strong>: $0.023/GB/month for artifact storage</p>
</li>
</ul>
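<p>Plugging those rates into a quick monthly estimate (the build counts and storage sizes below are illustrative assumptions, not measured figures):</p>
<pre><code class="lang-python"># Monthly CI/CD cost from the per-unit rates quoted above.
def pipeline_monthly_cost(builds=40, build_minutes=4, ecr_gb=2.0, artifact_gb=1.0):
    codepipeline = 1.00                          # $1 per active pipeline
    codebuild = builds * build_minutes * 0.005   # $0.005 per build minute
    ecr = ecr_gb * 0.10                          # image storage
    s3 = artifact_gb * 0.023                     # artifact storage
    return codepipeline + codebuild + ecr + s3

print(round(pipeline_monthly_cost(), 2))  # prints 2.02
</code></pre>
<p>Even at ten times the build volume, the pipeline itself stays under $10 a month; compute, not CI/CD, dominates the bill.</p>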
<h4 id="heading-real-world-cost-example"><strong>Real-World Cost Example</strong></h4>
<p>For a typical web application with moderate traffic:</p>
<ul>
<li><p><strong>Monthly AWS costs</strong>: $50-100</p>
</li>
<li><p><strong>Annual AWS costs</strong>: $600-1,200</p>
</li>
<li><p><strong>Traditional server costs</strong>: $2,400-4,800 annually (just for basic VPS hosting)</p>
</li>
</ul>
<p><strong>Result: 75-85% cost reduction compared to traditional hosting</strong></p>
<h3 id="heading-3-zero-downtime-deployments-revenue-protection">3. <strong>Zero-Downtime Deployments = Revenue Protection</strong></h3>
<p>Traditional deployments often require maintenance windows, potentially costing businesses:</p>
<ul>
<li><p><strong>E-commerce</strong>: $100K-500K per hour of downtime</p>
</li>
<li><p><strong>SaaS Applications</strong>: $50K-200K per hour of downtime</p>
</li>
<li><p><strong>Content Platforms</strong>: $25K-100K per hour of downtime</p>
</li>
</ul>
<p>Our architecture provides:</p>
<ul>
<li><p><strong>Rolling deployments</strong> with automatic health checks</p>
</li>
<li><p><strong>Instant rollback</strong> capabilities</p>
</li>
<li><p><strong>Circuit breakers</strong> to prevent cascading failures</p>
</li>
<li><p><strong>99.99% availability</strong> targets backed by AWS service SLAs</p>
</li>
</ul>
<h3 id="heading-4-accelerated-time-to-market">4. <strong>Accelerated Time-to-Market</strong></h3>
<p><strong>Traditional Development Cycle:</strong></p>
<ul>
<li><p>Code development: 2 weeks</p>
</li>
<li><p>Manual testing and deployment preparation: 3-5 days</p>
</li>
<li><p>Deployment and troubleshooting: 1-2 days</p>
</li>
<li><p><strong>Total: 3-4 weeks per release</strong></p>
</li>
</ul>
<p><strong>Automated CI/CD Cycle:</strong></p>
<ul>
<li><p>Code development: 2 weeks</p>
</li>
<li><p>Automated testing and deployment: 5-10 minutes</p>
</li>
<li><p><strong>Total: 2 weeks per release</strong></p>
</li>
</ul>
<p><strong>Business Impact:</strong></p>
<ul>
<li><p>50% faster feature delivery</p>
</li>
<li><p>Increased competitive advantage</p>
</li>
<li><p>Higher customer satisfaction</p>
</li>
<li><p>More frequent revenue-generating releases</p>
</li>
</ul>
<h2 id="heading-technical-architecture-deep-dive-why-this-approach-works">Technical Architecture Deep Dive: Why This Approach Works</h2>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1755156972425/5d9d8932-bef8-4ff0-8d4d-8de3ff891cdf.png" alt class="image--center mx-auto" /></p>
<h3 id="heading-1-container-first-strategy-with-ecs-fargate">1. <strong>Container-First Strategy with ECS Fargate</strong></h3>
<p><strong>Business Benefits:</strong></p>
<ul>
<li><p><strong>No server management</strong>: Eliminates the need for dedicated infrastructure teams</p>
</li>
<li><p><strong>Automatic scaling</strong>: Handles traffic spikes without manual intervention</p>
</li>
<li><p><strong>Resource optimization</strong>: Pay only for actual container runtime</p>
</li>
<li><p><strong>Security by default</strong>: AWS manages the underlying infrastructure security</p>
</li>
</ul>
<p><strong>Cost Implications:</strong></p>
<ul>
<li><p><strong>Predictable pricing</strong>: $0.04048/vCPU/hour + $0.004445/GB/hour</p>
</li>
<li><p><strong>No idle costs</strong>: Containers only run when needed</p>
</li>
<li><p><strong>Automatic optimization</strong>: AWS continuously optimizes the underlying infrastructure</p>
</li>
</ul>
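<p>Using those rates, the raw compute cost is easy to model. This is a pure-compute sketch: 730 hours approximates one month of continuous operation, and the task sizes mirror the small and medium examples above:</p>
<pre><code class="lang-python"># Fargate on-demand pricing from the rates above (Linux/x86).
VCPU_HOUR = 0.04048
GB_HOUR = 0.004445

def fargate_monthly_cost(vcpu, memory_gb, hours=730):
    return (vcpu * VCPU_HOUR + memory_gb * GB_HOUR) * hours

small = fargate_monthly_cost(0.25, 0.5)   # 256 CPU / 512 MB: about $9.01
medium = fargate_monthly_cost(0.5, 1.0)   # 512 CPU / 1 GB: about $18.02
</code></pre>
<p>These pure-compute figures land somewhat below the rounded monthly estimates quoted earlier, which presumably fold in ancillary charges such as data transfer and image storage.</p>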
<h3 id="heading-2-infrastructure-as-code-with-terraform">2. <strong>Infrastructure as Code with Terraform</strong></h3>
<p><strong>Business Benefits:</strong></p>
<ul>
<li><p><strong>Reproducible environments</strong>: Eliminate "it works on my machine" problems</p>
</li>
<li><p><strong>Version-controlled infrastructure</strong>: Track and audit all infrastructure changes</p>
</li>
<li><p><strong>Disaster recovery</strong>: Rebuild entire infrastructure in minutes</p>
</li>
<li><p><strong>Multi-environment consistency</strong>: Identical staging and production environments</p>
</li>
</ul>
<p><strong>Cost Benefits:</strong></p>
<ul>
<li><p><strong>Prevent configuration drift</strong>: Avoid costly misconfigurations</p>
</li>
<li><p><strong>Resource optimization</strong>: Ensure resources are properly sized and tagged</p>
</li>
<li><p><strong>Compliance automation</strong>: Built-in security and compliance controls</p>
</li>
</ul>
<h3 id="heading-3-automated-cicd-pipeline">3. <strong>Automated CI/CD Pipeline</strong></h3>
<p><strong>Business Benefits:</strong></p>
<ul>
<li><p><strong>Reduced human error</strong>: Automated processes eliminate manual mistakes</p>
</li>
<li><p><strong>Faster feedback loops</strong>: Developers get immediate feedback on code changes</p>
</li>
<li><p><strong>Consistent deployments</strong>: Every deployment follows the same tested process</p>
</li>
<li><p><strong>Audit trail</strong>: Complete history of all deployments and changes</p>
</li>
</ul>
<p><strong>Cost Benefits:</strong></p>
<ul>
<li><p><strong>Reduced deployment time</strong>: From hours to minutes</p>
</li>
<li><p><strong>Lower failure rates</strong>: Automated testing catches issues early</p>
</li>
<li><p><strong>Faster recovery</strong>: Automated rollback capabilities</p>
</li>
</ul>
<h2 id="heading-real-world-business-scenarios-and-roi">Real-World Business Scenarios and ROI</h2>
<h3 id="heading-scenario-1-startup-with-limited-resources">Scenario 1: Startup with Limited Resources</h3>
<p><strong>Challenge:</strong> A startup with 5 developers needs to deploy multiple applications quickly while keeping costs minimal.</p>
<p><strong>Traditional Approach:</strong></p>
<ul>
<li><p>3 dedicated servers: $300/month</p>
</li>
<li><p>DevOps engineer: $120K/year</p>
</li>
<li><p>Deployment tools and monitoring: $500/month</p>
</li>
<li><p><strong>Total Annual Cost: $129,600</strong></p>
</li>
</ul>
<p><strong>Serverless CI/CD Approach:</strong></p>
<ul>
<li><p>AWS services: $200/month</p>
</li>
<li><p>Part-time DevOps consultant: $30K/year</p>
</li>
<li><p><strong>Total Annual Cost: $32,400</strong></p>
</li>
</ul>
<p><strong>ROI: 75% cost reduction, 10x faster deployments</strong></p>
<h3 id="heading-scenario-2-mid-size-company-with-multiple-products">Scenario 2: Mid-Size Company with Multiple Products</h3>
<p><strong>Challenge:</strong> A company with 50 developers managing 20 different applications across multiple environments.</p>
<p><strong>Traditional Approach:</strong></p>
<ul>
<li><p>Infrastructure team (5 people): $750K/year</p>
</li>
<li><p>Server and tooling costs: $200K/year</p>
</li>
<li><p><strong>Total Annual Cost: $950K</strong></p>
</li>
</ul>
<p><strong>Serverless CI/CD Approach:</strong></p>
<ul>
<li><p>AWS services (all applications): $50K/year</p>
</li>
<li><p>DevOps team (2 people): $300K/year</p>
</li>
<li><p><strong>Total Annual Cost: $350K</strong></p>
</li>
</ul>
<p><strong>ROI: 63% cost reduction, 5x faster time-to-market</strong></p>
<h3 id="heading-scenario-3-enterprise-with-compliance-requirements">Scenario 3: Enterprise with Compliance Requirements</h3>
<p><strong>Challenge:</strong> A financial services company needing SOC 2 compliance and 99.99% uptime.</p>
<p><strong>Traditional Approach:</strong></p>
<ul>
<li><p>Infrastructure and security team: $2M/year</p>
</li>
<li><p>Compliance auditing and tools: $500K/year</p>
</li>
<li><p>High-availability infrastructure: $1M/year</p>
</li>
<li><p><strong>Total Annual Cost: $3.5M</strong></p>
</li>
</ul>
<p><strong>Serverless CI/CD Approach:</strong></p>
<ul>
<li><p>AWS services with compliance features: $300K/year</p>
</li>
<li><p>Reduced team size: $1M/year</p>
</li>
<li><p>Built-in compliance and auditing: $100K/year</p>
</li>
<li><p><strong>Total Annual Cost: $1.4M</strong></p>
</li>
</ul>
<p><strong>ROI: 60% cost reduction, improved compliance posture</strong></p>
<h2 id="heading-strategic-business-advantages">Strategic Business Advantages</h2>
<h3 id="heading-1-competitive-agility">1. <strong>Competitive Agility</strong></h3>
<p>Companies using modern CI/CD architectures can:</p>
<ul>
<li><p>Deploy features 10x faster than competitors</p>
</li>
<li><p>Respond to market changes within hours, not weeks</p>
</li>
<li><p>A/B test new features with minimal risk</p>
</li>
<li><p>Scale globally without infrastructure concerns</p>
</li>
</ul>
<h3 id="heading-2-risk-mitigation">2. <strong>Risk Mitigation</strong></h3>
<ul>
<li><p><strong>Reduced blast radius</strong>: Containerized applications limit failure impact</p>
</li>
<li><p><strong>Automatic recovery</strong>: Self-healing infrastructure reduces downtime</p>
</li>
<li><p><strong>Security by design</strong>: AWS handles most security concerns automatically</p>
</li>
<li><p><strong>Compliance automation</strong>: Built-in audit trails and access controls</p>
</li>
</ul>
<h3 id="heading-3-talent-optimization">3. <strong>Talent Optimization</strong></h3>
<ul>
<li><p><strong>Focus on value creation</strong>: Developers spend time on features, not infrastructure</p>
</li>
<li><p><strong>Reduced specialized knowledge requirements</strong>: Less need for deep infrastructure expertise</p>
</li>
<li><p><strong>Improved developer experience</strong>: Faster feedback loops and easier debugging</p>
</li>
<li><p><strong>Attraction of top talent</strong>: Modern tooling attracts better developers</p>
</li>
</ul>
<h2 id="heading-implementation-strategy-and-best-practices">Implementation Strategy and Best Practices</h2>
<h3 id="heading-phase-1-foundation-weeks-1-2">Phase 1: Foundation (Weeks 1-2)</h3>
<ol>
<li><p>Set up basic CI/CD pipeline for one application</p>
</li>
<li><p>Implement Infrastructure as Code</p>
</li>
<li><p>Establish monitoring and alerting</p>
</li>
<li><p><strong>Expected ROI</strong>: 30% reduction in deployment time</p>
</li>
</ol>
<h3 id="heading-phase-2-optimization-weeks-3-4">Phase 2: Optimization (Weeks 3-4)</h3>
<ol>
<li><p>Add automated testing and security scanning</p>
</li>
<li><p>Implement multi-environment deployments</p>
</li>
<li><p>Set up auto-scaling and cost optimization</p>
</li>
<li><p><strong>Expected ROI</strong>: 50% reduction in operational overhead</p>
</li>
</ol>
<h3 id="heading-phase-3-scale-weeks-5-8">Phase 3: Scale (Weeks 5-8)</h3>
<ol>
<li><p>Migrate additional applications</p>
</li>
<li><p>Implement advanced monitoring and observability</p>
</li>
<li><p>Add disaster recovery and backup strategies</p>
</li>
<li><p><strong>Expected ROI</strong>: 70% total cost reduction</p>
</li>
</ol>
<h2 id="heading-cost-optimization-strategies">Cost Optimization Strategies</h2>
<h3 id="heading-1-right-sizing-resources">1. <strong>Right-Sizing Resources</strong></h3>
<ul>
<li><p>Start with minimal resources (256 CPU, 512MB RAM)</p>
</li>
<li><p>Use CloudWatch metrics to optimize based on actual usage</p>
</li>
<li><p>Implement auto-scaling to handle traffic variations</p>
</li>
</ul>
<h3 id="heading-2-lifecycle-management">2. <strong>Lifecycle Management</strong></h3>
<ul>
<li><p>Automatic cleanup of old Docker images (30-day retention)</p>
</li>
<li><p>S3 lifecycle policies for artifact management</p>
</li>
<li><p>CloudWatch log retention policies (7-30 days)</p>
</li>
</ul>
<h3 id="heading-3-spot-instances-for-non-production">3. <strong>Spot Instances for Non-Production</strong></h3>
<ul>
<li><p>Use Fargate Spot for development and staging environments</p>
</li>
<li><p>Up to 70% cost reduction for non-critical workloads</p>
</li>
<li><p>Automatic failover to on-demand instances</p>
</li>
</ul>
<h3 id="heading-4-multi-environment-strategy">4. <strong>Multi-Environment Strategy</strong></h3>
<ul>
<li><p>Shared infrastructure components across environments</p>
</li>
<li><p>Environment-specific scaling policies</p>
</li>
<li><p>Cost allocation tags for accurate billing</p>
</li>
</ul>
<h2 id="heading-measuring-success-key-performance-indicators">Measuring Success: Key Performance Indicators</h2>
<h3 id="heading-technical-kpis">Technical KPIs</h3>
<ul>
<li><p><strong>Deployment frequency</strong>: From weekly to multiple times per day</p>
</li>
<li><p><strong>Lead time</strong>: From weeks to hours</p>
</li>
<li><p><strong>Mean time to recovery</strong>: From hours to minutes</p>
</li>
<li><p><strong>Change failure rate</strong>: Reduced by 80-90%</p>
</li>
</ul>
<h3 id="heading-business-kpis">Business KPIs</h3>
<ul>
<li><p><strong>Infrastructure costs</strong>: 60-80% reduction</p>
</li>
<li><p><strong>Developer productivity</strong>: 40-60% improvement</p>
</li>
<li><p><strong>Time to market</strong>: 50-70% faster</p>
</li>
<li><p><strong>System reliability</strong>: 99.9%+ uptime</p>
</li>
</ul>
<h2 id="heading-future-proofing-your-architecture">Future-Proofing Your Architecture</h2>
<h3 id="heading-emerging-technologies-integration">Emerging Technologies Integration</h3>
<ul>
<li><p><strong>AI/ML workloads</strong>: Easy integration with AWS SageMaker</p>
</li>
<li><p><strong>Serverless functions</strong>: Seamless Lambda integration</p>
</li>
<li><p><strong>Edge computing</strong>: CloudFront and Lambda@Edge support</p>
</li>
<li><p><strong>IoT applications</strong>: Built-in IoT Core connectivity</p>
</li>
</ul>
<h3 id="heading-scalability-considerations">Scalability Considerations</h3>
<ul>
<li><p><strong>Global deployment</strong>: Multi-region support out of the box</p>
</li>
<li><p><strong>Microservices architecture</strong>: Container-native design</p>
</li>
<li><p><strong>Event-driven systems</strong>: Native AWS event integration</p>
</li>
<li><p><strong>Data analytics</strong>: Built-in CloudWatch and X-Ray integration</p>
</li>
</ul>
<h2 id="heading-conclusion-the-strategic-imperative">Conclusion: The Strategic Imperative</h2>
<p>The shift to serverless CI/CD architectures isn't just a technical upgrade—it's a strategic business transformation. Companies that embrace this approach gain significant competitive advantages:</p>
<ol>
<li><p><strong>60-80% reduction in infrastructure costs</strong></p>
</li>
<li><p><strong>10x faster deployment cycles</strong></p>
</li>
<li><p><strong>Improved system reliability and security</strong></p>
</li>
<li><p><strong>Enhanced developer productivity and satisfaction</strong></p>
</li>
<li><p><strong>Better resource allocation toward core business objectives</strong></p>
</li>
</ol>
<p>The 2048 game deployment project demonstrates that even simple applications benefit enormously from modern CI/CD practices. For businesses of all sizes, the question isn't whether to adopt these practices, but how quickly they can implement them to stay competitive.</p>
<p>As we've seen through real-world scenarios and cost analyses, the ROI is compelling across all business sizes—from startups saving 75% on infrastructure costs to enterprises reducing operational overhead by millions of dollars annually.</p>
<p>The future belongs to organizations that can deploy software quickly, reliably, and cost-effectively. By implementing serverless CI/CD architectures today, businesses position themselves not just for current success, but for the challenges and opportunities of tomorrow's digital landscape.</p>
<hr />
<p>LinkedIn: <a target="_blank" href="https://www.linkedin.com/in/ramon-villarin/"><strong>linkedin.com/in/ramon-villarin</strong></a></p>
<p>Portfolio Site: <a target="_blank" href="https://monvillarin.com/"><strong>MonVillarin.com</strong></a></p>
<p>GitHub Project Repo: <a target="_blank" href="https://github.com/kurokood/2048_game_with_aws_codepipeline_ecs_ecr">https://github.com/kurokood/2048_game_with_aws_codepipeline_ecs_ecr</a></p>
]]></content:encoded></item><item><title><![CDATA[A Business Intelligence Pipeline That Transforms Clickstream Into Insights]]></title><description><![CDATA[How we transformed a traditional EC2-based data pipeline into a cost-effective, serverless architecture that processes millions of events for real-world business intelligence.

The Challenge: Modern BI Needs Modern Architecture
In today's digital lan...]]></description><link>https://blog.monvillarin.com/a-business-intelligence-pipeline-that-transforms-clickstream-into-insights</link><guid isPermaLink="true">https://blog.monvillarin.com/a-business-intelligence-pipeline-that-transforms-clickstream-into-insights</guid><dc:creator><![CDATA[Mon Villarin]]></dc:creator><pubDate>Sun, 03 Aug 2025 08:55:20 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1754211207338/04a97929-8908-407a-8fec-a93103137f3a.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>How we transformed a traditional EC2-based data pipeline into a cost-effective, serverless architecture that processes millions of events for real-world business intelligence.</em></p>
<hr />
<h2 id="heading-the-challenge-modern-bi-needs-modern-architecture">The Challenge: Modern BI Needs Modern Architecture</h2>
<p>In today's digital landscape, businesses generate massive amounts of clickstream data—every page view, button click, and user interaction represents valuable insights waiting to be discovered. However, traditional approaches to processing this data often involve:</p>
<ul>
<li><p><strong>Over-provisioned servers</strong> running 24/7 for intermittent workloads</p>
</li>
<li><p><strong>Complex infrastructure management</strong> requiring dedicated DevOps resources</p>
</li>
<li><p><strong>High operational costs</strong> with poor resource utilization</p>
</li>
<li><p><strong>Scaling challenges</strong> during traffic spikes</p>
</li>
</ul>
<p>We set out to solve these problems by building a <strong>completely serverless business intelligence pipeline</strong> that automatically collects, processes, and analyzes clickstream data while reducing costs by 95% and eliminating operational overhead.</p>
<h2 id="heading-the-solution-a-serverless-first-approach">The Solution: A Serverless-First Approach</h2>
<p>Our solution leverages AWS's serverless ecosystem to create an intelligent, self-managing data pipeline:</p>
<h3 id="heading-architecture-overview">🏗️ <strong>Architecture Overview</strong></h3>
<pre><code class="lang-plaintext">EventBridge → Lambda → S3 → Glue → Athena/QuickSight
    ↓           ↓       ↓      ↓           ↓
 Schedule   Generate  Store  Process  Analyze/Visualize
</code></pre>
<p><strong>Core Components:</strong></p>
<ul>
<li><p><strong>AWS Lambda</strong>: Generates realistic clickstream events</p>
</li>
<li><p><strong>EventBridge</strong>: Orchestrates scheduled data generation</p>
</li>
<li><p><strong>S3</strong>: Scalable data lake for raw and processed data</p>
</li>
<li><p><strong>AWS Glue</strong>: Serverless ETL for data transformation</p>
</li>
<li><p><strong>Amazon Athena</strong>: SQL analytics engine for technical users</p>
</li>
<li><p><strong>AWS QuickSight</strong>: Interactive dashboards for business users</p>
</li>
<li><p><strong>Terraform</strong>: Infrastructure as Code for reproducible deployments</p>
</li>
</ul>
<h2 id="heading-how-each-component-works-together">How Each Component Works Together</h2>
<h3 id="heading-1-data-generation-engine-lambda">1. <strong>Data Generation Engine (Lambda)</strong></h3>
<p>Our Lambda function acts as a sophisticated clickstream simulator:</p>
<pre><code class="lang-python">import random

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">generate_event</span>():</span>
    <span class="hljs-keyword">return</span> {
        <span class="hljs-string">'event_type'</span>: random.choices([<span class="hljs-string">'click'</span>, <span class="hljs-string">'search'</span>, <span class="hljs-string">'purchase'</span>], weights=[<span class="hljs-number">0.6</span>, <span class="hljs-number">0.3</span>, <span class="hljs-number">0.1</span>])[<span class="hljs-number">0</span>],
        <span class="hljs-string">'user_id'</span>: random_string(<span class="hljs-number">10</span>),
        <span class="hljs-string">'user_action'</span>: random.choices([<span class="hljs-string">'home_page'</span>, <span class="hljs-string">'product_page'</span>, <span class="hljs-string">'cart_page'</span>], weights=[<span class="hljs-number">0.2</span>, <span class="hljs-number">0.4</span>, <span class="hljs-number">0.2</span>])[<span class="hljs-number">0</span>],
        <span class="hljs-string">'location'</span>: random.choices(country_codes, weights=country_probabilities)[<span class="hljs-number">0</span>],
        <span class="hljs-string">'user_age'</span>: max(<span class="hljs-number">16</span>, min(<span class="hljs-number">80</span>, int(random.normalvariate(<span class="hljs-number">35</span>, <span class="hljs-number">10</span>)))),
        <span class="hljs-string">'timestamp'</span>: generate_realistic_timestamp()
    }
</code></pre>
<p><strong>Key Features:</strong></p>
<ul>
<li><p><strong>Realistic Data Distribution</strong>: Uses weighted random selection to simulate real user behavior</p>
</li>
<li><p><strong>Geographic Diversity</strong>: Includes 45+ countries with realistic population distributions</p>
</li>
<li><p><strong>Temporal Patterns</strong>: Generates timestamps spanning 60 days for trend analysis</p>
</li>
<li><p><strong>Event Variety</strong>: Simulates clicks, searches, and purchases with appropriate ratios</p>
</li>
</ul>
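<p>The <code>random_string</code> and <code>generate_realistic_timestamp</code> helpers are elided from the excerpt above; here is a minimal sketch of how they might be implemented (our illustration — the actual project code may differ):</p>
<pre><code class="lang-python">import random
import string
import time

def random_string(length):
    # Random alphanumeric identifier, e.g. for synthetic user IDs
    return ''.join(random.choices(string.ascii_lowercase + string.digits, k=length))

def generate_realistic_timestamp(window_days=60):
    # Epoch timestamp sampled uniformly from the trailing N-day window,
    # giving the 60 days of history mentioned above
    now = int(time.time())
    return now - random.randint(0, window_days * 86400)
</code></pre>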
<p><strong>Business Value</strong>: Provides high-quality synthetic data that mirrors real-world patterns, enabling teams to develop and test analytics without exposing sensitive customer data.</p>
<h3 id="heading-2-intelligent-scheduling-eventbridge">2. <strong>Intelligent Scheduling (EventBridge)</strong></h3>
<p>EventBridge orchestrates our data generation with precision:</p>
<pre><code class="lang-plaintext">resource "aws_cloudwatch_event_rule" "lambda_schedule" {
  schedule_expression = var.lambda_schedule  # "rate(5 minutes)"
  description         = "Trigger clickstream generator on schedule"
}
</code></pre>
<p><strong>Capabilities:</strong></p>
<ul>
<li><p><strong>Flexible Scheduling</strong>: From minutes to days, easily configurable</p>
</li>
<li><p><strong>Automatic Retry</strong>: Built-in error handling and retry logic</p>
</li>
<li><p><strong>Cost Optimization</strong>: Only triggers when needed, no idle compute</p>
</li>
<li><p><strong>Monitoring Integration</strong>: Native CloudWatch metrics and alarms</p>
</li>
</ul>
<p><strong>Business Impact</strong>: Ensures consistent data flow for real-time analytics while minimizing costs through precise scheduling.</p>
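<p>Because Lambda billing follows invocations, it helps to translate a schedule expression into a daily trigger count. A small helper (our own illustration, not part of the pipeline code): at <code>rate(5 minutes)</code> the rule fires 288 times per day — consistent with the 5,760 events/day figure quoted later if each run emits 20 events, which is an assumption on our part.</p>
<pre><code class="lang-python">def invocations_per_day(schedule_expression):
    # Parse an EventBridge rate() expression, e.g. "rate(5 minutes)",
    # and return how many times the rule fires per day
    inner = schedule_expression.strip()[5:-1]  # drop "rate(" and ")"
    value, unit = inner.split()
    seconds_per_unit = {'minute': 60, 'minutes': 60,
                        'hour': 3600, 'hours': 3600,
                        'day': 86400, 'days': 86400}
    # e.g. invocations_per_day('rate(5 minutes)') returns 288
    return 86400 // (int(value) * seconds_per_unit[unit])
</code></pre>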
<h3 id="heading-3-scalable-data-lake-s3">3. <strong>Scalable Data Lake (S3)</strong></h3>
<p>Our S3 architecture implements a modern data lake pattern:</p>
<pre><code class="lang-plaintext">s3://bucket/
├── raw/           # Landing zone for fresh data
├── results/       # Processed, analytics-ready data
├── processed/     # Archived raw data
├── reference/     # Lookup tables and metadata
└── athena-results/ # Query results cache
</code></pre>
<p><strong>Advanced Features:</strong></p>
<ul>
<li><p><strong>Lifecycle Management</strong>: Automatic data archiving and cost optimization</p>
</li>
<li><p><strong>Security by Default</strong>: Public access blocked, encryption enabled</p>
</li>
<li><p><strong>Versioning</strong>: Data lineage and recovery capabilities</p>
</li>
<li><p><strong>Cross-Region Replication</strong>: Disaster recovery and compliance</p>
</li>
</ul>
<p><strong>Real-World Application</strong>: Supports petabyte-scale data growth while maintaining sub-second query performance through intelligent partitioning.</p>
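<p>The "intelligent partitioning" mentioned above is typically implemented with Hive-style key prefixes, which let Athena prune partitions instead of scanning the whole bucket. One possible key scheme for the landing zone (a sketch — the project's actual layout may differ):</p>
<pre><code class="lang-python">import time

def raw_event_key(epoch_seconds, batch_id):
    # Build a Hive-style partitioned key under the raw/ landing zone,
    # e.g. raw/year=2025/month=08/day=03/events-abc123.json
    t = time.gmtime(epoch_seconds)
    return (f"raw/year={t.tm_year}/month={t.tm_mon:02d}/"
            f"day={t.tm_mday:02d}/events-{batch_id}.json")
</code></pre>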
<h3 id="heading-4-serverless-etl-aws-glue">4. <strong>Serverless ETL (AWS Glue)</strong></h3>
<p>Our Glue job transforms raw clickstream data into business-ready insights:</p>
<pre><code class="lang-python"><span class="hljs-comment"># Join clickstream events with geographic data</span>
join_datasets = Join.apply(
    frame1=clickstream_events, 
    frame2=geographic_reference,
    keys1=[<span class="hljs-string">"location"</span>], 
    keys2=[<span class="hljs-string">"id"</span>]
)

<span class="hljs-comment"># Transform and enrich data</span>
processed_data = ApplyMapping.apply(
    frame=join_datasets,
    mappings=[
        (<span class="hljs-string">"user_age"</span>, <span class="hljs-string">"int"</span>, <span class="hljs-string">"user_age"</span>, <span class="hljs-string">"bigint"</span>),
        (<span class="hljs-string">"timestamp"</span>, <span class="hljs-string">"int"</span>, <span class="hljs-string">"click_date"</span>, <span class="hljs-string">"timestamp"</span>),
        (<span class="hljs-string">"location"</span>, <span class="hljs-string">"string"</span>, <span class="hljs-string">"country_name"</span>, <span class="hljs-string">"string"</span>)
    ]
)
</code></pre>
<p><strong>Transformation Capabilities:</strong></p>
<ul>
<li><p><strong>Data Enrichment</strong>: Adds geographic context to raw events</p>
</li>
<li><p><strong>Schema Evolution</strong>: Handles changing data structures automatically</p>
</li>
<li><p><strong>Data Quality</strong>: Built-in validation and cleansing</p>
</li>
<li><p><strong>Partitioning</strong>: Optimizes query performance through intelligent data organization</p>
</li>
</ul>
<p><strong>Business Benefits</strong>: Converts raw events into actionable business metrics, enabling analysts to focus on insights rather than data preparation.</p>
<h3 id="heading-5-analytics-engine-athena-quicksight">5. <strong>Analytics Engine (Athena + QuickSight)</strong></h3>
<p>Our dual-layer analytics approach serves both technical and business users:</p>
<p><strong>Athena</strong> provides SQL-based analytics for technical users:</p>
<pre><code class="lang-sql"><span class="hljs-comment">-- Customer behavior analysis</span>
<span class="hljs-keyword">SELECT</span> 
    continent,
    event_type,
    <span class="hljs-keyword">AVG</span>(user_age) <span class="hljs-keyword">as</span> avg_customer_age,
    <span class="hljs-keyword">COUNT</span>(*) <span class="hljs-keyword">as</span> event_volume,
    <span class="hljs-keyword">COUNT</span>(<span class="hljs-keyword">DISTINCT</span> user_id) <span class="hljs-keyword">as</span> unique_users
<span class="hljs-keyword">FROM</span> clickstream_db.clickstream_table 
<span class="hljs-keyword">WHERE</span> click_date &gt;= <span class="hljs-keyword">current_date</span> - <span class="hljs-built_in">interval</span> <span class="hljs-string">'7'</span> <span class="hljs-keyword">day</span>
<span class="hljs-keyword">GROUP</span> <span class="hljs-keyword">BY</span> continent, event_type
<span class="hljs-keyword">ORDER</span> <span class="hljs-keyword">BY</span> event_volume <span class="hljs-keyword">DESC</span>;
</code></pre>
<p><strong>Athena Capabilities:</strong></p>
<ul>
<li><p><strong>Interactive Queries</strong>: Results in seconds, even on multi-terabyte datasets</p>
</li>
<li><p><strong>Standard SQL</strong>: No learning curve for existing analysts</p>
</li>
<li><p><strong>Integration Ready</strong>: Connects to BI tools and custom applications</p>
</li>
<li><p><strong>Cost Effective</strong>: Pay only for data scanned, not compute time</p>
</li>
</ul>
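<p>"Pay only for data scanned" also makes cost easy to estimate up front. Assuming Athena's list price of $5 per TB scanned and the 10 MB per-query billing minimum (both assumptions worth re-checking against current pricing), a back-of-the-envelope helper:</p>
<pre><code class="lang-python">def athena_query_cost(bytes_scanned, price_per_tb=5.0):
    # Estimate the cost of a single Athena query from bytes scanned;
    # each query is billed against a 10 MB minimum
    billed = max(bytes_scanned, 10 * 1024 * 1024)
    return billed / (1024 ** 4) * price_per_tb
</code></pre>
<p>Scanning a full terabyte costs about $5, while a query that touches only one day's partition can cost a fraction of a cent — which is why partitioning the data lake matters.</p>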
<p><strong>QuickSight</strong> delivers self-service analytics for business users:</p>
<pre><code class="lang-plaintext"># QuickSight Dashboard with pre-built visualizations
resource "aws_quicksight_dashboard" "clickstream_dashboard" {
  dashboard_id = "clickstream-dashboard"
  name         = "Clickstream Business Intelligence Dashboard"

  # Executive Summary, Geographic Analysis, User Behavior sheets
  definition {
    sheets {
      visuals {
        geospatial_map_visual {
          title { plain_text = "Global Event Distribution" }
        }
        bar_chart_visual {
          title { plain_text = "Events by Country" }
        }
        pie_chart_visual {
          title { plain_text = "Event Type Distribution" }
        }
      }
    }
  }
}
</code></pre>
<p><strong>QuickSight Benefits:</strong></p>
<ul>
<li><p><strong>No-Code Analytics</strong>: Drag-and-drop interface for business users</p>
</li>
<li><p><strong>Interactive Dashboards</strong>: Real-time filtering and drill-down capabilities</p>
</li>
<li><p><strong>Mobile Ready</strong>: Native mobile apps for executives and field teams</p>
</li>
<li><p><strong>Embedded Analytics</strong>: White-label dashboards for customer-facing applications</p>
</li>
<li><p><strong>ML Insights</strong>: Automatic anomaly detection and forecasting</p>
</li>
<li><p><strong>Cost Effective</strong>: Pay-per-session pricing model</p>
</li>
</ul>
<h2 id="heading-bridging-the-gap-technical-analytics-business-intelligence">Bridging the Gap: Technical Analytics + Business Intelligence</h2>
<p>One of the biggest challenges in modern data platforms is serving both <strong>technical users</strong> (data analysts, engineers) and <strong>business users</strong> (executives, marketers, product managers) effectively. The solution addresses this with a dual-layer approach:</p>
<h3 id="heading-for-technical-users-athena-sql"><strong>For Technical Users: Athena SQL</strong></h3>
<ul>
<li><p><strong>Complex Analysis</strong>: Multi-table joins, window functions, advanced aggregations</p>
</li>
<li><p><strong>Data Exploration</strong>: Ad-hoc queries for hypothesis testing</p>
</li>
<li><p><strong>Integration</strong>: API access for custom applications and automated reports</p>
</li>
<li><p><strong>Cost Control</strong>: Query optimization and result caching</p>
</li>
</ul>
<h3 id="heading-for-business-users-quicksight-dashboards"><strong>For Business Users: QuickSight Dashboards</strong></h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1754146869550/bdfa9f0f-f8f3-4ded-807b-1b1847374fce.png" alt class="image--center mx-auto" /></p>
<ul>
<li><p><strong>Self-Service</strong>: Drag-and-drop interface, no SQL knowledge required</p>
</li>
<li><p><strong>Interactive Exploration</strong>: Click-to-filter, drill-down capabilities</p>
</li>
<li><p><strong>Mobile Access</strong>: Native iOS/Android apps for executives on-the-go</p>
</li>
<li><p><strong>Collaboration</strong>: Share insights, add comments, schedule reports</p>
</li>
</ul>
<h3 id="heading-the-power-of-integration"><strong>The Power of Integration</strong></h3>
<p>QuickSight connects directly to our Athena/Glue data catalog, meaning:</p>
<ul>
<li><p><strong>Single Source of Truth</strong>: Both technical and business users see the same data</p>
</li>
<li><p><strong>Real-time Updates</strong>: Dashboards refresh automatically as new data arrives</p>
</li>
<li><p><strong>Consistent Metrics</strong>: No discrepancies between SQL queries and visual reports</p>
</li>
<li><p><strong>Governance</strong>: Centralized security and access control</p>
</li>
</ul>
<h2 id="heading-real-world-business-problems-we-solve">Real-World Business Problems We Solve</h2>
<h3 id="heading-1-e-commerce-optimization">1. <strong>E-commerce Optimization</strong></h3>
<p><strong>Problem</strong>: Online retailers need to understand customer journey patterns to optimize conversion rates.</p>
<p><strong>Solution</strong>:</p>
<ul>
<li><p><strong>Technical Analysis</strong>: SQL queries for detailed funnel analysis and cohort studies</p>
</li>
<li><p><strong>Business Dashboards</strong>: QuickSight visualizations showing conversion rates by geography</p>
</li>
<li><p><strong>Executive Views</strong>: High-level KPIs with drill-down capabilities for marketing teams</p>
</li>
<li><p><strong>Real-time Monitoring</strong>: Live dashboards with automatic alerts for conversion drops</p>
</li>
</ul>
<p><strong>Business Impact</strong>:</p>
<ul>
<li><p>15-25% improvement in conversion rates through funnel optimization</p>
</li>
<li><p>Geographic targeting increases ad spend efficiency by 30%</p>
</li>
<li><p>Real-time alerts for unusual patterns (potential issues or opportunities)</p>
</li>
</ul>
<h3 id="heading-2-content-platform-analytics">2. <strong>Content Platform Analytics</strong></h3>
<p><strong>Problem</strong>: Media companies need to understand content engagement patterns across different demographics and regions.</p>
<p><strong>Solution</strong>:</p>
<ul>
<li><p><strong>Data Processing</strong>: Handles millions of content interaction events with sub-minute latency</p>
</li>
<li><p><strong>Business Intelligence</strong>: QuickSight dashboards showing content performance by demographics</p>
</li>
<li><p><strong>Editorial Tools</strong>: Interactive visualizations for content teams to identify trending topics</p>
</li>
<li><p><strong>Executive Reporting</strong>: Automated weekly/monthly reports with engagement insights</p>
</li>
</ul>
<p><strong>Business Impact</strong>:</p>
<ul>
<li><p>Content recommendation accuracy improved by 40%</p>
</li>
<li><p>User engagement time increased by 25%</p>
</li>
<li><p>Reduced content production costs through data-driven decisions</p>
</li>
</ul>
<h3 id="heading-3-saas-product-intelligence">3. <strong>SaaS Product Intelligence</strong></h3>
<p><strong>Problem</strong>: Software companies need detailed usage analytics to drive product development and reduce churn.</p>
<p><strong>Solution</strong>:</p>
<ul>
<li><p><strong>Product Analytics</strong>: Detailed feature usage tracking with SQL-based analysis</p>
</li>
<li><p><strong>Customer Success Dashboards</strong>: QuickSight views showing user health scores and churn risk</p>
</li>
<li><p><strong>Executive Metrics</strong>: High-level subscription and retention KPIs with geographic breakdowns</p>
</li>
<li><p><strong>Team Collaboration</strong>: Shared dashboards enabling data-driven product decisions</p>
</li>
</ul>
<p><strong>Business Impact</strong>:</p>
<ul>
<li><p>Reduced customer churn by 20% through predictive analytics</p>
</li>
<li><p>Feature development prioritization based on actual usage data</p>
</li>
<li><p>Improved onboarding flow increased trial-to-paid conversion by 35%</p>
</li>
</ul>
<h2 id="heading-architecture-trade-offs-and-design-decisions">Architecture Trade-offs and Design Decisions</h2>
<h3 id="heading-what-we-gained">✅ <strong>What We Gained</strong></h3>
<p><strong>Cost Efficiency</strong></p>
<ul>
<li><p><strong>95% cost reduction</strong>: From $30+/month to ~$1/month</p>
</li>
<li><p><strong>No idle resources</strong>: Pay only for actual usage</p>
</li>
<li><p><strong>Automatic scaling</strong>: Handle traffic spikes without over-provisioning</p>
</li>
</ul>
<p><strong>Operational Excellence</strong></p>
<ul>
<li><p><strong>Zero maintenance</strong>: No servers to patch or monitor</p>
</li>
<li><p><strong>Built-in reliability</strong>: Multi-AZ deployment by default</p>
</li>
<li><p><strong>Automatic backups</strong>: S3 versioning and cross-region replication</p>
</li>
</ul>
<p><strong>Developer Productivity</strong></p>
<ul>
<li><p><strong>Infrastructure as Code</strong>: Reproducible deployments across environments</p>
</li>
<li><p><strong>Rapid iteration</strong>: Deploy changes in minutes, not hours</p>
</li>
<li><p><strong>Focus on business logic</strong>: Less time on infrastructure, more on features</p>
</li>
</ul>
<p><strong>Business User Empowerment</strong></p>
<ul>
<li><p><strong>Self-Service Analytics</strong>: Business users create their own reports without IT involvement</p>
</li>
<li><p><strong>Interactive Exploration</strong>: Drill-down capabilities and dynamic filtering</p>
</li>
<li><p><strong>Mobile Access</strong>: Executive dashboards available on any device</p>
</li>
</ul>
<h3 id="heading-trade-offs-we-made">⚠️ <strong>Trade-offs We Made</strong></h3>
<p><strong>Cold Start Latency</strong></p>
<ul>
<li><p><strong>Impact</strong>: 1-2 second delay on first Lambda execution</p>
</li>
<li><p><strong>Mitigation</strong>: EventBridge keeps functions warm through regular scheduling</p>
</li>
<li><p><strong>Business Context</strong>: Acceptable for batch processing, not suitable for real-time user-facing APIs</p>
</li>
</ul>
<p><strong>Vendor Lock-in</strong></p>
<ul>
<li><p><strong>Reality</strong>: Deep integration with AWS services</p>
</li>
<li><p><strong>Mitigation</strong>: Standard interfaces (SQL, REST APIs) for data access</p>
</li>
<li><p><strong>Strategy</strong>: Benefits outweigh portability concerns for most use cases</p>
</li>
</ul>
<p><strong>Debugging Complexity</strong></p>
<ul>
<li><p><strong>Challenge</strong>: Distributed system troubleshooting</p>
</li>
<li><p><strong>Solution</strong>: Comprehensive logging and monitoring with CloudWatch</p>
</li>
<li><p><strong>Best Practice</strong>: Structured logging and correlation IDs across services</p>
</li>
</ul>
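<p>The correlation IDs mentioned above are cheap to implement: reuse an upstream ID when one is present so a single request can be traced across Lambda, Glue, and Athena logs, and mint a fresh one otherwise. A minimal sketch (illustrative — the field name is our assumption):</p>
<pre><code class="lang-python">import uuid

def correlation_id_for(event):
    # Propagate the caller's correlation ID when present; otherwise mint one
    return event.get('correlation_id') or str(uuid.uuid4())
</code></pre>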
<h2 id="heading-performance-and-scale-characteristics">Performance and Scale Characteristics</h2>
<h3 id="heading-current-capacity"><strong>Current Capacity</strong></h3>
<ul>
<li><p><strong>Data Generation</strong>: 5,760 events/day (configurable up to millions)</p>
</li>
<li><p><strong>Processing Latency</strong>: Sub-5 minute end-to-end pipeline</p>
</li>
<li><p><strong>Query Performance</strong>: Interactive response times (seconds) on 100GB+ datasets</p>
</li>
<li><p><strong>Concurrent Users</strong>: Effectively unlimited (Athena scales query concurrency automatically, within account service quotas)</p>
</li>
</ul>
<h3 id="heading-scaling-patterns"><strong>Scaling Patterns</strong></h3>
<ul>
<li><p><strong>Horizontal</strong>: Add more Lambda concurrent executions</p>
</li>
<li><p><strong>Vertical</strong>: Increase Glue job worker count for larger datasets</p>
</li>
<li><p><strong>Temporal</strong>: Adjust generation frequency based on business needs</p>
</li>
<li><p><strong>Geographic</strong>: Multi-region deployment for global compliance</p>
</li>
</ul>
<h2 id="heading-implementation-best-practices">Implementation Best Practices</h2>
<h3 id="heading-security-first"><strong>Security First</strong></h3>
<pre><code class="lang-plaintext"># S3 bucket with security by default
resource "aws_s3_bucket_public_access_block" "clickstream" {
  block_public_acls       = true
  block_public_policy     = true
  ignore_public_acls      = true
  restrict_public_buckets = true
}
</code></pre>
<h3 id="heading-cost-optimization"><strong>Cost Optimization</strong></h3>
<pre><code class="lang-plaintext"># Lifecycle management for cost control
resource "aws_s3_bucket_lifecycle_configuration" "clickstream" {
  rule {
    id     = "archive_old_data"
    status = "Enabled"

    transition {
      days          = 30
      storage_class = "STANDARD_IA"
    }

    transition {
      days          = 90
      storage_class = "GLACIER"
    }
  }
}
</code></pre>
<h3 id="heading-monitoring-and-observability"><strong>Monitoring and Observability</strong></h3>
<pre><code class="lang-python"><span class="hljs-comment"># Structured logging for better observability</span>
import logging

logger = logging.getLogger(__name__)

logger.info(<span class="hljs-string">"Event processed"</span>, extra={
    <span class="hljs-string">"event_type"</span>: event_data[<span class="hljs-string">"event_type"</span>],
    <span class="hljs-string">"user_location"</span>: event_data[<span class="hljs-string">"location"</span>],
    <span class="hljs-string">"processing_time_ms"</span>: processing_time,
    <span class="hljs-string">"correlation_id"</span>: correlation_id
})
</code></pre>
<h2 id="heading-getting-started-from-zero-to-insights-in-10-minutes">Getting Started: From Zero to Insights in 10 Minutes</h2>
<h3 id="heading-prerequisites"><strong>Prerequisites</strong></h3>
<ul>
<li><p>AWS Account with appropriate permissions</p>
</li>
<li><p>Terraform &gt;= 1.0 installed</p>
</li>
<li><p>AWS CLI configured</p>
</li>
</ul>
<h3 id="heading-deployment"><strong>Deployment</strong></h3>
<pre><code class="lang-bash"><span class="hljs-comment"># Clone and deploy</span>
git <span class="hljs-built_in">clone</span> &lt;repository&gt;
<span class="hljs-built_in">cd</span> business_intelligence_app
terraform init

<span class="hljs-comment"># Deploy with QuickSight dashboards (optional)</span>
terraform apply -var=<span class="hljs-string">"quicksight_user=your-quicksight-username"</span>

<span class="hljs-comment"># Or deploy without QuickSight (Athena only)</span>
terraform apply
</code></pre>
<h3 id="heading-immediate-value"><strong>Immediate Value</strong></h3>
<ul>
<li><p>Data generation starts automatically</p>
</li>
<li><p>First insights available within 15 minutes</p>
</li>
<li><p>QuickSight dashboards ready in 30 minutes</p>
</li>
<li><p>Full analytics capability in under an hour</p>
</li>
</ul>
<h2 id="heading-future-enhancements-and-roadmap">Future Enhancements and Roadmap</h2>
<h3 id="heading-short-term-next-3-months"><strong>Short Term (Next 3 Months)</strong></h3>
<ul>
<li><p><strong>Real-time Streaming</strong>: Kinesis integration for sub-second analytics</p>
</li>
<li><p><strong>Machine Learning</strong>: QuickSight ML insights and forecasting</p>
</li>
<li><p><strong>Advanced Visualizations</strong>: Custom QuickSight themes and branding</p>
</li>
</ul>
<h3 id="heading-medium-term-6-12-months"><strong>Medium Term (6-12 Months)</strong></h3>
<ul>
<li><p><strong>Multi-tenant Architecture</strong>: Separate QuickSight namespaces for business units</p>
</li>
<li><p><strong>Embedded Analytics</strong>: White-label dashboards for customer portals</p>
</li>
<li><p><strong>Advanced Permissions</strong>: Row-level security and data governance</p>
</li>
</ul>
<h3 id="heading-long-term-12-months"><strong>Long Term (12+ Months)</strong></h3>
<ul>
<li><p><strong>Cross-Cloud Support</strong>: Azure and GCP deployment options</p>
</li>
<li><p><strong>Edge Computing</strong>: IoT and mobile data collection</p>
</li>
<li><p><strong>AI-Powered Insights</strong>: QuickSight Q for natural language queries</p>
</li>
</ul>
<h2 id="heading-conclusion-the-future-is-serverless">Conclusion: The Future is Serverless</h2>
<p>This project demonstrates that modern business intelligence doesn't require complex infrastructure or massive operational overhead. By embracing serverless architecture, we've created a solution that:</p>
<ul>
<li><p><strong>Scales automatically</strong> from startup to enterprise</p>
</li>
<li><p><strong>Costs 95% less</strong> than traditional approaches</p>
</li>
<li><p><strong>Requires zero maintenance</strong> while providing enterprise-grade reliability</p>
</li>
<li><p><strong>Delivers insights faster</strong> through simplified data pipelines</p>
</li>
</ul>
<p>The serverless paradigm isn't just about cost savings—it's about <strong>focusing on business value</strong> rather than infrastructure complexity. When your data pipeline manages itself, your team can focus on what matters: <strong>turning data into actionable business insights</strong>.</p>
<p>Whether you're a startup looking to implement your first analytics pipeline or an enterprise seeking to modernize legacy systems, this serverless approach provides a proven path to scalable, cost-effective business intelligence.</p>
<p>LinkedIn: <a target="_blank" href="https://www.linkedin.com/in/ramon-villarin/"><strong>https://www.linkedin.com/in/ramon-villarin/</strong></a></p>
<p>Portfolio Site: <a target="_blank" href="https://monvillarin.com/">MonVillarin.com</a></p>
<p>GitHub Project Repo: <a target="_blank" href="https://github.com/kurokood/business_intelligence_app">https://github.com/kurokood/business_intelligence_app</a></p>
]]></content:encoded></item><item><title><![CDATA[AI-Powered Meeting Management Chatbot with Amazon Lex V2]]></title><description><![CDATA[In today's fast-paced business environment, managing meetings efficiently has become more critical than ever. Traditional scheduling systems often require multiple steps, complex interfaces, and significant manual intervention. What if we could simpl...]]></description><link>https://blog.monvillarin.com/ai-powered-meeting-management-chatbot-with-amazon-lex-v2</link><guid isPermaLink="true">https://blog.monvillarin.com/ai-powered-meeting-management-chatbot-with-amazon-lex-v2</guid><dc:creator><![CDATA[Mon Villarin]]></dc:creator><pubDate>Thu, 31 Jul 2025 07:54:18 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1753946081325/180e2458-a29e-4987-8def-ca2df558fdde.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In today's fast-paced business environment, managing meetings efficiently has become more critical than ever. Traditional scheduling systems often require multiple steps, complex interfaces, and significant manual intervention. What if we could simplify this process using conversational AI? This is exactly what we set out to achieve with Meety, a comprehensive meeting management application that combines the power of Amazon Lex V2 with a modern serverless architecture.</p>
<p>Meety represents a new approach to meeting management, where users can schedule meetings through natural language conversations while administrators maintain full control through a dedicated web interface. Built entirely on AWS serverless technologies, the application demonstrates how modern cloud services can create intuitive, scalable, and cost-effective solutions for everyday business challenges.</p>
<h2 id="heading-the-vision-behind-meety">The Vision Behind Meety</h2>
<p>The inspiration for Meety came from observing the friction in traditional meeting scheduling processes. Users typically need to navigate through multiple calendar interfaces, send numerous emails, and coordinate across different platforms. We envisioned a system where scheduling a meeting could be as simple as having a conversation with an intelligent assistant.</p>
<p>Our goal was to create an application that would serve two distinct user groups: end users who want to schedule meetings effortlessly through natural language, and administrators who need comprehensive tools to manage and approve these meeting requests. This dual-purpose approach required careful architectural planning to ensure both user experiences remained optimal while sharing the same underlying data and infrastructure.</p>
<h2 id="heading-architectural-philosophy-serverless-first">Architectural Philosophy: Serverless First</h2>
<p>From the project's inception, we committed to a serverless-first architecture. This decision was driven by several key factors: cost efficiency, automatic scaling, reduced operational overhead, and the ability to focus on business logic rather than infrastructure management. Every component in Meety leverages managed AWS services, eliminating the need for server provisioning, patching, or capacity planning.</p>
<p>The serverless approach also aligned perfectly with the application's usage patterns. Meeting scheduling typically involves sporadic bursts of activity rather than consistent load, making serverless computing an ideal fit. Users might schedule multiple meetings in the morning and then not interact with the system for hours, a pattern that serverless architectures handle exceptionally well.</p>
<h2 id="heading-core-technologies-and-services">Core Technologies and Services</h2>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1753947854892/d8cfee3a-707b-464e-8939-127ff778ccc7.png" alt class="image--center mx-auto" /></p>
<h3 id="heading-amazon-lex-v2-the-conversational-brain">Amazon Lex V2: The Conversational Brain</h3>
<p>At the heart of Meety lies Amazon Lex V2, AWS's advanced conversational AI service. Unlike traditional form-based interfaces, Lex V2 enables users to schedule meetings through natural language conversations. The service handles intent recognition, slot filling, and conversation flow management, creating an intuitive user experience that feels remarkably human-like.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1753948281054/2e53f0d7-26ef-4683-9cd2-653b547955c7.png" alt class="image--center mx-auto" /></p>
<p>We configured Lex V2 with three primary intents: StartMeety for initial greetings, MeetingAssistant for the core scheduling functionality, and FallbackIntent for handling unexpected inputs. The MeetingAssistant intent includes six carefully designed slots that collect essential meeting information: attendee name, meeting date, time, duration, email address, and final confirmation. This slot-based approach ensures all necessary information is gathered while maintaining conversational flow.</p>
<h3 id="heading-amazon-cognito-dual-authentication-strategy">Amazon Cognito: Dual Authentication Strategy</h3>
<p>Authentication in Meety employs a sophisticated dual-mode approach using Amazon Cognito. The system supports both anonymous access for chatbot interactions and authenticated access for administrative functions. This design decision ensures that anyone can schedule meetings without barriers while maintaining security for sensitive administrative operations.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1753948321194/5ed30f8e-7437-409c-bb42-5af6353e8e58.png" alt class="image--center mx-auto" /></p>
<p>The Cognito Identity Pool provides temporary AWS credentials for both authenticated and unauthenticated users, with carefully crafted IAM policies that grant appropriate permissions for each user type. Anonymous users can interact with Lex V2 directly, while authenticated administrators gain access to additional API endpoints for meeting management.</p>
<h3 id="heading-aws-lambda-serverless-computing-power">AWS Lambda: Serverless Computing Power</h3>
<p>Four distinct Lambda functions power Meety's backend operations. The generative-lex-fulfillment function serves as the primary fulfillment handler for Lex V2, processing meeting scheduling requests and storing data in DynamoDB. Three additional functions handle administrative operations: get-meetings retrieves approved meetings within specified date ranges, get-pending-meetings fetches meetings awaiting approval, and change-meeting-status enables administrators to approve or reject meeting requests.</p>
<p>Each Lambda function is optimized for its specific purpose, with tailored IAM roles that follow the principle of least privilege. The functions are written in Python 3.12, leveraging the boto3 SDK for AWS service interactions and implementing comprehensive error handling and logging.</p>
<h3 id="heading-amazon-dynamodb-flexible-data-storage">Amazon DynamoDB: Flexible Data Storage</h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1753948235555/d2ce4b5c-03ed-4cfd-915c-18c6ebee120d.png" alt class="image--center mx-auto" /></p>
<p>Meeting data is stored in Amazon DynamoDB, chosen for its serverless nature, automatic scaling capabilities, and flexible schema design. The database uses a single table design with a Global Secondary Index on the status field, enabling efficient queries for both individual meetings and status-based filtering.</p>
<p>The DynamoDB table stores comprehensive meeting information including unique meeting IDs, attendee details, scheduling information, current status, and creation timestamps. This design supports both the conversational interface's need for quick data insertion and the administrative interface's requirements for complex queries and status updates.</p>
<h3 id="heading-amazon-s3-and-cloudfront-global-content-delivery">Amazon S3 and CloudFront: Global Content Delivery</h3>
<p>The frontend application is hosted on Amazon S3 with CloudFront distribution, providing global content delivery with minimal latency. This combination offers several advantages: automatic scaling to handle traffic spikes, built-in security features, and cost-effective hosting for static content.</p>
<p>CloudFront's integration with AWS Certificate Manager enables HTTPS encryption across all communications, while Origin Access Control ensures that S3 content is only accessible through the CloudFront distribution, enhancing security and performance.</p>
<h2 id="heading-direct-lex-integration-a-technical-innovation">Direct Lex Integration: A Technical Innovation</h2>
<p>One of Meety's most significant technical innovations is the direct integration between the frontend and Amazon Lex V2. Rather than routing chatbot interactions through API Gateway and Lambda functions, the frontend communicates directly with Lex using the AWS SDK. This approach offers several compelling advantages.</p>
<p>First, it eliminates unnecessary network hops, reducing latency and improving user experience. Second, it simplifies the architecture by removing intermediate components that would otherwise require maintenance and monitoring. Third, it reduces costs by eliminating API Gateway charges for chatbot interactions, which can be substantial in high-volume scenarios.</p>
<p>The direct integration required careful consideration of authentication and security. We implemented this using Cognito Identity Pool's unauthenticated role, which provides temporary AWS credentials with permissions limited to Lex interactions. This approach maintains security while enabling seamless user experiences.</p>
<h2 id="heading-infrastructure-as-code-with-terraform">Infrastructure as Code with Terraform</h2>
<p>Meety's entire infrastructure is defined using Terraform, embodying infrastructure as code principles. This approach provides several critical benefits: version control for infrastructure changes, reproducible deployments across environments, and the ability to tear down and recreate the entire stack when needed.</p>
<p>The Terraform configuration is organized into logical modules covering different aspects of the system: API Gateway and Lambda functions, Cognito authentication, DynamoDB storage, Lex bot configuration, and frontend hosting. This modular approach makes the infrastructure maintainable and allows for independent updates to different system components.</p>
<p>Environment-specific configurations are managed through Terraform variables, with sensitive values externalized to terraform.tfvars files that are excluded from version control. This pattern enables secure deployment across multiple environments while maintaining configuration flexibility.</p>
<h2 id="heading-automated-deployment-pipeline">Automated Deployment Pipeline</h2>
<p>Recognizing that complex multi-service applications can be challenging to deploy, we created a comprehensive automated deployment pipeline. The master deployment script orchestrates the entire process: building Lambda deployment packages, applying Terraform configurations, configuring Lex intents and slots, creating bot aliases, updating frontend configurations with actual resource IDs, and deploying static assets to S3.</p>
<p>This automation eliminates deployment complexity and reduces the potential for human error. A complete deployment, from empty AWS account to fully functional application, takes approximately 5-10 minutes and requires only a single command execution.</p>
<h2 id="heading-security-considerations-and-best-practices">Security Considerations and Best Practices</h2>
<p>Security was a primary consideration throughout Meety's development. The application implements multiple layers of security controls: IAM roles with minimal required permissions, JWT-based authentication for administrative functions, HTTPS encryption for all communications, and secure credential management through Cognito Identity Pools.</p>
<p>All Lambda functions include comprehensive input validation and error handling to prevent injection attacks and ensure graceful failure modes. DynamoDB access is restricted through IAM policies that limit operations to specific tables and indexes. The frontend implements Content Security Policy headers and other security best practices to protect against common web vulnerabilities.</p>
<h2 id="heading-performance-optimization-and-scalability">Performance Optimization and Scalability</h2>
<p>Meety's serverless architecture provides inherent scalability advantages, but we implemented additional optimizations to ensure optimal performance. Lambda functions are configured with appropriate memory allocations based on their computational requirements. DynamoDB uses on-demand billing mode, automatically scaling read and write capacity based on actual usage patterns.</p>
<p>The frontend leverages CloudFront's global edge network for content delivery, with appropriate caching headers to minimize origin requests. Static assets are optimized for size and compressed using modern compression algorithms. The direct Lex integration eliminates unnecessary API calls, reducing both latency and costs.</p>
<h2 id="heading-lessons-learned-and-future-enhancements">Lessons Learned and Future Enhancements</h2>
<p>Building Meety provided valuable insights into serverless application development and conversational AI implementation. We learned the importance of careful slot design in Lex conversations, the benefits of direct service integration where appropriate, and the value of comprehensive automation in complex deployments.</p>
<p>Future enhancements could include integration with external calendar systems, email notifications for meeting confirmations, support for recurring meetings, and advanced analytics for meeting patterns. The serverless architecture provides a solid foundation for these additions without requiring fundamental changes to the existing system.</p>
<h2 id="heading-conclusion">Conclusion</h2>
<p>Meety demonstrates the power of combining conversational AI with modern serverless architectures to solve real-world business problems. By leveraging AWS's managed services and implementing thoughtful architectural patterns, we created a system that is both user-friendly and technically robust.</p>
<p>The project showcases how serverless technologies can reduce operational complexity while providing enterprise-grade scalability and security. The direct Lex integration pattern, comprehensive automation, and infrastructure as code approach provide a blueprint for similar applications.</p>
<p>Most importantly, Meety proves that sophisticated AI-powered applications are within reach of development teams willing to embrace cloud-native architectures and modern development practices. The combination of natural language processing, serverless computing, and thoughtful user experience design creates possibilities for reimagining how we interact with business applications.</p>
<p>As organizations continue to seek more intuitive and efficient ways to manage their operations, applications like Meety point toward a future where conversational interfaces become the norm rather than the exception. The serverless foundation ensures these applications can scale to meet growing demands while maintaining cost efficiency and operational simplicity.</p>
<p>LinkedIn: <a target="_blank" href="https://www.linkedin.com/in/ramon-villarin/"><strong>https://www.linkedin.com/in/ramon-villarin/</strong></a></p>
<p>Portfolio Site: <a target="_blank" href="https://monvillarin.com/">MonVillarin.com</a></p>
<p>GitHub Project Repo: <a target="_blank" href="https://github.com/kurokood/chatbot-with-amazon-lex">https://github.com/kurokood/chatbot-with-amazon-lex</a></p>
]]></content:encoded></item><item><title><![CDATA[Serverless Recipe Sharing App with AWS Cognito and Terraform]]></title><description><![CDATA[Hello! Welcome to my new blog post. As I continue to grow in my journey toward becoming a cloud engineer, I’m excited to share the projects I’ve been building and the lessons I’m learning along the way. Transitioning into cloud engineering has been b...]]></description><link>https://blog.monvillarin.com/serverless-recipe-sharing-app-with-aws-cognito-and-terraform</link><guid isPermaLink="true">https://blog.monvillarin.com/serverless-recipe-sharing-app-with-aws-cognito-and-terraform</guid><dc:creator><![CDATA[Mon Villarin]]></dc:creator><pubDate>Wed, 09 Jul 2025 16:02:23 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1752053493630/0a587da3-4a15-45db-a5b7-71def0e9060d.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Hello! Welcome to my new blog post. As I continue to grow in my journey toward becoming a cloud engineer, I’m excited to share the projects I’ve been building and the lessons I’m learning along the way. Transitioning into cloud engineering has been both challenging and rewarding — and one of the best ways I’ve found to truly understand the cloud is by building real-world applications using AWS.</p>
<p>In this post, I’ll walk you through a recent project I built from the ground up: a <strong>serverless recipe sharing app</strong> powered by a suite of AWS services. This project not only helped me strengthen my skills in designing cloud-native architectures, but also gave me hands-on experience with essential tools like <strong>Amazon Cognito</strong>, <strong>API Gateway</strong>, <strong>Lambda</strong>, <strong>DynamoDB</strong>, and more.</p>
<p>Whether you’re a fellow learner, a cloud enthusiast, or someone curious about serverless development, I hope this inspires you to build, break, and grow with every line of infrastructure you write.</p>
<h2 id="heading-the-concept">The Concept</h2>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1752058759822/0de78cb0-9b2e-49a3-ba23-8bd67b8c0999.png" alt class="image--center mx-auto" /></p>
<p>The architecture of this application follows a typical AWS serverless design, leveraging fully managed services to ensure scalability, performance, and minimal operational overhead. The infrastructure includes key AWS resources such as <strong>Amazon Route 53</strong>, <strong>CloudFront</strong>, <strong>Amazon S3</strong>, along with services like <strong>Amazon API Gateway</strong>, <strong>AWS Lambda</strong>, <strong>Amazon Cognito</strong>, and <strong>Amazon DynamoDB</strong>.</p>
<p>Here’s how the system works from end to end:</p>
<ul>
<li><p><strong>User requests are first resolved by Route 53</strong>, which handles DNS routing. These requests are directed to <strong>Amazon CloudFront</strong>, a global content delivery network that serves the frontend assets stored in <strong>Amazon S3</strong>. This setup ensures that users experience low-latency access regardless of their location.</p>
</li>
<li><p>Once the frontend is loaded, users can <strong>browse and search for recipes</strong> without authentication. When a user performs a search, the frontend sends a request to a <strong>public API Gateway endpoint</strong>, which forwards the request to an <strong>AWS Lambda function</strong>. The function then <strong>queries DynamoDB</strong> for matching recipe data and returns the results to the user interface.</p>
</li>
<li><p>To <strong>share a recipe</strong>, users must first authenticate via <strong>Amazon Cognito</strong>, which handles user sign-up, sign-in, and secure token issuance. Once authenticated, users gain access to the admin interface where they can submit their recipes.</p>
</li>
<li><p>When a recipe is submitted, the data is sent through an <strong>authenticated API Gateway endpoint</strong>, which invokes another <strong>Lambda function</strong> responsible for <strong>validating and writing the data to DynamoDB</strong>.</p>
</li>
</ul>
<p>This serverless design is not only efficient and cost-effective, but also secure and highly scalable. By offloading infrastructure management to AWS, the application can automatically scale with demand, maintain low latency, and ensure user data is protected through built-in security services.</p>
<h2 id="heading-how-i-developed-the-frontend">How I Developed the Frontend</h2>
<p>I typically build my projects starting from the frontend and work my way to the backend, following the architecture outlined in the diagram. As mentioned earlier, the core functionality of this project involves users making <strong>read and write requests</strong> to and from the database. Since a static HTML site alone cannot directly interact with a backend service like DynamoDB, there's no alternative but to use <strong>JavaScript</strong> to bridge the gap between the frontend and the database.</p>
<h3 id="heading-user-interface">User Interface</h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1752075547300/0dd857a5-772d-418f-ad35-95344e010457.png" alt class="image--center mx-auto" /></p>
<p>To achieve this, I chose to use <strong>React.js</strong> — a popular <strong>JavaScript library</strong> for building user interfaces, particularly well-suited for <strong>Single Page Applications (SPAs)</strong>. React allows for dynamic data handling, seamless routing, and efficient UI updates, all within a single HTML page. Additionally, since I wanted to avoid managing multiple static HTML files for different views or pages, React provided the flexibility and scalability I needed to build a modern, maintainable frontend.</p>
<p>The hard part is over! At least for me, because I’m not a coder myself.</p>
<h3 id="heading-s3">S3</h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1752075989748/3f8d3ae8-a591-4867-bcfa-8c1a292ef02f.png" alt class="image--center mx-auto" /></p>
<p>After thoroughly <strong>scrutinizing the design and functionality</strong> of the user interface, the next step is to create an <strong>Amazon S3 bucket</strong> to host the frontend. Creating an S3 bucket is a straightforward process: simply provide a unique name for the bucket and click “Create Bucket.”</p>
<p>Once the bucket is created, it's important to <strong>enable "Static Website Hosting"</strong> in the bucket properties. This setting allows the bucket to serve your static frontend assets (HTML, CSS, JavaScript) over HTTP, making your application accessible via the web.</p>
<h3 id="heading-cloudfront">CloudFront</h3>
<p>Next on the list is <strong>Amazon CloudFront</strong>. CloudFront is configured to <strong>route traffic to the S3 bucket</strong> created earlier, serving as a content delivery network (CDN) to distribute static assets with low latency and high availability.</p>
<p>To set this up, I created a <strong>CloudFront distribution</strong> and specified the S3 bucket as the <strong>origin domain</strong>. Additionally, I configured <strong>Origin Access Control (OAC)</strong> to securely restrict access to the S3 bucket, ensuring that content can only be served through CloudFront.</p>
<p>For enhanced security and a seamless user experience, I associated the distribution with a <strong>custom SSL certificate</strong> issued by <strong>AWS Certificate Manager (ACM)</strong>. This enables HTTPS support for a custom domain, ensuring encrypted communication between users and the CDN.</p>
<h3 id="heading-route-53">Route 53</h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1752076065804/033d0686-e9c5-4ad4-bc33-3b90a6018587.png" alt class="image--center mx-auto" /></p>
<p>Since I already have a custom domain hosted in <strong>Amazon Route 53</strong> (<a target="_blank" href="https://monvillarin.com">monvillarin.com</a>), I created a <strong>subdomain</strong> under the same hosted zone: <code>recipe.monvillarin.com</code>. To route traffic from this subdomain to my CloudFront distribution, I added an <strong>A record (alias)</strong> in Route 53.</p>
<p>This A record points directly to the CloudFront distribution, enabling users to access the application through a clean, custom URL while benefiting from CloudFront’s performance and security features.</p>
<h3 id="heading-amazon-cognito">Amazon Cognito</h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1752075347555/86de1751-d21a-4c2f-b473-fb45b8272714.png" alt class="image--center mx-auto" /></p>
<p><strong>Amazon Cognito</strong> is a fully managed authentication and authorization service that allows users to <strong>sign up and sign in</strong> using a <strong>username and password</strong>, or via <strong>federated identity providers</strong> such as <strong>Google</strong>, <strong>Facebook</strong>, or enterprise SAML providers.</p>
<p>From a technical perspective, when a user successfully signs in, <strong>Cognito authenticates the credentials against the User Pool</strong> and returns a set of <strong>JSON Web Tokens (JWTs)</strong> — including an <strong>ID token</strong>, <strong>access token</strong>, and <strong>refresh token</strong>. These tokens serve as proof of authentication.</p>
<p>The frontend application then includes the <strong>access token</strong> in the authorization header when making requests to <strong>Amazon API Gateway</strong>. API Gateway uses a <strong>Cognito Authorizer</strong> to validate the token. If the token is valid and not expired, the request is forwarded to the <strong>AWS Lambda</strong> function, which then processes the logic and writes the data to <strong>Amazon DynamoDB</strong>.</p>
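<p>The shape of that authenticated call can be sketched as follows (the endpoint URL and payload are made up for illustration; the real frontend does the same thing with the browser's <code>fetch</code> API):</p>

```python
import json
import urllib.request

def build_authed_request(url, access_token, payload):
    """Build a POST request carrying the Cognito access token; API Gateway's
    Cognito authorizer validates this header before invoking the Lambda."""
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode(),
        headers={"Authorization": access_token, "Content-Type": "application/json"},
        method="POST",
    )
```

<p>If the token is missing, expired, or fails signature validation, API Gateway rejects the request with a 401 before any backend code runs.</p>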
<h2 id="heading-and-here-comes-the-backend">And Here Comes the Backend</h2>
<h3 id="heading-api-gateway-and-lambda-functions">API Gateway and Lambda Functions</h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1752076144412/e7c8c019-e0fe-4a67-8e8f-2f5f74cddd9d.png" alt class="image--center mx-auto" /></p>
<p>The <strong>Recipe Sharing App</strong> is configured with a single <strong>API Gateway</strong> that defines <strong>six distinct routes</strong>, each serving a specific purpose within the application. Every route is associated with its own <strong>HTTP method</strong> and is integrated with a corresponding <strong>AWS Lambda function</strong> to handle the request logic.</p>
<p>For example, the route with the path <code>/create-recipes</code> and the HTTP method <code>POST</code> is connected to a Lambda function responsible for <strong>writing recipe data to Amazon DynamoDB</strong>. Similarly, other routes are designed to handle tasks such as retrieving recipes, updating entries, deleting records, and more — each mapped to its respective Lambda function for modular and maintainable backend logic.</p>
<h3 id="heading-dynamodb-table">DynamoDB Table</h3>
<p>The final component in the architecture diagram is <strong>Amazon DynamoDB</strong>, which is responsible for <strong>handling and storing application data</strong>, such as user-created recipes. DynamoDB is a <strong>fully managed NoSQL database</strong> known for its <strong>scalability</strong>, <strong>high availability</strong>, and <strong>low-latency data access</strong>. These characteristics make it particularly well-suited for <strong>serverless architectures</strong>, where fast, reliable, and elastic data storage is essential.</p>
<h2 id="heading-final-thoughts">Final Thoughts</h2>
<p>And that’s how I built my <strong>Recipe Sharing App</strong> — starting from designing the architecture diagram, writing <strong>Infrastructure as Code (IaC)</strong> configurations (yes, you heard it right — I’m <em>not</em> a ClickOps person), and finally provisioning the entire infrastructure.</p>
<p>Writing IaC with tools like <strong>Terraform</strong> is never as easy as it seems, even for small projects like this one. We may be living in the age of AI-assisted coding, but I firmly believe that <strong>every good developer must understand the logic behind their code — not just what it does, but why it works</strong>.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1752076524661/bca8fac2-4c12-4ccd-8bc4-0758f45c5891.png" alt class="image--center mx-auto" /></p>
<p>I honestly lost count of how many times I ran <code>terraform apply</code> and <code>terraform destroy</code>. Every failed deployment felt like a nudge to dig deeper — tweak the code, test again, and repeat — until I achieved the outcome I was aiming for. This cycle of trial and improvement is exactly where <strong>perseverance</strong> comes in — a quality that every aspiring cloud engineer should embrace.</p>
<p>LinkedIn: <a target="_blank" href="https://www.linkedin.com/in/ramon-villarin/">https://www.linkedin.com/in/ramon-villarin/</a></p>
<p>Portfolio Site: <a target="_blank" href="https://monvillarin.com/"><strong>MonVillarin.com</strong></a></p>
<p>GitHub Project Repo: <a target="_blank" href="https://github.com/kurokood/recipe_sharing_app/tree/v2">https://github.com/kurokood/recipe_sharing_app/tree/v2</a></p>
]]></content:encoded></item><item><title><![CDATA[From Resume to the Cloud: How I Built and Deployed My Cloud Resume Challenge]]></title><description><![CDATA[Hey there!
I’m Mon Villarin, a Full-Stack Administrator with around twelve years of full-stack system and application administration experience under my belt. I have limited hands-on experience with coding or scripting, but I’m comfortable reading an...]]></description><link>https://blog.monvillarin.com/from-resume-to-the-cloud-how-i-built-and-deployed-my-cloud-resume-challenge</link><guid isPermaLink="true">https://blog.monvillarin.com/from-resume-to-the-cloud-how-i-built-and-deployed-my-cloud-resume-challenge</guid><dc:creator><![CDATA[Mon Villarin]]></dc:creator><pubDate>Sun, 29 Jun 2025 13:25:30 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1751894232451/7aaf2ad6-b297-4838-809a-67b73daca433.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h3 id="heading-hey-there"><strong>Hey there!</strong></h3>
<p>I’m <a target="_blank" href="https://www.linkedin.com/in/ramon-villarin/"><strong>Mon Villarin</strong></a>, a Full-Stack Administrator with around twelve years of full-stack system and application administration experience under my belt. I have limited hands-on experience with coding or scripting, but I’m comfortable reading and understanding code in languages like JavaScript and Python. I can follow what a script is doing and grasp its overall logic and purpose, even if I’m not yet writing complex code myself.</p>
<p>Lately, I’ve been feeling the urge to push my skills further and dive deeper into the world of cloud computing—and that’s when I discovered the <a target="_blank" href="https://cloudresumechallenge.dev/"><strong>Cloud Resume Challenge</strong>.</a></p>
<p>I know I missed the original deadline for the Cloud Resume Challenge (July 31, 2020, was the cutoff for code reviews), but I still wanted to take it on. Even though I’ve been focused on preparing for my AWS certifications, I saw this as a valuable opportunity to apply what I’ve learned and push myself further.</p>
<p>Originally created by Forrest Brazeal, this challenge offers a fun and practical way to explore cloud technologies. But it’s more than just putting your resume online. It’s about designing and deploying a fully cloud-native resume site using AWS. The experience is hands-on, challenging, and surprisingly rewarding. Think of it as a tech-packed upgrade to your resume—and your cloud skills.</p>
<h3 id="heading-so-whats-it-all-about"><strong>So, What’s It All About?</strong></h3>
<p>For anyone unfamiliar, the Cloud Resume Challenge is all about creating and hosting your resume using serverless architecture on a cloud platform.</p>
<p>The challenge is thoughtfully structured—and if you haven’t already, I highly suggest checking out the official challenge guidebook. It breaks the entire project down into manageable parts, or “chunks,” making the journey both organized and achievable.</p>
<ul>
<li><p>Chunk 0. Certification Prep</p>
</li>
<li><p>Chunk 1. Building the Front-end</p>
</li>
<li><p>Chunk 2. Building the API</p>
</li>
<li><p>Chunk 3. Front-end / Back-end Integration</p>
</li>
<li><p>Chunk 4. Automation (IaC, CI/CD)</p>
</li>
</ul>
<p>It’s a hands-on mix of coding, cloud services, and just enough of a challenge to keep it interesting.</p>
<h3 id="heading-how-i-brought-it-all-together"><strong>How I Brought It All Together</strong></h3>
<p><strong>Chunk 0: Certification Prep</strong></p>
<p>As someone completely new to the AWS ecosystem, I began my journey by earning the AWS Certified Cloud Practitioner certification in January 2025. Building on that foundation, I went on to achieve the AWS Certified Solutions Architect – Associate in April 2025, along with the HashiCorp Certified: Terraform Associate certification in May 2025. With these credentials under my belt, I felt well-prepared to take on the Cloud Resume Challenge.</p>
<p><strong>Chunk 1: Building the Front-end</strong></p>
<p><strong>Frontend - HTML / CSS</strong></p>
<p>I started with a responsive template from <a target="_blank" href="https://html5up.net/"><strong>html5up</strong></a>, which I customized by removing unnecessary pages and links to keep the design as clean and minimal as possible. My early experience with HTML and CSS, along with a bit of help from Google, made it easier to tweak the layout to fit my needs. I also embedded a JavaScript snippet into the HTML page to fetch and update the visitor counter (more of this in Chunk 2) from the back-end service.</p>
<p><strong>Hosting on AWS S3</strong></p>
<p>Thanks to the knowledge I gained while preparing for my AWS certifications, setting up a <strong>static website on Amazon S3</strong> and integrating it with <strong>CloudFront</strong> for content delivery was straightforward. I registered a custom domain through <strong>AWS Route 53</strong> and configured it to point to my CloudFront distribution. To secure the site with HTTPS, I used <strong>AWS Certificate Manager (ACM)</strong> to provision an SSL certificate.</p>
<p><strong>Chunk 2: Building the API</strong></p>
<p><strong>Backend Infrastructure</strong></p>
<p>To handle the logic for updating and retrieving the visitor count, I needed to set up a simple back-end using AWS services. This included API Gateway, AWS Lambda, and DynamoDB.</p>
<ul>
<li><p><strong>DynamoDB:</strong> Amazon DynamoDB is AWS’s fully managed, high-performance NoSQL database service that scales seamlessly. I created a DynamoDB table with a single item to store the visitor count. To increment this value, I used DynamoDB’s Atomic Counter feature—a numeric attribute that can be updated concurrently without conflict.</p>
</li>
<li><p><strong>AWS Lambda:</strong> Allows you to run code without managing servers. I wrote a Python-based Lambda function that interacts with DynamoDB to both retrieve and update the visitor count using the <code>update_item</code> operation.</p>
<p>  Since I have limited experience writing Python scripts, I referenced open-source implementations from GitHub to guide my function’s structure and logic.</p>
</li>
<li><p><strong>API Gateway:</strong> Enables you to create and manage RESTful APIs that act as a bridge between frontend applications and backend services such as Lambda.</p>
<p>  In this setup, API Gateway exposes a REST API endpoint that the JavaScript snippet embedded in the front-end HTML calls each time the page is loaded. This request triggers the Lambda function, which in turn updates and returns the visitor count from DynamoDB.</p>
<p>  To ensure this works from the browser, I had to enable CORS (Cross-Origin Resource Sharing) on the API Gateway resource. Without it, the client-side script wouldn’t be able to fetch data from the API endpoint.</p>
</li>
</ul>
<p><strong>Chunk 3: Front-end / Back-end Integration</strong></p>
<p>Initially, I built the back-end components—DynamoDB, Lambda, and API Gateway—individually through the AWS Management Console, manually configuring them to work together and serve the visitor count to the front-end HTML page.</p>
<p>However, one of the Cloud Resume Challenge requirements was to define these resources using Infrastructure as Code (IaC) with AWS SAM (Serverless Application Model). Since I had no prior experience with SAM and personally prefer Terraform over other IaC tools, I did not pursue that route at the time.</p>
<p>As the next step in improving the project, my goal is to transition everything to Terraform—including the front-end resources such as the S3 bucket, CloudFront distribution, and Route 53 DNS records—for a fully automated and reproducible deployment.</p>
<p>Below is the architecture diagram that illustrates all AWS resources I provisioned using <strong>Terraform</strong> as part of the Cloud Resume Challenge:</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1751894244143/fbb72b65-1f4a-42a0-81f5-ee4f04204271.png" alt class="image--center mx-auto" /></p>
<p>You can explore the full source code and configuration for my Cloud Resume Challenge project on GitHub: <a target="_blank" href="https://github.com/kurokood/cloud_resume_challenge">Cloud Resume Challenge Repo</a></p>
<p>It includes everything—from the front-end website code to the back-end infrastructure scripts managed through Terraform. Feel free to check it out, fork it, or use it as inspiration for your own challenge!</p>
<p><strong>Chunk 4: Automation (IaC, CI/CD)</strong></p>
<p><strong>Front-end - CI/CD with GitHub Actions</strong></p>
<p>One of the key requirements of the Cloud Resume Challenge was to store both the front-end and back-end code in GitHub repositories, and to implement Continuous Integration and Deployment (CI/CD) using GitHub Actions.</p>
<p>Since I had never used GitHub Actions before, this was a valuable learning experience. GitHub Actions made it easier to automate the build and deployment processes, reducing the need for manual updates.</p>
<p>For the front-end pipeline, I configured GitHub Actions to:</p>
<ul>
<li><p>Authenticate with AWS using GitHub Secrets (to securely store AWS access keys)</p>
</li>
<li><p>Deploy updated HTML, CSS, JavaScript, and image files to the S3 bucket</p>
</li>
<li><p>Invalidate the CloudFront distribution to ensure the latest content is served to users</p>
</li>
</ul>
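<p>A minimal workflow covering those three steps might look like the following. The job name, local paths, bucket name, and secret names are placeholders (my actual workflow file is in the repository), but the shape of the pipeline is the same:</p>
<pre><code># .github/workflows/deploy-frontend.yml — illustrative sketch
name: deploy-frontend
on:
  push:
    branches: [main]

jobs:
  deploy:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4

      # Authenticate with AWS using keys stored in GitHub Secrets
      - uses: aws-actions/configure-aws-credentials@v4
        with:
          aws-access-key-id: ${{ secrets.AWS_ACCESS_KEY_ID }}
          aws-secret-access-key: ${{ secrets.AWS_SECRET_ACCESS_KEY }}
          aws-region: us-east-1

      # Upload the site files to S3, removing anything deleted locally
      - run: aws s3 sync ./website s3://my-resume-bucket --delete

      # Invalidate the CloudFront cache so visitors get the new content
      - run: >
          aws cloudfront create-invalidation
          --distribution-id ${{ secrets.CLOUDFRONT_DISTRIBUTION_ID }}
          --paths "/*"
</code></pre>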
<p>This setup helped me gain hands-on experience with CI/CD workflows, while ensuring smooth, automated deployment of front-end changes directly from GitHub.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1751202530580/52047901-1172-4afb-af32-06d99cf61f2d.png" alt="Front-end CI/CD pipeline with GitHub Actions" class="image--center mx-auto" /></p>
<p><strong>Back-end: IaC for Resource Deployment</strong></p>
<p>I chose to keep both the front-end and back-end code in a single GitHub repository. This repository contains everything needed for the project, including the Terraform configuration files, the Lambda function code, and the JavaScript used in the front end.</p>
<p>To support deployment from GitHub Actions, I also created the necessary Terraform configurations to manage AWS credentials securely and automate the provisioning of infrastructure components. This unified setup helped streamline development, version control, and CI/CD workflows within a single codebase.</p>
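<p>To give a flavor of the back-end side, here is a hedged sketch of how the visitor-counter resources could be expressed in Terraform. Resource names, the runtime, and the IAM role reference are illustrative assumptions, not my exact configuration:</p>
<pre><code># Illustrative names only — the actual definitions live in the repo.
resource "aws_dynamodb_table" "visitor_count" {
  name         = "visitor-count"
  billing_mode = "PAY_PER_REQUEST"
  hash_key     = "id"

  attribute {
    name = "id"
    type = "S"
  }
}

resource "aws_lambda_function" "counter" {
  function_name = "visitor-counter"
  runtime       = "python3.12"
  handler       = "app.handler"
  filename      = "lambda.zip"
  role          = aws_iam_role.lambda_exec.arn # role definition omitted
}
</code></pre>
<p>Keeping these definitions alongside the front-end code in one repository means a single <code>terraform apply</code> (or a GitHub Actions job running it) can reconcile the whole stack.</p>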
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1751203272302/6ead8602-91a8-4b03-903f-7e6f350fa3a5.png" alt="Back-end infrastructure deployment with Terraform" class="image--center mx-auto" /></p>
<p><strong>What I Learned</strong></p>
<p>Taking on this challenge was a rewarding mix of fun and frustration. Setting up the CI/CD pipeline was particularly tricky, and I faced a few hurdles while working with the Lambda function. But every obstacle turned into a learning opportunity.</p>
<p>Each time I hit a roadblock, I dug deeper, experimented, and picked up something new. By the end of the project, I walked away with a much stronger understanding of AWS services, serverless architecture, and infrastructure as code—and, more importantly, a real boost in confidence with cloud development.</p>
<p>In the end, I successfully built a fully functional, cloud-powered resume website. It’s not overly flashy—but it’s entirely my own, and it represents what I’ve learned and what I’m capable of building with cloud technologies.</p>
<p>LinkedIn: <a target="_blank" href="https://www.linkedin.com/in/ramon-villarin/">https://www.linkedin.com/in/ramon-villarin/</a></p>
<p>Portfolio Site: <a target="_blank" href="https://monvillarin.com/"><strong>MonVillarin.com</strong></a></p>
<p>GitHub Project Repo: <a target="_blank" href="https://github.com/kurokood/cloud_resume_challenge">https://github.com/kurokood/cloud_resume_challenge</a></p>
]]></content:encoded></item></channel></rss>