Resolving Site-to-Site VPN Tunnel Drops with AWS Transit Gateway

Client: Confidential (Upwork Client – Hosting Platform on AWS)
Industry: SaaS Hosting / Cloud Infrastructure

The Challenge

A client running a cloud hosting platform on AWS was facing a persistent VPN connectivity issue. Their architecture relied on a Site-to-Site VPN via AWS Transit Gateway, and one of their key customer tunnels was dropping every hour for several minutes.

This recurring downtime posed a serious threat to platform availability, customer integrations, and overall satisfaction. With limited visibility into the root cause and unsuccessful internal troubleshooting, they turned to us for urgent help.

Our Solution

InfraxDev was brought into the project through Upwork to rapidly identify and resolve the issue. Here’s how we approached it:

  • Audited VPN tunnel configurations, including customer gateway and AWS side
  • Verified Dead Peer Detection (DPD) behavior and Phase 1/2 IPSec rekey settings
  • Analyzed CloudWatch metrics and VPN logs for patterns
  • Cross-checked timing alignment for rekey and DPD timers

🔍 Key Finding: A misalignment between the DPD timeout and rekey intervals on the customer’s equipment was causing the periodic drops.

We documented the misconfiguration clearly, guided the client through the update, and confirmed the successful implementation.

Results & Impact

MetricBeforeAfter
VPN Tunnel StabilityDropped every hour✅ 100% stable for 7+ days
Customer SatisfactionAt risk✅ Fully resolved
Resolution TimeOngoing for weeks✅ Fixed in less than 24 hours

What the Client Said

Why It Matters

In production environments, VPN tunnel instability can break integrations, violate SLAs, and damage trust. At InfraxDev, we bring deep expertise in AWS networking to ensure your systems stay resilient, secure, and seamless — even under pressure.

Need Help with AWS VPNs, Networking, or Transit Gateway?

Let’s talk. Our cloud experts are ready to help you resolve complex AWS issues.