Pereira, Colombia · Remote

Hector F. Jimenez S.

Senior SRE · 12 yrs in Production · Multi-Cloud AWS · Azure · GCP · Infrastructure at Scale
4°48'N 75°42'W
// |
Scroll
01

About Me

I've spent 12 years making sure production systems don't fall over — migrating datacenters to cloud, building CI/CD pipelines for Disney-scale traffic, and keeping global infrastructure reliable at GoDaddy. I started as a computer lab monitor at university and spent a decade climbing from server administration to SRE, which means I understand infrastructure from the metal up.

I work across AWS, Azure, GCP, Cloudflare, OpenStack, and bare-metal — wherever systems live, I make them reliable, observable, and easier to ship against. Outside the day job, I co-founded the CNCF Pereira chapter and spent nearly a decade co-organizing PereiraJS and Pereira Tech Talks, because I believe strong engineering communities build better engineers.

CURRENT ROLESr Site Reliability Engineer
COMPANYGoDaddy
EXP12 years
FOCUSAWS · K8s · IaC
BASEPereira, CO 🇨🇴
GITHUBh3ct0rjs
LINKEDINh3ct0rjs
STATUS Open to opportunities
02

Experience

07 / 2023 — Now

Sr Site Reliability Engineer L3

GoDaddy — Studio 503 Contractor

  • Manage production AWS and OpenStack accounts across a global multi-region presence supporting thousands of internal users.
  • Own GitHub Enterprise Cloud & On-Prem platform for the entire engineering org — version control, branch protection, security policies, and Actions runners at scale.
  • Built and maintain advanced GitHub Actions CI/CD pipelines reducing manual deployment steps across multiple product teams.
  • Delivered custom Python/TypeScript automation tools integrating AWS services with third-party APIs, improving operational efficiency.
  • Drive AWS Well-Architected Framework reviews to improve reliability, cost, and security posture across production workloads.
  • Provide Artifactory infrastructure reliability guidance across the artifact management lifecycle.
AWSOpenStackPythonTypeScriptGitHub ActionsArtifactory
05 / 2021 — 07 / 2023

CloudOps & DevOps Engineer

Globant Colombia

  • Maintained multi-cloud AWS + Azure infrastructure for Disney Media Entertainment products — Digitalscreeners, Marvel, and TheForce — serving millions of users globally.
  • Delivered Infrastructure as Code resources using Terraform and CloudFormation across 5+ independent product squads.
  • Designed and operated GitLab CI/CD Enterprise pipelines enabling continuous delivery for multiple tech stacks simultaneously.
  • Orchestrated containerized workloads across Kubernetes, EKS, ECS, and ECR — improving deployment reliability and rollback speed.
  • Built iOS/tvOS CI/CD mobile pipelines with Fastlane, accelerating mobile release cycles.
AWSAzureKubernetesEKSGitLab CI/CDTerraform
12 / 2018 — 05 / 2021

Sr IT Infrastructure Engineer

Soluciones Globales SAS

  • Sole infrastructure owner for a fleet of 80–90 servers across multiple OS environments hosted at a regional BTLatam datacenter.
  • Led full PostgreSQL on-premise → AWS RDS migration with zero data loss, reducing maintenance overhead significantly.
  • Designed and managed site-to-site and client-to-site VPN infrastructure using StrongSwan, Fortinet, Cisco, and OpenVPN — supporting core business services across multiple offices.
  • Implemented Linux Security Hardening practices and AWS Well-Architected Framework recommendations across all production workloads.
  • Administered SaaS ecosystem for the entire company: Google Workspace, Office 365, Microsoft Teams, and AWS accounts.
AWSLinuxAnsibleOpenVPNFortinetPostgreSQL
2019 — Now

Chapter Lead & Community Organizer

CNCF Cloud Native Pereira · PereiraJS · Pereira Tech Talks

  • Founded and lead the CNCF Cloud Native Pereira chapter — one of the first CNCF chapters in Colombia.
  • Organized recurring events, workshops, and talks on Kubernetes, observability, and cloud-native practices for a growing regional community.
  • Co-organized PereiraJS (Jan 2016 – Dec 2025, ~10 yrs) and Pereira Tech Talks (Jan 2017 – Dec 2025, ~9 yrs) — local meetups reaching hundreds of developers across Risaralda.
CNCFKubernetesCloud NativePereiraJSCommunity
10 / 2018 — 12 / 2018

DevOps Engineer

Basik Development LLC · Part-time · Remote

  • Implemented git branching strategies and managed server fleets with Ansible in production.
  • Administered Linux/Unix servers across multiple workloads; maintained IP networking, VPNs, DNS, load balancing, and firewalls.
  • Leveraged APM and log management (New Relic) for application health monitoring and performance tuning.
  • Worked alongside development teams to troubleshoot and fix Node.js/PHP healthcare applications.
AnsibleLinuxBashNew RelicDevOps
02 / 2018 — 12 / 2018

Infrastructure Engineer & System Administrator

XeroGroup · Part-time · Pereira, Colombia

  • Tuned Nginx and Varnish Cache web servers; secured PHP web applications and Apache configurations.
  • Automated daily tasks and jobs using Bash/sh under Debian GNU/Linux.
  • Managed cloud infrastructure on DigitalOcean and OVHCloud; supported users on cPanel and WHM.
NginxLinuxBashDigitalOceanOVH
01 / 2018 — 12 / 2018

Sr DevOps & System Administrator

Innoving Innovations Ingenieria SAS · Part-time · Pereira, Colombia

  • Designed multi-tier service architecture for production workloads.
  • Built an HTTP RESTful API service in Node.js for high traffic and concurrent users.
  • Installed observability tools (CachetHQ, New Relic) and deployed SNMP sensors across 10 Debian GNU/Linux servers.
  • Performed corrective software maintenance with Ansible; managed FTP, DNS, DHCP, and proxy configurations fleet-wide.
  • Created network and infrastructure diagrams; designed backup and DR strategies.
Node.jsAnsibleNew RelicLinuxNetworking
01 / 2016 — 11 / 2016

Unix Support Engineer

Upwork · Freelance · Colombia

  • Developed and maintained Bash scripts for automated database backups.
  • Migrated and installed high-performance web servers using Nginx; implemented application monitoring for nginx environments.
  • Installed and updated software across MEAN, ELK, and LAMP stacks on production environments.
  • Managed firewall and network security policies for multiple small companies.
BashNginxLinuxELKNetworking
03

Skills & Stack

Cloud Platforms

AWSAzureGCP CloudflareOpenStackDigitalOceanHetzner

Infrastructure as Code

TerraformOpenTofuCloudFormation AWS CDKAnsiblePulumi

Containers & Orchestration

KubernetesEKSDocker HelmArgoCDECS / ECRPodman

CI/CD & GitOps

GitHub ActionsGitLab CI/CDFluxCD CircleCIApache AirflowArtifactoryFastlane

Observability & SRE

PrometheusGrafanaDatadog OpenTelemetryLokiCloudWatch PagerDutySite24x7BigPandaSLO / SLI

Networking & Security

Cloudflare WAFCloudflare TunnelsEnvoy StrongSwanFortinetOpenVPN Zero TrustmTLSAWS VPC

Languages & Scripting

PythonTypeScriptBash Rust (learning)HCLYAMLLinux / Unix

Data & Messaging

PostgreSQLRedisDynamoDB AWS SQS / SNSS3MySQLSQLiteTurso

Tooling & ITSM

ServiceNowJiraClickUp ConfluenceGitHubSlack
04

Projects

PRJ-001

Disney Media Entertainment MultiCloud

Multi-cloud (AWS + Azure) infrastructure for Digitalscreeners, Marvel, and TheForce teams at Globant. End-to-end IaC, EKS orchestration, and GitLab CI/CD pipelines at Disney scale.

PRJ-002

GitHub Enterprise Platform

Full GitHub Enterprise Cloud & On-Prem management at GoDaddy. Advanced GitHub Actions workflows for CI/CD and automation, integrating AWS and third-party APIs via Python and TypeScript.

PRJ-003

On-Premise → AWS Cloud Migration

Full datacenter migration at Soluciones Globales: 80–90 server fleet, PostgreSQL to AWS RDS, VPN infrastructure with StrongSwan and Fortinet, and multi-service AWS environment setup.

PRJ-004

k8s-workshop

Hands-on Kubernetes workshop with a practical guide for learning Kubernetes fundamentals — used in community talks and training sessions.

PRJ-005

Jenkins HA Multi-Region Cluster

Highly available Jenkins deployment across multiple AWS regions using EKS, Route 53 latency-based routing, and cross-region EFS for shared workspace persistence. Active-active setup with automatic failover and zero-downtime upgrades via Helm.

PRJ-006

Jenkins Global Controller Federation

Federated Jenkins architecture using CloudBees Operations Center to manage regional controllers across US-East, US-West, and EU. Centralized RBAC, shared agent pools via Kubernetes plugin, and unified observability through Prometheus + Grafana.

PRJ-007

Bare Metal Kubernetes with OKD

Self-hosted OpenShift OKD cluster deployed on bare metal. Full provisioning automation covering PXE boot, Ignition configs, and post-install hardening. Includes integrated container registry, internal DNS, and OAuth-based RBAC — zero cloud dependency.

PRJ-008

Bare Metal Kubernetes with Kubespray

Production-grade Kubernetes cluster on bare metal using Kubespray. Fully automated node provisioning with Ansible, Calico CNI for networking, MetalLB for load balancing, and Rook-Ceph for persistent storage — entirely on-premise with no managed control plane.

PRJ-009

HPC Video Processing Cluster on AWS

High-performance video transcoding cluster on AWS with fully automated provisioning and deployment. Leverages EC2 GPU instances, S3 for raw and processed storage, SQS for job queuing, and Auto Scaling to handle burst workloads — end-to-end IaC with Terraform.

05

Certifications

GitHub Actions

GitHub

Valid · Expires Apr 2027

GitHub Foundations

GitHub

Valid · Expires Feb 2027

Organizer — Cloud Native Community Group

The Linux Foundation

Issued Jan 2025

LFC102: Inclusive Open Source Community Orientation

The Linux Foundation

Issued Dec 2024

AWS Partner: Technical Accredited

Amazon Web Services

Issued Jun 2021
In Progress · To Renew

Claude Certified Architect – Foundations

Anthropic

In Progress · 2025

AWS Certified AI Practitioner

Amazon Web Services

In Progress · 2025

AWS Certified Developer – Associate

Amazon Web Services

In Progress · 2025

AWS Certified Solutions Architect – Associate

Amazon Web Services

In Progress · 2025

AWS Certified Cloud Practitioner

Amazon Web Services

To Renew · Expired Dec 2025

HashiCorp Certified: Terraform Associate (002)

HashiCorp

To Renew · Expired Dec 2023
06

Let's Talk

I'm always open to new SRE opportunities, infrastructure challenges, and community collaborations. Let's build something reliable together.