Principal Site Reliability Engineer


Clearwater positions open to candidates located in greater Tampa Bay area.

We are seeking a Principal SRE to join our team as we aim to develop new frameworks that simplify the lives of our team members and our users. This role requires high technical competence in key areas, strong leadership abilities, and excellent communication skills.

You will work with the Sr Director of Engineering to understand and contribute towards the strategic vision. You will lead our SRE team towards building Well Architected and DRY infrastructure solutions using tools such as Terraform & Terragrunt and working primarily with Serverless(Lambda) and Kubernetes (EKS) solutions.

This position will be instrumental in helping us to deliver version 2.0 of the existing infrastructure platform.


  • Ownership and Support of SRE Platform Resources
  • Actively work with business partners to keep our team unblocked and moving forward.
  • Work with engineering leadership to research, plan, and develop Platform and Tooling solutions.
  • Develop quality solutions that set an example for SRE team members.
  • Solidify well-architected designs as IaC modules that can be made accessible through a service catalog.
  • Manage and create AWS Control Tower customizations, such as Terraform Account Factory and IPAM VPC Customizations
  • Work with business partners to establish SLOs and Error Budgets and with the SRE team to determine SLIs.
  • Create CI/CD Workflows and Functions that improve visibility and improve developer happiness.
  • Provide technical expertise to help resolve complex issues.
  • Participate in an on-call rotation and ticketing systems to support our production services
  • Maintain a high level of professionalism and a positive attitude
  • Work with the latest technologies and expand your own skill sets
  • Work with SRE team members to provide mentorship and delegate tasks that help them grow their own skill sets as well as their understanding of best practices.
  • Align roadmap deliverables with print planning and grooming sessions and communicate risk early and often
  • Other applicable job duties as assigned
  • Back up other team members
  • Adhere to company policies 
  • KnowBe4 reserves the right to change the duties and responsibilities at any time 

Minimum Qualifications:

  • AWS Certified Solutions Architect – Professional
  • Associate’s Degree in Computer Science or other experience in a DevOps or SRE role is required
  • Strong understanding of Networking technologies and concepts
  • Expert using Linux, Docker, and Git
  • Strong understanding of Kubernetes and AWS EKS Best Practices
  • Systems: manage, configure and troubleshoot operating system issues, storage (block and object), networking (VPCs, proxies, and CDNs), and administer high-availability clusters running various cloud technologies.
  • Engineering practices: availability, reliability, and scalability, as well as disaster recovery
  • Monitoring and instrumentation: implement metrics in Datadog, Cloudwatch, and Slack/PagerDuty integrations
  • Excellent understanding of continuous delivery and continuous deployment workflows using GitOps methodologies
  • Work in a variety of languages: Shell, Ruby, GoLang, Python
  • Planning: familiarity with agile methodologies; using Jira epics, roadmaps, stories, tasks, and reports
  • Comfortable aligning sprint workloads with Objectives and Key Results
  • High technical aptitude for absorbing new technologies and concepts
  • Working with a mix of local and remote team members to collaborate on solutions.
  • Creativity, initiative, and attention to detail

The base pay for this position ranges from $160,000 – $180,000, which will vary depending on how well an applicant’s skills and experience align with the job description listed above.

To apply, please visit the following URL:→