EngageRocket is looking for a Senior DevOps Engineer to take ownership of our cloud infrastructure, reliability, and security foundations as we scale. This is a high-impact, hands-on role where you will be responsible for designing, building, and operating a secure, resilient, and scalable platform.
You will work closely with engineering, product, and leadership to ensure our platform is stable, compliant, and built for growth, while remaining deeply involved in day-to-day technical execution.
Design, build, and operate EngageRocket’s AWS-based infrastructure with a focus on scalability, security, and reliability.
Make and document architecture decisions with a long-term view on performance, cost efficiency, and risk.
Drive infrastructure standardization and automation to reduce operational risk and manual work.
Design and maintain robust CI/CD pipelines that support frequent, low-risk production deployments.
Implement and maintain Infrastructure as Code (IaC) using Terraform and/or AWS CloudFormation across all environments.
Define and implement deployment strategies (blue/green, canary, automated rollbacks).
Introduce and improve reliability practices such as SLOs, SLIs, and operational metrics.
Build and maintain monitoring, logging, and alerting systems (e.g. Prometheus, Grafana, ELK).
Respond to production incidents, perform root-cause analysis, and implement long-term fixes.
Improve operational readiness through runbooks, automation, and environment consistency.
Own platform security controls, including IAM, secrets management, encryption, certificate management, and network security.
Work with stakeholders on security reviews, threat modeling, and risk mitigation.
Support SOC 2 / ISO 27001 compliance efforts through implementation of technical controls and documentation.
Collaborate closely with software engineers to ensure systems are deployable, observable, and operable.
Produce and maintain clear documentation for infrastructure architecture, processes, and operational procedures.
Evaluate and adopt tools and practices that improve platform reliability, security, and efficiency.
Extensive experience in DevOps, SRE, or infrastructure engineering roles within cloud-native environments.
Strong, hands-on experience with AWS services and cloud architecture.
Proven experience designing and operating CI/CD pipelines for production systems.
Strong experience with Infrastructure as Code (Terraform and/or CloudFormation).
Solid scripting and automation skills (Python, Bash, or similar).
Experience with Docker and Kubernetes (or equivalent container orchestration platforms).
Strong understanding of networking fundamentals, load balancing, SSL/TLS, and high-availability architectures.
Hands-on experience with monitoring, logging, and alerting solutions.
Strong troubleshooting and incident resolution skills.
Excellent written and spoken English.
Ability to work effectively in a remote-first, cross-functional environment.
Strong understanding of security fundamentals, including access control, encryption, certificates, and security monitoring.
Designed and operated cloud infrastructure from the ground up.
Experience supporting SOC 2, ISO 27001, or similar compliance frameworks.
Background or strong interest in cybersecurity.
Experience working in a scaling SaaS or startup environment.
You’ll have the opportunity to:
Own and improve platform reliability and security
Work on real production systems with meaningful scale and complexity
Build long-term infrastructure foundations for a growing SaaS product
Operate in a focused, remote-first engineering environment
If you want, I can next:
Tighten this further to filter out mid-level candidates
Adjust tone to be more technical or more outcome-driven
Create screening questions aligned to this role