DevOps Engineer

LockedIn AI
Job Location:
Date Created: 05/19/2026

LockedIn AI is a fast-growing AI-powered career technology company serving over 1 million users worldwide. We build real-time AI systems that assist users during interviews, coding assessments, and professional communication scenarios.

We are seeking a DevOps Engineer to design, build, and maintain the infrastructure systems that power our AI platform at scale. This role is central to ensuring reliability, scalability, and performance across all services, AI endpoints, and data pipelines.

Role Overview

The DevOps Engineer will build and operate the cloud infrastructure, CI/CD pipelines, and observability systems that enable fast, reliable deployments. You will work closely with engineering, AI/ML, and product teams to deliver services to production environments.

This is a hands-on role focused on automation, system reliability, and continuous improvement of platform operations.

Key Responsibilities

Cloud Infrastructure & Systems

  • Design and manage scalable cloud infrastructure using AWS, GCP, or Azure
  • Implement Infrastructure as Code using Terraform, Pulumi, or CloudFormation
  • Maintain and optimize Kubernetes clusters for production workloads
  • Ensure high availability and performance of distributed systems

CI/CD & Deployment Automation

  • Build and maintain CI/CD pipelines using GitHub Actions, GitLab CI, or ArgoCD
  • Implement safe deployment strategies including canary and blue-green releases
  • Automate build, test, and deployment workflows across environments
  • Support AI model deployment and configuration pipelines

Observability & Monitoring

  • Implement monitoring systems using Prometheus, Grafana, Datadog, or the ELK stack
  • Set up alerting that is actionable and low-noise
  • Enable distributed tracing across microservices and AI pipelines
  • Track system health, latency, and performance metrics

Reliability & Incident Management

  • Participate in on-call rotations and lead incident response
  • Perform root cause analysis and post-incident reviews
  • Build automation to reduce operational workload and system downtime
  • Maintain disaster recovery and operational runbooks

Security & Compliance

  • Implement secure infrastructure practices including IAM and secrets management
  • Ensure encryption and audit logging across systems
  • Apply GitOps practices for infrastructure change management
  • Manage vulnerability scanning and patching processes

Required Qualifications

  • 3+ years of experience in DevOps, SRE, or platform engineering
  • Strong experience with Kubernetes and Docker in production environments
  • Proficiency in cloud platforms (AWS, GCP, or Azure)
  • Experience with CI/CD pipelines and infrastructure automation
  • Strong scripting skills (Python, Go, or Bash)
  • Experience with monitoring and observability tools
  • Strong systems thinking and automation mindset

Preferred Qualifications

  • Experience with AI/ML infrastructure or GPU workloads
  • Familiarity with real-time or low-latency systems
  • Experience building MLOps pipelines
  • Knowledge of chaos engineering practices
  • Experience in high-growth startup environments
  • Contributions to open-source DevOps tools
