Everseen: A leader in vision AI solutions for the world’s leading retailers.
The Role
As a DevOps Engineer III, you will be part of the L3 support team for Operations across Edge/on‑prem and cloud, owning complex incidents end‑to‑end: triage, deep‑dive debugging, root‑cause analysis, remediation, and follow‑ups. Strong Linux administration (RHEL primarily, plus Ubuntu) and OpenShift/Kubernetes expertise are essential.
To reduce Operations (Customer Deployment) issues, you will build targeted automations (Python, Bash, Ansible) and automate new and existing SOPs used by Operations.
You will execute safe deployments and upgrades via GitOps and IaC pipelines (Flux, Ansible, Terraform) on AKS and GKE—coordinating validation and rollback plans—and contribute to the maintenance of existing GitLab CI/CD pipelines together with the DevOps engineering teams.
You will design and continuously refine Alertmanager rules and standardize actionable Grafana dashboards with Operations, ensuring effective use of Prometheus metrics and logs (Grafana Alloy, Thanos).
Beyond day‑to‑day operations, you’ll apply deep DevOps, CI/CD, and infrastructure automation expertise, drive best practices, share knowledge through workshops and mentoring, write and maintain documentation and SOPs (Standard Operating Procedures), test infrastructure, and collaborate across teams to optimize systems and workflows.
What You'll Do
- Design and maintain CI/CD pipelines using GitLab CI/CD.
- Implement Infrastructure as Code (IaC) with tools like Terraform.
- Oversee advanced CI/CD pipeline setups, including GitOps with Flux CD.
- Automate complex workflows and enhance infrastructure scalability.
- Troubleshoot and optimize Kubernetes cluster operations.
- Integrate monitoring solutions for observability.
- Write and maintain system operations documentation (articles, diagrams, data flows, etc.) for new and existing applications and services.
- Keep up to date on best practices and new technologies.
- Design, conduct, and execute staging/UAT/production and mass service deployment scenarios.
- Collaborate on technical architecture and system design.
- Collect and analyze data: log files, application stack traces, thread dumps, etc.
- Reproduce and simulate application incidents to create debug reports and coordinate delivery of application fixes.
- Evaluate existing components or systems to determine integration requirements and to ensure the final solutions meet organizational needs.
- Interact with cross-functional management on high-profile technical operations while providing clear feedback and leadership to support teams.
- Author knowledge base articles and drive internal knowledge sharing.
- Work outside regular hours occasionally.
- Work with customers and travel to high-profile international customer or partner locations.
Collaborating With
- Operations (Customer Deployment) teams: Collaborate with the Operations teams to troubleshoot and resolve L3 tickets, and create automations to reduce and optimize workload.
- DevOps Cloud and Edge teams: Work closely with the wider DevOps engineering teams, your manager, developers, and QA engineers to understand requirements, provide technical guidance, and ensure smooth integration and deployment of our product.
- Security Team: Collaborate with the team to ensure the security of our cloud and edge solutions.
Our Tech Stack
At Everseen, you will have the opportunity to work with cutting-edge technology. Our stack includes:
- CI/CD Tools: GitLab CI/CD
- Cloud Platforms: Azure (AKS, Registry), GCP (GKE)
- Edge Platforms: Docker, Podman, Kubernetes (k0s), and OpenShift
- Edge OS: RHEL, Ubuntu
- Automation Tools: Ansible (AWX), Jinja, Terraform
- Deployment Tools: Helm, Flux CD
- Observability: Prometheus, Loki, Grafana Alloy, Grafana dashboards, Thanos
- Databases: Elasticsearch, MongoDB
- Authentication: Keycloak
- Scripting Languages: Python, Bash
Profile and Skills
- Experience: 4+ years in DevOps-related roles with a strong focus on automation.
- Networking: Proficient in DNS, routing, container communication, firewalls, reverse proxying, load balancing, and edge-to-cloud communication and troubleshooting.
- System Administration: Strong system administration skills for deploying Everseen’s containerized Edge application in customer networks and troubleshooting OS-level outages.
- Cloud Expertise: Extensive experience with Azure (or GCP), including fully automated infrastructure and deployment.
- Cloud Cost Management: Experience with monitoring and optimizing cloud costs.
- CI/CD Pipelines: Proven experience in implementing and managing CI/CD pipelines (GitLab CI/CD preferred) and excellent knowledge of Git and associated workflows (e.g., Gitflow).
- Observability: Proven experience with monitoring, logging, and alerting tools and stacks.
- Scripting: Excellent scripting skills in Bash and Python.
- Containerization: Advanced knowledge of Kubernetes and OpenShift, including cluster management, orchestration and auto-scaling, and deployments using Helm charts and GitOps.
- Microservices Experience: Proven experience with microservices architecture and related deployment strategies.
- Infrastructure as Code: Expertise with Terraform modules.
- Configuration Management: Deep experience with Ansible, including writing complex playbooks and roles, and using Ansible Vault for secrets management.
- Security Practices: Strong understanding of DevSecOps principles and experience implementing security best practices within CI/CD pipelines.
- Analytical and Problem-Solving Skills: Possesses strong analytical and problem-solving abilities, leveraging data to inform product decisions. This skill is essential for identifying market opportunities, optimizing product features, and addressing challenges effectively.
- Communication Skills: Excellent presentation, oral, and written communication skills. Fluent business English is a requirement.
- Customer Focus: A passionate advocate for determining and delivering solutions with a high level of customer satisfaction. Able to treat customer experience as a top priority in solution delivery.
- Interest in Learning and Growth Mindset: Demonstrated interest in learning and a strong desire to expand knowledge in the field. Eagerness to explore new technologies, methodologies, and best practices to enhance skills and capabilities. Results-oriented attitude, with a drive to achieve objectives efficiently.
- Technical Leadership: Capable of engaging in technical discussions with stakeholders and leading DevOps projects. Mentors and coaches team members.
Nice to Have Skills
- Experience using a service mesh solution such as Istio
- Experience using tracing solutions such as Grafana Tempo or Jaeger
- Experience with RenovateBot or similar tools
- Experience with Node.js