Founded in 2017, percipient.ai utilizes state-of-the-art research in Computer Vision, Artificial Intelligence, and Deep Learning to develop cutting-edge tools that bridge the gap between AI and human understanding. We pride ourselves on maintaining an inclusive and collaborative work environment that enables each individual to grow while having a meaningful impact on national security. Join our team today!
Percipient.ai is currently seeking Sr. DevOps Engineers to work with our customers to ensure their mission success. This position will work directly with customers at their facilities several days a week. In addition to DevOps skills, this broad role requires skills in IT/networking, release management, site reliability engineering, cloud computing, and overall troubleshooting.
Responsibilities include ensuring our pipeline for deploying mission critical updates is working smoothly, and using open-source tooling, cloud services, and Infrastructure as Code to automate deployments and scale our product. Additionally, this role will frequently troubleshoot complex problems that will impact the most pressing intelligence and national security missions.
Responsibilities:
Utilize troubleshooting and scripting skills to improve the availability, performance, and security of Percipient.ai servicesImplement automated deployments, and operational toolsCollaborate with product and engineering teams to plan and deploy product releasesEnsure services are designed with 24/7 availability and operational readiness and rigorImplement proactive monitoring, alerting, and self-healing systemsParticipate in on-call rotations, driving restoration and repair of service-impacting issuesDefine non-functional requirements as part of the product lifecycle to influence the new designs, standards, and methods for scalable, highly available distributed systemsCoding and automation of applications in the cloud
Requirements:
BS in Computer Science or related fieldAbility to be onsite at the customer’s facility several days a week8+ years of Systems/Applications automation in 24/7 production services environmentsExpert understanding of running a large-scale virtualized infrastructure in the cloud and on-premiseExpertise with containerizing concepts like Docker, PaaS services on AWS, and Kubernetes or equivalent technologiesFluency with at least one current generation scripting language used by DevOps professionals such as Python, Bash, or PerlDeep experience operating on AWS (C2S) and infrastructure automation using Ansible and TerraformExcellent troubleshooting and problem-solving skillsDemonstrated experience in analyzing and diagnosing large-scale distributed systems and Linux systems internals (system libraries, file systems, etc.)Experience with elastically scalable, fault tolerance and other cloud architecture patternsExperience with Continuous Integration and Continuous Delivery, including tools such as CloudformationExperience in Linux and security triage and forensic analysisExcellent interpersonal and communication skillsUS citizenship and a national security background requiredExperience working with the DoD / IC community a plus