Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in history, we surpassed $2B in revenue in our last fiscal year with extensive growth potential ahead.
At the heart of Veeva are our values: Do the Right Thing, Customer Success, Employee Success, and Speed. We're not just any public company – we made history in 2021 by becoming a public benefit corporation (PBC), legally bound to balancing the interests of customers, employees, society, and investors.
As a Work Anywhere company, we support your flexibility to work from home or in the office, so you can thrive in your ideal environment.
The Role
Join our dynamic team as a Senior Site Reliability Engineer on the Vault Platform team, where you'll ensure the scalability and reliability of our enterprise applications. You'll tackle complex challenges at a global scale, drawing on your deep expertise in Java and modern open-source technologies to make a tangible impact on production systems.
You should have extensive experience with Java applications and the latest open-source technologies, ideally from enterprise software development or a high-growth tech company. As a Senior SRE, you'll need to be naturally curious and skilled at problem-solving. You'll also bring a unique engineering perspective, understanding how systems integrate in production to operate at a global scale for hundreds of customers across North America, Europe, and Asia.
What You'll Do
Build Cloud Infrastructure: Rapidly build new cloud infrastructure from scratch, adhering to software development best practicesDrive Reliability & Scalability: Ensure our platform meets the scalability and reliability needs of our hundreds of global customers (across North America, Europe, and Asia)Lead Incident Management: During an incident, effectively lead triage and mitigation efforts, potentially performing periodic on-call duty for escalationsAutomate & Optimize: Develop tools and automation to eliminate manual work and reduce issue resolution timesFull-Stack Diagnostics: Proactively learn all necessary systems to provide full-stack diagnostics and determine root causes of production problemsStrategic Engineering Partnership: Strategize with engineering teams on complex problems, offering insights on what will work at scale (supporting 2M+ users) and guiding development decisions before features shipInfluence Design: Participate in engineering design reviews of new features and drive initiatives to improve operational efficiency and platform scalabilityCross-functional Collaboration: Partner effectively with Product Management, Design, and QA to deliver cutting-edge solutions and direct customer valueBackend Focus: Work across multiple layers of our technology stack, with a primary focus on backend development, and opportunities in frontend and infrastructureEffective Communication: Communicate clearly with engineering teams, succinctly describing problems for seamless hand-offs during outages with both technical and non-technical audiencesMentorship: Actively mentor team members, contributing to a positive and high-performing team environment
Requirements
Deep Java Expertise: 5+ years of experience in Java development, with a strong preference for experience within enterprise cloud software companiesOperational Experience: Hands-on operational experience in a high-volume or critical production service environment, including incident management and root cause analysisCode Quality: Proven ability to write clean, testable, readable, and maintainable code within a collaborative team settingOpen Source Proficiency: Hands-on experience with a range of open-source technologies, such as Spring, MySQL, Hibernate, Solr, Maven, Git, Tomcat, Linux, AWS, Vagrant, Docker, and KubernetesDatabase Mastery: 3+ years of experience in relational databases with expert-level SQL skillsScripting Skills: Solid scripting proficiency with languages such as Shell, Bash, Ansible, Python, Go, Ruby, etc.Leadership & Communication: Demonstrated history of incident management and leadership ability, with effective communication skills across all levels (individual contributors to executives)Mentorship: Proven record of making your team better through mentorship