Teikametrics

Senior Site Reliability Engineer

Teikametrics • IN
JavaPython Remote
ABOUT THE COMPANY
Teikametrics is revolutionizing retail through our patented Artificial Retail Intelligence platform. Our proprietary orchestration layer functions as prompt intelligence, verticalizing AI for Amazon, Walmart, TikTok, and emerging marketplace use cases. For more information, visit www.teikametrics.com.
 
ABOUT THE ROLE
Teikametrics is looking for a Site Reliability Engineer at Bengaluru, India to help us build and maintain our cloud infrastructure for hosting Teikametrics applications and platforms, in addition to helping build internal DevOps tools and best practices required for efficient software development and deployment. 
 
This highly visible role will help us guide deployments using the latest technologies such as Docker, Kubernetes, and Terraform while having an enormous impact on our entire organization.
 
You will work within a DevOps model alongside product development teams, designing, deploying, and managing automation tools that increase predictability, improve efficiency, and reduce operational cost.

** This role requires you to dedicatedly work in US hours, i.e, 9 AM - 6 PM EST 
**Candidates based in Bangalore are preferred
 
 
ABOUT THE TEAM
We currently host services and infrastructure across AWS in tandem with third party providers and must address the security and scaling challenges that come with this. Our daily role involves: 
  • Managing, and scaling web applications and data platforms

  • Building tools to implement devops and security best practices

  • Creating reusable and immutable infrastructure with Terraform

  • Continuously improve our infrastructure with monitoring, logging, and alerting

  • Developing authentication and gateway solutions for our infrastructure and applications

  • Investigating, identifying application issues and advising development teams on design, deployment and infrastructure choices.

  • Participating in on-call rotations, post-mortems and root cause analysis (RCA)

WHO YOU ARE

  • 3-5 years of professional experience

  • Take full ownership of significant system components with responsibility for their reliability and performance

  • Manage Lifecycle of core project infrastructure, from design through to deployment, maintenance, and performance optimization 

  • Familiar with industry standards and devops best practices

  • Experience supporting the overall platform in on-call rotations.

  • Able to operate with minimal supervision

  • HOW YOU'LL SPEND YOUR TIME

  • Managing deployment infrastructure and automations including Github, CI/CD pipelines,and other deployment tooling

  • Experience managing workflows and pipelines using tools like CircleCI, Argo Workflows etc

  • Cloud computing providers such as AWS

  • Hands-on experience with Kubernetes (EKS, GKE) or similar container orchestration platforms 

  • Infrastructure as code tools such as Terraform

  • Experience coding with at least one language such as Bash, Python required

  • Hands-on experience with authentication and authorization technologies required 

  • Automation experience of cloud environments

  • Containerization technologies and tools such as Docker

  • Monitoring tools such as Datadog, Opensearch, Sentry

  • WHAT CAN HELP YOU STAND OUT

  • Good at using AI agents and writing project specific standard guidelines

  • Experience operating data pipelines with Databricks, Kafka

  • Experience with Java, Javascript

  • Experience with managing infrastructure costs and budgets

  • Experience with databases like AWS RDS/Postgres

  • WE'VE GOT YOU COVERED

  • Every Teikametrics employee is eligible for company equity
  • Remote Work Flexibility - Work from home or from our offices, with flexible remote options
  • Broadband reimbursement 
  • Group Medical Insurance – Coverage of INR 7,50,000 per annum for a family 
  • Crèche benefit