As a Site Reliability Engineer, you will ensure the high-quality delivery of our software with a rapidly growing team, by building the frameworks and tools needed by software engineers and data scientists to thoroughly validate, deploy, and monitor their code. In this role, you will be a champion for best practices and a quality mentor to the rest of the engineering organization.This US-based role can be remote or based out of our New York City office.

 

At Dataminr, we are creating a team of talented builders, creators and visionaries to have a real-world impact on how organizations respond to fast-emerging events as they unfold. We are over 600 talented individuals, spanning seven global offices, united by our passion to use AI for the greater good and be agents of positive change in our company and in our communities.

 

We offer a competitive benefits package aimed at keeping you healthy and happy:

  • Comprehensive medical, dental and vision insurance plan options for employees, domestic partners and their dependents
  • Generous PTO, flexible sick days and remote working options
  • Paid parental leave and family forming benefits
  • Mental health benefits and support
  • Company equity (RSUs)

 

​At Dataminr, we serve a global community made up of many cultures and strive to reflect the diversity of the world in which we live. We stand for social justice and we lead with empathy. We foster a culture of allyship, standing up for those who face systemic barriers to equality. We actively condemn racism and discrimination in any form.

 

We believe our differences give us strength. Our employees are empowered to be their best, authentic selves through various opportunities, such as our robust employee resource group (ERG) network, learning and development funds, and more.

 

 

The opportunity

  • Develop and maintain an Internal Developer Platform used by engineering teams to deploy containers, manage configuration and secrets, observe applications and manage cloud resources
  • Drive improvements in security, scalability, reliability, and performance
  • Troubleshoot large-scale distributed systems
  • Work closely with engineering teams to support project delivery and provide guidance on infrastructure/architecture related decisions
  • Support our production infrastructure as part of an on call rota, help with triage and resolution when issues arise

 

What you bring

 

At Dataminr, we value you for who you are. We encourage you to apply for this role, even if you don’t meet every qualification. Our candidates are reviewed on the basis of their skill and potential to succeed.

 

  • Extensive knowledge and experience with Python, Go or similar programming languages
  • Experience with Linux system administration and TCP/IP networking
  • Hands on experience with building and operating scalable and secure infrastructure on AWS
  • Experience with deployment and administration of Kubernetes based infrastructure at scale
  • Familiarity with self service internal developer platforms for developers using cloud native technologies
  • Deep understanding of reusable infrastructure as code modules for provisioning cloud infrastructure e.g. Terraform
  • Experience with observability infrastructure e.g. OpenSearch, Kibana, Loki, Grafana
  • Extensive knowledge and experience with serverless functions e.g. AWS Lambda, GCP Cloud Functions
  • Experience designing and deploying reliable CI/CD pipelines for deploying containers, serverless functions and cloud infrastructure
Job Overview
Job alerts

Subscribe to our weekly job alerts below and never miss the latest jobs

Sign in

Sign Up

Forgotten Password

Job Quick Search

Cart

Basket

Share