
SRE

  • On-site
    • CDMX, Ciudad de México, Mexico
    • Monterrey, Nuevo León, Mexico
    • Guadalajara, Jalisco, Mexico
    • Saltillo, Coahuila de Zaragoza, Mexico
    +3 more

Job description

A leading IT company in Latin America is growing and is looking for:

SRE (Site Reliability Engineer)

Job Description:

We are looking for a Lead Site Reliability Engineer who takes the initiative in developing and maintaining the systems and services for our Cash Management Platform. The role involves automating the deployment process, ensuring the system scales, investigating and resolving outages, proactively identifying and implementing preventive measures, collaborating with key stakeholders, and continuously looking for ways to provide real-time visual feedback on all metrics and statuses.


What you will do:

  • Proactively build and implement services that help IT and support teams do their jobs better.
  • Design and implement dashboards that provide valuable real-time insights into key platform metrics.
  • Lead engagement with software developers, DevOps, and other infrastructure engineers to integrate software development and delivery from inception to full operation, ensuring robust software and system releases.
  • Optimize on-call rotations and processes.
  • Ensure incidents assigned to the team are managed within agreed SLAs.
  • Ensure alarms are documented in up-to-date Knowledge Base articles.
  • Conduct post-incident reviews to identify platform status.

What we’re looking for:

  • Bachelor’s degree in computer science, or equivalent experience relevant to SRE or automation/development.
  • 7+ years of experience focused on Site Reliability Engineering or a related position on one or more of the major cloud platforms.
  • Experience automating multi-tenant systems, preferably in a cloud environment.
  • Good understanding of Site Reliability Engineering (SRE) philosophies, technologies, platforms, and tools, including SLO management, incident resolution, and automation.
  • Ability to explain technical concepts in clear, non-technical language.
  • Experience building Infrastructure as Code.
  • Experience with Docker, Kubernetes, and networking concepts.
  • Experience with Grafana and Prometheus.
  • Integration experience with PagerDuty, ServiceNow, and Datadog.
  • Expertise with system and performance monitoring tools (Dynatrace, Splunk, etc.).


ADVANCED CONVERSATIONAL ENGLISH IS ESSENTIAL (will be evaluated).

Job type: On-site.

Location: Monterrey / Saltillo / Guadalajara / Mexico City

Salary: $95,000 gross.

Benefits: Excellent benefits.
