Puesto, ciudad o estado.

Hace 4 sem

Principal Site Reliability Engineer

Salario no mostrado por compañía.

HERBALIFE INTERNACIONAL DE MEXICO SA DE CV en

Hace 4 sem

Principal Site Reliability Engineer

Salario no mostrado por compañía.

HERBALIFE INTERNACIONAL DE MEXICO SA DE CV

en

Sobre el empleo

Educación mínima requerida: Universitario titulado

Detalles

Contratación:Permanente
Espacio de trabajo:Desde casa

Descripción

POSITION SUMMARY STATEMENT:

The Principal SRE Engineer plays a pivotal role in ensuring the utmost reliability, scalability, and performance of our organization's critical infrastructure and services. This position entails collaborating with cross-functional teams and spearheading the design, construction, and maintenance of highly resilient systems. The Principal SRE Engineer's expertise and leadership significantly contribute to the overall stability and operational efficiency of our technology stack.



DETAILED RESPONSIBILITIES/DUTIES:

• Lead the design, implementation, and maintenance of scalable, fault-tolerant systems and infrastructure, collaborating with other Principal Developers, Engineering Managers, and Architects.

• Establish and enforce best practices for system monitoring, alerting, and incident response.

• Collaborate with development teams to improve the reliability and performance of applications and services.

• Develop and implement automated deployment and configuration management systems.

• Collaborate with DevOps teams for CI/CD pipeline ownership and development.

• Conduct performance analysis, capacity planning, and optimization of infrastructure components.

• Investigate and troubleshoot complex system and application issues and provide timely resolution.

• Define and measure service-level objectives (SLOs) and ensure adherence to them.

• Mentor and coach junior SRE engineers, fostering a culture of learning and continuous improvement.

• Stay up to date with industry trends and emerging technologies and evaluate their potential impact on our systems.



Skills Required:

• Strong understanding of established architecture and development patterns.

• Familiarity with distributed systems and event-driven architecture.

• Strong knowledge of cloud-native architectures and microservices.

• Knowledge of DevOps principles and practices, including CI/CD.

• Deep understanding of database technologies.

• Experience in application development and integration. Preferably Java.

• Deep understanding of containerization technologies like Docker and orchestration frameworks.

• Experience in cloud platforms and services such as Azure, GCP, or AWS.

• In-depth knowledge of networking concepts, protocols, and security practices.

• Expertise with monitoring and observability tools like Dynatrace.

• Experience with log management and analysis tools like Splunk.

• Excellent oral and written communication skills.

• Fluent English essential.



Experience:

• 5 years' experience in a similar role, managing or operating at large-scale complex production systems.

• 8+ Years' experience in design & development of end to end complex applications



Education Required:

• Bachelor's in Computer Science or equivalent



PREFERRED QUALIFICATIONS:

• Experience with infrastructure automation tools.

• Experience with big data technology platforms like Apache Kafka.

• Experience with international or multi-level marketing business.

• Advanced certifications for related fields such as DevOps Certifications, Cloud Administration, or Docker/Kubernetes Administration.

• Advanced certifications related to monitoring or observability tools such as Dynatrace, Splunk.

ID: 18313389