Location: Preferably based in Nuevo León, but open to candidates anywhere in Mexico
Job Type: Full-time
Work Modality: Hybrid / Remote
Department: Technology / Infrastructure / DevOps
Position Overview:
We are looking for a Senior Site Reliability Engineer (SRE) with experience in software engineering, infrastructure automation, and cloud operations. This position is part of a specialized team focused on improving the performance, security, and resilience of our platforms through automation, development lifecycle best practices (SDLC), and close collaboration with engineering teams.
The main objective of this role is to design, implement, and maintain automation tools and cloud architecture solutions that ensure stable, efficient, and scalable systems.
Key Responsibilities:
Technical
Solve technical issues, from complex challenges to day-to-day problems.
Automate infrastructure and reduce system complexity.
Develop tools and automation software.
Improve system observability and monitoring capabilities.
Design and document cloud architecture solutions.
Analyze infrastructure, detect anomalies, and optimize performance.
Respond to performance and security-related events.
Participate in on-call rotations as needed.
Collaborative
Participate in code reviews and promote best practices.
Mentor and train team members.
Improve communication across teams.
Write Root Cause Analysis (RCA) reports.
Optimize the software development lifecycle (SDLC) and promote a DevOps culture.
Requirements:
5+ years of experience in systems engineering and DevOps.
Strong hands-on experience with Kubernetes administration and deployment pipelines.
Proficiency in SDLC automation.
Advanced knowledge of cloud platforms: AWS, GCP, Azure.
Experience in Linux system administration and Git version control.
Solid understanding of networking, security, and distributed systems analysis.
Experience with CI/CD tools such as GitHub Actions, Jenkins, etc.
Proficiency in containers (Docker) and orchestrators (EKS, GKE, AKS).
Experience with Infrastructure as Code (Terraform, CloudFormation).
Experience with relational databases (MySQL, PostgreSQL, SQL Server) and NoSQL databases (MongoDB, Redis, DynamoDB, etc.).
Proficiency in at least one modern programming language: Java, Python, Ruby, Bash, Rust, C, or similar.
Nice to Have:
5+ years of software development experience.
Experience with automation frameworks such as Chef, Ansible, or Puppet.
Experience with RESTful API development.
Knowledge of messaging/streaming platforms like Kafka, Pulsar, Kinesis, Pub/Sub.
Familiarity with encryption and security concepts.
Key Competencies:
Analytical and critical thinking
Complex problem-solving
Effective communication and collaboration
Adaptability and results orientation
Proactivity, accountability, and sense of urgency
Continuous learning and mentoring
What We Offer:
Competitive salary
Above-law benefits
Opportunities for professional growth and career development
Involvement in global projects and cutting-edge technologies
Excellent work environment and multidisciplinary team
Recuerda que ningún reclutador puede pedirte dinero a cambio de una entrevista o un puesto. Asimismo, evita realizar pagos o compartir información financiera con las empresas.