Hace 2 sem
Site Reliability Engineer
Salario no mostrado por compañía.
Hace 2 sem
Site Reliability Engineer
Salario no mostrado por compañía.
Gemalto
en
Sobre el empleo
Detalles
Descripción
Position Summary
The candidate will be working as a SRE member who will help the organization to constantly ensure reliability, availability and performance of large-scale ODC services.
SRE will work closely with development teams to design, build, and maintain scalable and reliable infrastructure, automate processes, monitor system health, and respond to incidents effectively with a mindset of efficiency on day-to-day activities.
SRE will constantly adopt ITIL and Agile methodologies/processes, coaching and mentoring on best practices. Will endorse whole lifecycle over Public Cloud ensuring to meet external customer SLA and internal OLAs.
Essential Functions / Key Areas of Responsibility
• Develop and maintain Infrastructure as a Code and automation tools
• Responsible to Integrate, Operate and Support 7x24 mission critical services with 5x9 availability on public cloud.
• Responsible to ensure tier 1 / Platinum SLAs
• Responsible to review technical products and understand customer requirements.
• Responsible to perform regular tuning.
• Able to work with distributed teams worldwide.
• Responsible for defining business continuity strategy for Operated services over public cloud.
• Must animate and motivate the team on daily basis through Agile ceremonies (Daily, refinement, planning...)
• Must animate the team in term of self-organization.
• Responsible for suggesting indicators on team monitoring.
• Responsible for facilitating exchanges with the many stakeholders.
• Continuously improve service reliability, performance, and security of the services
• Collaborate with Service Delivery Managers on traffic trends, analyze the impact of mid-term business changes on capacity requirements.
• Participate in capacity management processes and security audits.
• Design and implement changes into the systems.
• Adapt solution parameters to make architecture evolutions.
Minimum Requirements: Skills, Experience, Education, Technical/Specialized Knowledge, Certifications, Language
· Bachelor Degree in Information Technology or a related field
· +5 years of experience in design, development and implementation of applications.
· +5 years of proven experience in Public Cloud (GCP or AWS)
· Minimum C1 English (Advanced Level)
· Strong experience on Kubernetes (certification)
· Strong experience on Apache Http Server
· Strong experience on TLS >= 1.2
· Ability to work SRE engineers during integration and operation project phases.
· Strong experience working in Agile teams
· ITIL/Agile certification
· Experience in embedding agile performance metrics to drive accountability.
· Effective verbal and written communication skills
· Strong working experience on one of the scripting languages – SHELL/Python is required.
· Ability to problem solve and be analytical.
· Strong results orientation with follow-up skill.
· Experience with SOAP and Rest API.
· Experience in No-SQL and SQL query construction.
· Experience on Datadog Monitoring tool implementation and monitoring
· Experience on Github
· Experience on Pipelines
· Experience to operate secrets on Vault
· Experience on GCP Terraform
ID: 18414808