Job Title: NOC Operator (Operador del Centro de Operaciones de Red)
Location: Mexico, Tijuana
Role Overview:
The NOC Operator will be responsible for real-time monitoring, identifying, and responding to system alerts. This role is critical in ensuring our services run smoothly 24/7, with rapid incident response and escalation when necessary. The ideal candidate will have experience in monitoring complex architectures, managing logs, and following well-defined escalation processes.
Key Responsibilities:
Monitoring & Alerting:
- Continuously monitor system performance using Prometheus, Grafana, and other monitoring tools.
- Respond to alerts promptly and assess the impact of issues on our services.
- Report and escalate incidents according to the predefined escalation process.
Incident Management:
- Gather logs, metrics, and other relevant information during incidents.
- Troubleshoot basic issues and apply quick fixes where applicable.
- Coordinate with development and Engineering teams to resolve more complex incidents.
Communication:
- Maintain clear and concise communication with internal teams via Discord during incidents.
- Document incident details, actions taken, and outcomes in incident reports.
Process Improvement:
- Participate in post-incident reviews and contribute to improving monitoring and escalation processes.
- Suggest improvements to monitoring tools and alert thresholds to reduce false positives and enhance early detection.
Reporting:
- Create regular reports on system performance, incidents, and downtime.
- Highlight patterns and suggest preventive measures for recurring issues.
Requirements:
- Experience in a NOC, monitoring, or similar operational support role.
- Proficiency with monitoring tools such as Prometheus and Grafana.
- Familiarity with log management and basic troubleshooting techniques.
- Strong communication skills and ability to operate effectively in a fast-paced environment using tools like Discord.
- Knowledge of incident management and escalation processes.
- Availability to work in shifts, including nights and weekends, to provide 24/7 support.
Nice to Have:
- Experience in the sportsbook or gaming industry.
- Knowledge of cloud infrastructure and DevOps practices.
- Basic scripting skills for automation (e.g., Python, Bash).
What We Offer:
- Competitive salary and benefits package.
- Opportunity to work with a dynamic and innovative team.
- Continuous learning and professional development opportunities.