Site Reliability Engineer
Job Summary:
We are seeking a proficient Site Reliability Engineer who possesses a diverse range of technical skills and a strong focus on development and automation to address complex challenges. The ideal candidate is enthusiastic about contributing to significant technological advancements that affect a large user base. In our dynamic SRE team, you will collaborate with seasoned engineers on large-scale applications that provide real-time content to numerous users within a rapidly evolving organization.
Key Responsibilities:
- Collaborate with fellow engineers to optimize and enhance the day-to-day operational processes of essential applications and services in a 24/7/365 environment spanning both cloud and traditional data centers.
- Continuously improve application visibility to ensure the reliability and availability of our applications and infrastructure.
- Leverage a wide array of open-source technologies to develop resilient, scalable, and secure high-performance services and deployment pipelines on a global scale.
Qualifications:
- Proven experience in constructing scalable production environments.
- Familiarity with source code management and the surrounding delivery tooling: version control, branching and merging, configuration and build management, artifact repositories, automated build processes, testing frameworks, and deployment methodologies.
- Extensive knowledge of Continuous Integration and Continuous Delivery (CI/CD) practices and tools (e.g., Jenkins, GitLab CI), as you will be responsible for maintaining and supporting automated build pipelines.
- Proficiency in scripting languages such as Bash, PowerShell, Python, Go, or Ruby.
- Strong experience working with Docker containers and container orchestration technologies, particularly Kubernetes.
- Knowledge of infrastructure-as-code tools such as Ansible, Chef, or Terraform is desirable.
- Ability to thrive in a collaborative, distributed team environment.
- Experience with monitoring, logging, and metrics systems (such as Prometheus, Logstash, and Grafana).
- Programming experience in Java or .NET.
Preferred Skills:
- Understanding of database management systems (such as MSSQL or PostgreSQL).
- Practical experience with messaging technologies such as RabbitMQ or Apache Kafka, including configuration and troubleshooting.
This is a fantastic opportunity for a talented professional to grow in a pioneering and fast-paced sector. If you possess the required skills and are driven to make a significant impact, we encourage you to apply.