الوظائف الحالية
اكتشف و تقدم بالطلب الآن
Site Reliability Engineer (m/f/d)
Site Reliability Engineer
We are looking for a highly skilled Site Reliability Engineer with strong expertise in Kubernetes, ELK stack, OpenStack, and Linux. The role involves building and maintaining reliable infrastructure, ensuring observability, and supporting production environments with automation and strong troubleshooting skills.
Key Responsibilities
- Deploy, manage, and scale Kubernetes clusters and containerized workloads.
- Design and implement logging and monitoring solutions using the ELK stack (Elasticsearch, Logstash, Kibana).
- Operate and maintain OpenStack cloud environments.
- Perform advanced Linux administration (troubleshooting, patching, optimization).
- Automate infrastructure provisioning and management using Terraform, Helm, and Ansible.
- Build and maintain CI/CD pipelines to support development and production deployments.
- Ensure system reliability, scalability, performance, and security across environments.
- Participate in on-call rotations, providing incident response and root cause analysis.
Skills and Competencies
- Proven hands-on experience with Kubernetes and Docker.
- Strong knowledge of the ELK stack (Elasticsearch, Logstash, Kibana).
- Expertise in managing OpenStack environments (STC OpenStack cloud preferred).
- Advanced proficiency in Linux administration (RHEL, Ubuntu).
- Strong scripting skills in Python and Bash.
- Experience with CI/CD tools (GitLab preferred).
- Familiarity with cloud platforms (AWS, GCP, or Azure).
- Experience with Prometheus/Grafana for monitoring.
- Understanding of Kubernetes and Linux security hardening best practices.
Qualifications
- 6–10 years of experience in Linux and Cloud Technologies
- Bachelor's degree in Computer Science, Information Technology, or related field
- Relevant Certification (preferred):
- CKA, CKS (Certified Kubernetes Administrator / Security Specialist)
- Linux Professional Institute Certification (LPIC-2 or LPIC-3) or Red Hat Certified Engineer (RHCE)
- OpenStack Administrator Certification
- HashiCorp Terraform Associate Certification
- GCP/AWS/Azure Solutions Architect or SysOps certifications
Halian Group:
With over 28 years of experience, we have come to understand that innovation is the only way to provide agile, practical solutions that transform businesses and careers. Our resourcing and smart services help you to realize tomorrow’s potential. Discover the amazing things possible when you bring the right people and the right technologies together.
At Halian, we recognize that diversity, equity, and inclusion (DEI) are essential to building high-performing teams for our clients. We are committed to connecting organizations with top talent from all backgrounds, ensuring that every individual feels valued, respected, and empowered to contribute their unique perspectives. We encourage applications from all qualified candidates, regardless of race, gender, disability, or any other characteristic that makes them unique. By fostering diverse and inclusive workplaces, we help our clients drive innovation, enhance collaboration, and better reflect the communities they serve.
#LI-ST1