Reliability Engineering Manager, Cloud Operations
Zscaler
- Tel Aviv
- Permanent position
- Full-time
- Lead and manage a team of cloud operations engineers, providing guidance, mentorship, and support to ensure the team's success.
- Design and deploy customer-facing Linux- and BSD-based system infrastructure
- Create and deploy scalable monitoring systems for a rapidly growing global infrastructure
- Architect and implement cloud management automation
- Write, augment, and maintain Ops documentation
- Resolve NOC escalations and help prevent recurrence of those incidents by creating NOC processes, procedures, and automation
- Linux/UNIX system engineering (create and support highly scalable compute solutions)
- Recommend the integration strategies, platforms, and application infrastructure required to implement desired solutions, and provide best-practice advice to other teams to optimize Zscaler Cloud effectiveness.
- 10+ years of experience in a Linux/UNIX System Administration or lead DevOps role
- Bachelor's degree in Computer Science, a related technical field involving computer systems engineering, or equivalent practical experience.
- Experience leading and managing teams, with a focus on fostering collaboration, innovation, and professional development.
- Knowledge of security best practices and compliance standards in cloud environments.
- Strong analytical and problem-solving skills, with a focus on continuous improvement and optimization.
- Excellent communication and interpersonal skills, with the ability to effectively engage with stakeholders at all levels of the organization.
- Comfort and experience working in an Ops environment that is scaling rapidly
- Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
- Ability to debug, optimize code, and automate routine tasks.
- Hands-on experience with immutable infrastructure and with infrastructure-as-code tools such as Puppet, Chef, Ansible, or similar
- Knowledge of virtualization, OpenStack, cloud architecture and services, automated deployments, APIs, Docker, and Kubernetes
- Strong background in Linux/UNIX system administration
- Excellent scripting skills and experience (Bash and Python or Perl; Python preferred)
- Experience maintaining and deploying systems and software in diverse environments
- Strong understanding of web security and protocols (HTTP, SSL/TLS, DNS), SQL, and network fundamentals (DHCP, ARP, subnetting, routing, NAT, firewalls, IPv4 and IPv6)
- Rich DevOps skills across CI/CD, SCM, static code analysis, builds and releases, and continuous integration tools and frameworks (e.g. SVN, Git, Jenkins, Bitbucket)
- Knowledge of advanced networking technologies, services, and architectures, including SDN, NFV, SD-WAN, MPLS, BGP routing, switching, and VXLAN, is a definite plus