Director, Information Technology
Bengaluru, Karnataka, India
We Are:
At Synopsys, we drive the innovations that shape the way we live and connect. Our technology is central to the Era of Pervasive Intelligence, from self-driving cars to learning machines. We lead in chip design, verification, and IP integration, empowering the creation of high-performance silicon chips and software content. Join us to transform the future through continuous technological innovation.
You Are:
You are a dynamic and strategic leader with a passion for technology and innovation. With over a decade of experience in infrastructure technologies, you have honed your skills in managing Site Reliability Engineering (SRE) lifecycle practices. You thrive in environments where you can leverage your expertise in Linux platforms, networking protocols, storage solutions, and databases. Your problem-solving capabilities are top-notch, and you are adept at troubleshooting and resolving complex infrastructure issues.
Automation is second nature to you, and you have a proven track record in developing custom monitors and implementing automated testing processes. You are process-oriented, always looking for ways to enhance reliability and efficiency. Your proactive mindset drives you to anticipate potential issues and plan for future needs, ensuring that systems are resilient and performant.
Communication is one of your strong suits. You excel at keeping stakeholders informed, building strong relationships across teams, and fostering a culture of open communication. You are a leader who inspires your team to think ahead, take proactive steps, and continuously seek improvements. Your ability to manage change effectively ensures that disruptions are minimized while maximizing efficiency.
What You’ll Be Doing:
- Developing and implementing a comprehensive Reliability Architecture to ensure high standards of reliability and performance across all platforms, systems, and tools.
- Integrating Monitoring and Alerting systems into the observability platform to ensure early detection and resolution of issues.
- Managing Incident Response, leading the team during major incidents, ensuring quick and effective resolution, and conducting thorough postmortem analyses.
- Driving Automation & Continuous Improvement initiatives to reduce incidents, improve efficiency, and enhance reliability.
- Instilling a culture of Proactive Thinking within the team, managing change effectively, and overseeing the improvement of Change Management procedures.
- Communicating effectively with stakeholders regarding reliability issues, building strong relationships, and fostering a culture of open communication.
The Impact You Will Have:
- Ensuring the reliability and performance of critical infrastructure services that support engineering and business teams.
- Enhancing the resilience and efficiency of our private cloud environment through strategic reliability practices.
- Improving incident response times and minimizing the impact of major incidents on business operations.
- Driving continuous improvement and innovation in infrastructure reliability and performance.
- Building a proactive and forward-thinking team culture that anticipates and addresses potential issues.
- Fostering strong relationships and effective communication across teams and stakeholders, ensuring alignment and collaboration.
What You’ll Need:
- 5+ years of practical experience managing SRE lifecycle practices in a mid-to-large organization.
- 10+ years of experience working on various infrastructure technologies, including Linux platforms, storage platforms, networking protocols, DNS/LDAP, and databases.
- 5+ years of direct experience troubleshooting and solving infrastructure problems.
- Proven process-oriented approach to solving problems and improving reliability.
- Proven experience with Automation, focusing on developing custom monitors and automated testing processes.
Who You Are:
- A strategic and dynamic leader with a passion for technology and innovation.
- An excellent communicator who can build strong relationships and foster a culture of open communication.
- A proactive thinker who anticipates potential issues and plans for future needs.
- A problem solver with top-notch troubleshooting skills and a process-oriented approach.
- A champion of continuous improvement, always seeking ways to enhance reliability and efficiency.
The Team You’ll Be A Part Of:
You will lead a team of dedicated Site Reliability Engineers focused on developing and operationalizing lifecycle practices for all infrastructure services. The team is pivotal in ensuring the resilience and performance of our private cloud environment, working closely with cross-functional teams and service owners to build and maintain high standards of reliability.
Rewards and Benefits:
We offer a comprehensive range of health, wellness, and financial benefits to cater to your needs. Our total rewards include both monetary and non-monetary offerings. Your recruiter will provide more details about the salary range and benefits during the hiring process.
Inclusion and Diversity are important to us. Synopsys considers all applicants for employment without regard to race, color, religion, national origin, gender, sexual orientation, gender identity, age, military veteran status, or disability.