For job seekers
For companies
Set your preferences and let your AI copilot handle the job search while you sleep.
Senior Engineer - Cloud Operations (Platform Support)As a Cloud Operations Engineer in our Cloud Operations Center, you will be a key player in ensuring the 24x7x365 smooth operation of Saviynt’s Enterprise Identity Cloud. This role focuses on maintaining the stability, performance, and reliability of our platform with a strong emphasis on application layer support and operational ownership. You will be working closely with other operations team members, development, and engineering to resolve issues, implement improvements, and provide exceptional support. This is an opportunity for someone who enjoys operational challenges and problem-solving in a dynamic cloud environment and wants to see their work through tocompletion.
WHATYOU WILL BE DOING
·
Strong pod-level troubleshooting skills in AKS/EKS (not just restarting pods).·
Analyze application and DB (RDS, MySQL) performanceissues.Deeplyinvestigate and analyze application performance issues (Java, Grails, Hibernate), identifying root causes and implementingsolutions.·
Overseethe monitoring of our SaaS applications and underlying infrastructure (Kubernetes on AWS and Azure, VPN connections, customer applications, Elastic Search, MySQL) for alerts and performanceissues.·
Strongunderstanding of basic computing concepts like DNS, IP addressing, Networking, andLDAP.·
Effectivelyparticipate and contribute in on-call escalations with a strong operational mindset and provide technical guidance during criticalincidents.·
Proactivelycommunicate with customers on technical issues whenrequired.·
Abilityto guide junior engineers when neededtechnically.·
Managethe full lifecycle of alerts, incidents, and service requests reported through FreshService, ensuring timely and accurate logging, prioritization, resolution, andescalation.·
Develop, implement, and maintain operational procedures, runbooks, and knowledge base articles to standardize incident resolution and service requestfulfillment.·
Drivecontinuous improvement initiatives to optimize operational efficiency, reduce incident rates, and improve service request turnaroundtimes.·
Collaboratewith backend engineering and development teams to troubleshoot complex issues, identify root causes, and implement preventativemeasures.·
Ensureadherence to defined SLAs (Service Level Agreements) and KPIs (Key Performance Indicators) for operationalperformance.Maintainoperational documentation, including system diagrams, contact lists, and escalationpaths.·
Ensurecompliance with relevant security and compliancepolicies.·
Planand coordinate scheduled maintenance activities with minimal impact to serviceavailability.WHATYOU BRING
·
Bachelor's degree in Computer Science, Information Technology, Engineering, or a relatedfield.·
Minimumof 6-8 years of experience in IT/Cloud operations and application support (specifically Java apps), with knowledge of cloud infrastructure (AWS and Azure).·
Strong experience with application support (Java, Grails, Hibernate) and performance analysis in a production environment, able to pinpoint a performance degradation throughanalysis.·
Strongunderstanding of cloud computing concepts, architectures, and services on both AWS and Azureplatforms.·
Workingknowledge of containerization and orchestration technologies, specificallyKubernetes.End-to-endtechnical accountability and operationalownership.Willingnessto work in a 24/7 operatingmodel.·
Experiencemanaging and troubleshooting network connectivity, including VPNs and connections to externalnetworks.·
Familiaritywith monitoring tools and practices, with experience in setting up and responding toalerts.·
Hands-onexperience with log management and analysis tools, preferably ElasticSearch.·
Workingknowledge of database systems, preferably MySQL, including L2 troubleshooting and performancemonitoring.·
Experiencewith ITSM (IT Service Management) systems, preferably FreshService, including incident, problem, and service request managementprocesses.·
Excellentproblem-solving, analytical, and troubleshooting skills with a data-drivenapproach.Experiencewith Grafana systems and dashboards is aplus.·
Strongcommunication (written and verbal), interpersonal, and presentationskills.·
Abilityto work effectively under pressure and manage multiple priorities in a fast-pacedenvironment.·
Experiencein developing and documenting operational procedures andrunbooks.·
Experiencewith automation tools and scripting languages (e.g., Python, Bash) is aplus.·
Experienceworking in a SaaS environment is highly desirable.·
Workingknowledge of database systems, preferably MySQL, including L2 troubleshooting and performancemonitoring.·
Experiencewith ITSM (IT Service Management) systems, preferably FreshService, including incident, problem, and service request managementprocesses.·
Excellentproblem-solving, analytical, and troubleshooting skills with a data-drivenapproach.Experiencewith Grafana systems and dashboards is aplus.·
Strongcommunication (written and verbal), interpersonal, and presentationskills.·
Abilityto work effectively under pressure and manage multiple priorities in a fast-pacedenvironment.·
Experiencein developing and documenting operational procedures andrunbooks.·
Experiencewith automation tools and scripting languages (e.g., Python, Bash) is aplus.·
Experienceworking in a SaaS environment is highly desirable.We offer you a competitive total rewards package, learning and tremendous opportunities to grow and advance in your career. At Saviynt, it is not typical for an individual to be hired at or near the top of the range for their role and final compensation decisions are dependent on many factors including, but are not limited to location; skill sets; experience and training; licensure and certifications; and other relevant business and organizational needs. A reasonable estimate of the current range is $Min,000 - $Max,000 annually.
You may also be eligible to participate in a Saviynt discretionary bonus plan, subject to the rules governing the program, whereby an award, if any, depends on various factors, including, without limitation, individual and organizational
performance.Ifrequired for this role, you will:Complete security & privacy literacy and awareness training during onboarding and annually thereafterReview (initially and annually thereafter), understand, and adhere to Information Security/Privacy Policies and Procedures such as (but not limited to):> Data Classification, Retention & Handling Policy> Incident Response Policy/Procedures> Business Continuity/Disaster Recovery Policy/Procedures> Mobile Device Policy> Account Management Policy> Access Control Policy> Personnel Security Policy> Privacy Policy
Saviynt is an amazing place to work. We are a high-growth, Platform as a Service company focused on Identity Authority to power and protect the world at work. You will experience tremendous growth and learning opportunities through challenging yet rewarding work that directly impacts our customers, all within a welcoming and positive work environment. If you're resilient and enjoy working in a dynamic environment you belong with us!
Saviynt is an equal opportunity employer and we welcome everyone to our team. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!