Description & Requirements
Infor is looking for a talented Senior Site Reliability Engineer to join our India Infor SunSystems development team as we build momentum for our recently released Multi-Tenant cloud solution. SunSystems is a well-established financial management solution with a depth of functionality that attracts thousands of B2B customers in multiple verticals around the globe. The solution is combined with real time analytics and is tightly integrated into Infor’s Technology platform.
SunSystems R&D is growing a team based in Hyderabad, to work side by side with the rest of the organisation currently based in Farnborough, UK and Shanghai China. This new team is intended to accelerate our delivery work on new features and product modernisation, backing up existing team members and bringing in new experienced team members who can help us drive transformation, innovate and build the future of SunSystems. This team will be part of a thriving local Infor community at the Hyderabad office which spans several Infor products and this is great opportunity to get in at the start of the SunSystems India team.
As a member of our R&D team, the Senior Site Reliability Engineer works closely with DevOps, QA, Data, Architects and Support team to drive high availability, high performance and systemic resilience and fault tolerance in our multi-tenant SAAS product. The Senior SRE will also work cross-functionally to investigate, understand and resolve the most complex problems and incidents, bringing these to a successful conclusion.
A Day in The Life Typically Includes:
Working closely with Architects, Developers and DevOps team members to improve the resilience of the product end to end, applying best practises and designing new approaches to ensure high availability, performance and systemic resilience of our SAAS product.
Take the lead as a cross-functional problem solver to investigate and resolve critical escalated incidents where the root cause is initially unknown, and may span software, infrastructure, data and configuration aspects.
Take a “whole product” view to improve the product, but also improve the customer experience of the product, through understanding customer needs and pushing for improvements where necessary. (as SRE you will take a very broad view, and may interact with many other team members in seeking to improve the overall product)
As we modernize the product further, we anticipate a growing landscape of microservices. Work closely with our architects and DevOps team to take a resilience and availability lens to these conversations as we shape the future state of the product.
Reporting clearly any defects found, capturing logs and scenarios and reproducing as required to support software investigations.
Supporting product team and development leadership team to understand the current state of the product, and to inform prioritisation decisions for product improvement (e.g. resilience improvement work vs feature delivery, tech debt resolution)
Basic Qualifications
Strong experience as an SRE for a complex cloud-based microservices product, able to articulate specific improvements you have driven, the approach taken and the benefit delivered.
Understanding of high availability systems in a cloud landscape.
Broad technical understanding covering software, data, devops, infrastructure.
Experience with escalated incident management under pressure, and understanding of incident management approach, incident communications, evidence-based decision making and lessons learned.
Experience working in an Agile (pref. Scrum) and iterative development approach.
Strong written and verbal communication skills in English.
Enthusiasm and ability to collaborate well with others, including remote teams
Professional pride, drive and curiosity, a diligent self-starter that keeps up to date with best practise and keeps your skillset sharp.
Strong problem resolution skills
Preferred Qualifications:
Your ability to articulate your central role in the collaborative design and continual improvement of high availability cloud systems is key.
Your ability to demonstrate your leading role in critical incidents and cross-functional problem resolution, where you have brought together different disciplines and taken a lead role in driving positive outcomes.
Working knowledge of AWS infrastructure, containerization and orchestration, automation of scaling advantageous.
Knowledge and understanding of best practises in complex system design, with a focus on high availability, systemic resilience and performance efficiency.
Experience of Atlassian suite (Jira, Confluence)
Customer focused mindset, with the ability to understand end user requirements and consider how users work with our software.
Understanding of Accounting/Reporting/Financial applications, or experience with or exposure to Infor SunSystems and Query and Analysis, beneficial but not required.