Software Engineer
Redmond, WA 
Share
Posted 5 days ago
Job Description
OverviewAre you interested in working for one of the most exciting products at Microsoft, passionate about exceeding customer expectations and advancing Microsoft's cloud first strategy? Are you interested in a start-up like the environment, passionate about cloud computing technology and driving growth in one of Microsoft's core businesses? If so, then look no further than the Azure Customer Experience (CXP), Customer ReliabilityEngineering (CRE)Team! Microsoft Azure provides customers with on-demand and infinitely scalable infrastructure and platform for customers to build, host, and scale service applications on the Internet through Microsoft's global data centers. Azure Customer Experience (CXP), Customer ReliabilityEngineering (CRE) is a top-level pillar of Azure Engineering that leads to world-class customer reliability engagements, modern customer-first experiences for scale, and drives deep customer insights and empathy into the broader Azure Engineering organization. Our team prioritizes customer feedback to enhance Azure services, support, incident management, and community interactions. Our commitment to no dead-ends guarantees that all customers can maximize their potential with the Microsoft Cloud. As a Software Engineer, you will play a critical role in ensuring the reliability, availability, and performance of Synthetic infrastructure hosted in Microsoft Azure. As a Software Engineer you will be responsible for designing, implementing, and maintaining robust Synthetic workload and monitoring its systems to track and meet the service level objectives defined in our offerings to internal consumers. You will be accountable to improve customer experience on Azure, for diagnosing and troubleshooting mission critical customer applications built on the Microsoft Azure platform. Microsoft's mission is to empower every person and every organization on the planet to achieve more. Azure aspires to be the world's computer. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond
ResponsibilitiesDevelop a foundational understanding of distributed systems design, interactions between cloud technology layers and components, basic dependencies at scale, and the code that defines infrastructures. Develop an understanding of the code, features, and operations of Synthetic infrastructure at scale as required to contribute to incremental improvements in infrastructure availability, reliability, efficiency, observability, and/or performance; participate in on-boarding, code/design reviews, and regular meetings with the engineering teams that develop and/or manage those infrastructure components. Develop Synthetic workload to improve the observability, reliability, and operability of a defined range of platforms, systems, features with direction from other engineers. Support ongoing engagements with product engineering teams by participating in code/design reviews, and regular meetings throughout synthetic infrastructure development and operations cycles; draws insights from engagements with product engineering teams and basic analyses of telemetry data to propose potential improvements to code and designs for a defined set of product components or features with guidance from other engineers. Implement simple configuration and data changes across Synthetic workloads or features with guidance from other engineers to develop an understanding of how configurations, binaries, and data can be managed using code, tooling, and automation at scale. Uses existing tools to troubleshoot problems or flaws affecting the availability, reliability, performance, and/or efficiency of components or features with guidance from other engineers. Suggests potential solutions to resolve and prevent recurring issues and brings them to the attention of other engineers or team leaders. Participate in On-call rotations, including Incident response and mitigation within the infrastructure. During on call rotations evaluate the impact levels of incidents, resolves basic issues, notifies product teams or owners about substantial customer-affecting concerns, and escalates the resolution of intricate or multi-component/feature issues to other engineers as required. Communicates incident details and resolutions through post-mortem reports and in regular review meetings. Develop an understanding of key learnings, insights, and best practices that can be applied to improve system, platform, and/or product development and operations by participating in code/design reviews, incident drills and debriefs, and regular meetings, as well interactions with more experienced Site Reliability Engineers (SREs) and members of product engineering teams. Collaborate closely with Engineering/Program Managers to ensure the availability and performance of Live Site and the satisfaction of our customers. Drives continuous improvement in the Azure platform incorporating feedback from internal/external customers.

 

Job Summary
Company
Start Date
As soon as possible
Employment Term and Type
Regular, Full Time
Required Experience
Open
Email this Job to Yourself or a Friend
Indicates required fields