Research Intern - Cloud Systems Reliability

Last updated 2 days ago
Location:Redmond, Washington
Job Type:Full Time

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers. Our researchers and engineers pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.

Empower every person and organization on the planet to achieve more. That’s what inspires us, drives our work, and pushes us to challenge the status quo every day. At Microsoft, we also work to empower our employees, so they can achieve more. We believe we should each find meaning in our work and we ensure employees have the freedom and the reach to help make a difference in the world.

Microsoft Research provides a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers. Our researchers and engineers pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.

The cloud reliability research in Microsoft Research aims to develop practical tools and techniques that can help cloud developers adequately debug, test, configure, and monitor their systems. The research combines Distributed Systems, PL, Software Engineering, and Machine Learning techniques and spans all aspects of improving reliability and availability of large-scale cloud systems, including:

  • Analyzing production data to understand how real cloud systems fail and what can be done to prevent them.
  • Developing practical static and dynamic analysis techniques to uncover hard-to-find bugs before production. The techniques can be evaluated and used with thousands with Microsoft software projects.
  • Developing practical and novel techniques for diagnosing failures, runtime monitoring, logging, & failure prevention, etc.

Over the last several years, technology arising from our research has significantly improved reliability of many Azure products and has led to publications in top systems conferences, including a Best Paper at ACM SOSP 2019.

During this internship the intern will have the opportunities to work with top researchers and engineers at Microsoft Research as well as various Azure systems to develop innovative solutions to challenging research problems in the above technical areas. You will also have the unique opportunity to evaluate your techniques on thousands of software projects in Microsoft.

Responsibilities

Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, interns learn, collaborate, and network for life. Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, students are paired with mentors and expected to collaborate with other interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.

Qualifications

In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter.

Required Qualification

  • Must be currently enrolled in a PhD program in Computer Science or a related STEM field.

Preferred Qualifications

  • Familiarity with common failures of cloud systems and techniques for debugging, testing, configuring, and monitoring large-scale cloud systems
  • Familiarity with existing distributed systems, software engineering, and machine learning techniques to improve software reliability

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.