Machine Learning Researcher - Audio/Speech/Computer Vision

Redmond, WA, United States

From being able to log you in with face recognition, launch Cortana with a voice command, to the exciting possibilities in augmented reality, are you itching to play a part in bringing applications of computer vision to millions?

 

The Microsoft Applied Sciences Group incubates disruptive technologies for Microsoft’s next-gen hardware products and is working on several exciting projects that will shape how computers and other devices perceive the user and the user’s environment. Operating as a startup within the company, this team works closely with several research and product teams to bring compelling new experiences to the market. A lot of these experiences will be powered by speech and computer vision – and as part of this team, you will have the unique opportunity to work on almost every aspect of a shipping audio and vision system: camera optics, sensors, data pipeline and of course, developing and implementing the algorithms that make magic happen!

Responsibilities

We are looking for an audio, speech and/or computer vision researcher with expertise in deep learning techniques to help our devices compute better understanding of the user and the environment. The ability to analyze multimodal sensor data and interpret various human and human-object interactions is key to Applied Sciences’ mission of enabling a seamless set of human computer interactions. As part of this team, you will be working with a growing team of talented researchers already dedicated to this mission and use data and hardware only available to a select few. Naturally, the opportunity for you to push the state of the art in this field is huge.

Qualifications

Requirements:

 

PhD in Computer Science, Electrical Engineering, or related field, or Master's degree and 2+ years of related experience.

 

Strong publication record in top-tier audio/speech/vision conferences (ICASSP, InterSpeech, CVPR, ECCV, ICCV) and journals.

 

Expertise in deep learning techniques (RNN’s, CNN’s, LSTM, reinforcement learning)

 

Strong knowledge on Computer Science and Signal Processing and ability to understand and implement complex algorithms.

 

Familiarity with Python research stack (Numpy, Matplotlib, Jupyter, OpenCV) is a plus

 

TensorFlow, Caffe, Torch, CNTK experience is a plus

 

 

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances.  We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.

 

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

Mission

We’re passionate about connecting highly skilled women with leading companies commited to diversity and inclusion

Are you looking for your dream job? In Office. Flexible. Remote.

Join our Movement

Are you hiring? Join our platform for diversifying your team

Post a job