Microsoft Teams is the hub for teamwork that integrates all the people, content, and tools your team needs to be more engaged and effective. It is core to Microsoft’s modern work, modern life & modern education value prop. We are reinventing the way people communicate and work together across the globe.
We are looking to hire a student pursuing their MSc./Ph.D. for a 12-week an internship, joining
CMD Labs – an applied science team within Microsoft Teams – to work on
improving transcription accuracy for AI applications, with emphasis on named entities (e.g., names of people/products/teams etc.) by applying existing or novel research and leveraging training, fine tuning, and prompt engineering of speech transformer models, as well as LLMs and audio-enabled foundations models as post-processing and re-scoring modules.
Our flagship AI applications for Teams Meetings such Team Copilot, Personal Meeting Copilot and Intelligent Recap are all fully dependent on an accurate meeting transcription as the primary grounding data. Within transcription, the importance of named entities - names of people, projects, products, companies and places - is often the most important, and yet the most challenging for the transcription engine since the names might not be a part of the model's training data.
Responsibilities
- Conduct experiments, create and validate metrics, and develop candidate algorithms to improve the accuracy of transcription of named entities and reduce chances of error in downstream LLM-based applications.
- Collaborate closely with CMD Labs researchers and engineers to leverage existing assets, datasets, and ensure results can be leveraged back into the product.
- Embody our culture and values
Qualifications
Internship Duration: 12 Weeks
Required Qualifications
- Currently enrolled in M.Sc./ Ph.D. program in Computer Science, Electrical or Computer Engineering, Statistics, or a related field.
- Practical experience in training, fine-tuning, and prompt engineering of transformer models or LLMs.
- Practical Python coding experience leveraging PyTorch or TensorFlow or similar framework
Preferred Qualifications
- Field of research and publications directly related to transcription or the Audio LLMs.
Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.