Job Details
Meta is looking for software engineers to help scale and improve the efficiency of large AI/ML work loads. A part of this is enabling high performance interconnect (HPI) solutions, optimising collective operations to improve machine learning model performance.This is an opportunity to work within a highly skilled team, collaborating with a large set of cross-functional partners and help bringing next generation large cluster architectures to life.
Software Engineer - Systems Specialist Responsibilities:
Minimum Qualifications:
Preferred Qualifications:
About Meta:
Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today—beyond the constraints of screens, the limits of distance, and even the rules of physics.
Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.
Software Engineer - Systems Specialist Responsibilities:
- Support networking and compute hardware acceleration techniques to improve ML inference and training model performance
- Implement ML model optimisation features
- Debug custom and third party multi-host, accelerator enabled AI platforms
- SW development using C++/C and Python
- Work closely with other teams to deliver impact
- develop & improve features and innovations
- Extend and optimize large scale learning collective operations
Minimum Qualifications:
- Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
- Specialized experience in one or more of the following machine learning/deep learning domains: Hardware accelerators, AI Infrastructure, OR high performance computing,
- Experience of ML systems & AI Frameworks (like PyTorch)
- Solid experience developing in C++/C
- English language proficiency
Preferred Qualifications:
- GPU architecture experience
- Experience with distributed systems at scale
- Parallel programming in MPI, OpenMP, Posix threads or similar distributed frameworks or languages
- Experience of large scale machine learning clusters
About Meta:
Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today—beyond the constraints of screens, the limits of distance, and even the rules of physics.
Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.
Company Details
Meta
Menlo Park, CA, United States
The Facebook company is now Meta. Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched... Read more