Principal ML Engineer, AMD

Written By: Srinivas
Reviewed By: Sankha Ghosh
Published on

AMD is hiring a Principal ML Engineer in Bengaluru to drive the distributed training of large models on GPUs. In this full-time position, you will help optimize ML pipelines, contribute to open-source AI, and push the limits of generative AI. Applicants should have more than 10 years of experience, familiarity with PyTorch, TensorFlow, or JAX, strong Python skills, and experience with distributed training. If you want to work with a leading-edge AI team at AMD, please apply.

Location: Bengaluru, India

Job Type: Full Time

Job ID: 37328

Work Area: Artificial Intelligence, Engineering

Role: Principal ML Engineer


Primary Responsibilities

The ideal candidate has experience with distributed training pipelines, large model training, and parallelism strategies (data, tensor, pipeline, ZeRO).

  • Train large models to convergence on AMD GPUs.

  • Improve the end-to-end training pipeline performance.

  • Optimize the distributed training pipeline and algorithms to scale out.

  • Contribute changes to open source.

  • Stay up to date with the latest training algorithms.

  • Influence the direction of the AMD AI platform.

  • Collaborate across teams with various groups and stakeholders.

Preferred Experience

  • 10+ years of experience.

  • Experience in ML frameworks such as PyTorch, JAX, or TensorFlow.

  • Experience with distributed training and a distributed training framework, such as DeepSpeed.

  • Experience with large models, such as LLMs or vision models, is a plus.

  • Excellent Python programming skills, including debugging, profiling, and perf analysis.

  • Experience with ML pipelines.

  • Strong communication and problem-solving skills.

Academic Requirements 

A master’s degree in computer science, artificial intelligence, machine learning, or a related field.

About AMD

AMD (Advanced Micro Devices) is a global leader in high-performance computing and graphics solutions, providing the foundation for innovation in data centers, artificial intelligence, personal computers, gaming, and embedded systems. AMD has a culture of collaboration, humility, and innovation, and is helping to change the future of adaptive computing and improve people's lives worldwide.

Analytics Insight: Latest AI, Crypto, Tech News & Analysis
www.analyticsinsight.net