Job Openings

Principal ML Engineer, AMD

Written By : Srinivas
Reviewed By : Sankha Ghosh

AMD is looking to hire a Principal ML Engineer in Bengaluru to drive the development of large model distributed training on GPUs. In this full-time position, you will help optimize ML pipelines, contribute to open source AI, and push the limits of generative AI. Applicants should have over 10 years of experience with PyTorch, TensorFlow or JAX, strong Python capabilities, and experience with distributed training. If you want to work with a leading-edge AI team at AMD, please apply.

Location: Bengaluru, India

Job Type: Full Time

Job ID: 37328

Work Area: Artificial Intelligence, Engineering

Role: Principal ML Engineer

Apply: Click Here

Primary Responsibility

The ideal candidate has experience with distributed training pipelines, knowledge of parallel algorithms (Data, Tensor, Pipeline, ZeRO), and large model training. 

  • Train large models to convergence on AMD GPUs.

  • Improve the end-to-end training pipeline performance.

  • Optimize the distributed training pipeline and algorithm to scale out.

  • Contribute changes to open source.

  • Up to date with the latest training algorithms.

  • Influence the direction of AMD AI platform.

  • Cross teams collaborate with various groups and stakeholders.

Preferred Experience

  • 10+ years of experience.

  • Experience in ML frameworks such as PyTorch, JAX, or Tensorflow.

  • Experience with distributed training and a distributed training framework, such as DeepSpeed.

  • Experience with LLM or Vision, extensive models, is a plus.

  • Excellent Python programming skills, including debugging, profiling, and perf analysis.

  • Experience with ML pipeline.

  • Strong communication and problem-solving skills.

Academic Requirements 

A master’s degree in computer science, artificial intelligence, machine learning, or a related field.

About AMD

AMD (Advanced Micro Devices) is a global leader in high-performance computing and graphics solutions, providing the foundation for innovation in data centers, artificial intelligence, personal computers, gaming, and embedded systems. AMD has a culture of collaboration, humility, and innovation, and is helping to change the future of adaptive computing and improve people's lives worldwide.

Join our WhatsApp Channel to get the latest news, exclusives and videos on WhatsApp

Shiba Once Made Millionaires, Can This Meme Coin Do It Next, in The Next Coming Crypto Bull Run?

Cold Wallet Soars with 3,423% ROI Potential as Dogecoin (DOGE) Price Sentiment Weakens & Pi Network Struggles at Resistance

$387M Raised & 3M Users Join - BlockDAG Crushes Baby Bitcoin & Mirror Chain Presales

Pepeto (PEPETO) Price Prediction, Why It’s The Best Crypto To Buy, Before The Next Bull Run

3 Best Meme Coins For Exponential Returns With Massive Roi Potential