Firmenlogo

Remote Deep Learning Engineer - LLMs and Diffusion Models

NVIDIA  ·  Schweiz, Schweiz · Remote

Details zum Jobangebot

NVIDIA's technology is at the heart of the AI revolution, touching people across the planet by powering everything from self-driving cars, robotics, and voice-powered intelligent assistants. Academic and commercial groups around the world are using NVIDIA GPUs to revolutionize deep learning and data science, and to power data centers.

We are looking for a Deep Learning Engineer with strong research skills to join the DL Algorithms team that is developing new efficient DL architectures. In this role you will interact with the scientific community creating algorithms to accelerate the training and inference of Large Language Models and Diffusion Models.

What You’ll Be Doing

  • Invent and develop model optimization algorithms based on Neural Architecture Search, Pruning, Knowledge Distillation, Quantization, Conditional Computation, Model fine-tuning, etc.
  • Work in a dynamic, applied team of researchers and engineers
  • Work on large-scale multi-node ML models
  • Publish research papers and implement the results in Nvidia products
  • Collaborate with academia

What We Need To See

  • M.Sc. plus 4 years of commercial experience in Computer Science, Artificial Intelligence, Applied Math, or related field
  • Machine learning fundamentals (linear algebra, probability theory, optimization, supervised/unsupervised/self-supervised ML, etc.)
  • Hands-on experience with designing Deep Learning models (Transformers, Diffusion Models, Convolutional Neural Networks etc.)
  • Programming skills (Python, C/C++), algorithms & data structures, debugging, performance analysis, and design skills.
  • Strong experience with deep learning frameworks such as PyTorch or TensorFlow
  • Ability to work independently and handle your own work effort
  • Good communication and documentation habits

Ways To Stand Out From The Crowd

  • Ph.D. degree or equivalent experience in Computer Science, Artificial Intelligence, Applied Math, or related field
  • Strong track of publications on Deep Learning in leading international conferences/journals
  • Experience with ML model optimization techniques such as Neural Architecture Search, Pruning, Distillation, Quantization, Conditional Computation, etc.
  • Knowledge of CPU and/or GPU architectures in the context of ML algorithms

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

Jetzt bewerben

Weitere Jobs