Posted 1 week ago
Senior Kernel Developer
The salary will be disclosed if the recruiter contacts you
Luxoft
This is an external posting; you must complete the application process on the company's site.
About the job
Category: Information Technology - Systems
Subcategory: Software Development - Programmer
Minimum education required:
Details
Contract type: Permanent
Workplace: On-site
Description
Project description
Luxoft is looking for an AI software development engineer to develop ML kernels in the Triton kernel language. We are looking for an engineer who is passionate about optimizing Machine Learning GPU kernels and improving the performance of key applications and benchmarks. What you do directly impacts the performance of AMD GPUs and enables us to become a competitive solution for generative AI. Become a part of our high-impact and incredibly talented Triton kernels team.
Responsibilities
Develop ML kernels for matrix multiplication, Flash Attention and other ML operators
Benchmark, perform competitive analysis and optimize kernels to improve performance
Collaborate with the GPU architecture team to improve future GPU generations
Apply knowledge of software engineering best practices
Skills
Must have
Proficiency with C/C++
Proficiency in CUDA or HIP / ROCm or OpenCL programming
Solid understanding of parallel programming models and optimization techniques
Strong problem-solving skills and the ability to work in a collaborative environment
Nice to have
Familiarity with models like Llama, Mixtral and Gemma is a plus
Knowledge of MLIR, LLVM, GPU assembly and GPU architecture is a plus
Familiarity with PyTorch or JAX
Other
Languages
English: B2 Upper Intermediate
Seniority
Senior
Remember that no recruiter may ask you for money in exchange for an interview or a position. Likewise, avoid making payments to companies or sharing financial information with them.
ID: 20497921