AI Research Engineers - Apertus Initiative — EPFL
Role overview
Introduction
The Apertus Initiative, a joint effort between EPFL and ETH Zürich, seeks talented engineers to join our team developing the next generation of open foundation models.
Working with the world's most powerful public AI supercomputer (Alps), we aim to create open and transparent LLMs and multimodal AI models that benefit Swiss society while maintaining technological sovereignty.
Main duties and responsibilities
We are looking for AI Research Software Engineers with expertise in one or more of the following areas:
• Distributed AI training and optimization
• Data curation (text and multimodal)
• Model evaluation and benchmarking
• Inference systems and deployment
• Reasoning
• Multi-modal AI models
Profile
• MSc in Computer Science, Data Science, AI or related field (exceptional BSc candidates will also be considered)
• Experience in AI and neural network architectures
• Strong programming skills in Python and knowledge of PyTorch or similar ML frameworks
• Experience with software development practices, including version control systems, debugging, testing, and deployment