Post-Training Engineer - Apertus — EPFL

CHF 60'500 - 91'500

EPFL · Lausanne (VD)

Categoria: Ingegneria Contratto: full-time Salario: CHF 60'500 - 91'500

Apply now

Location: Lausanne
Contract: full-time
Posted: 67 days ago

SalaryCHF 60'500 - 91'500

Role overview

Introduction

The Apertus project, a joint effort between EPFL and ETH Zürich, is seeking a practical and motivated engineer to help build the next generation of open foundation models.

The successful candidate will help develop and run post-training and reinforcement learning pipelines for the Apertus project.

Introduction
The Apertus project, a joint effort between EPFL and ETH Zürich, is seeking a practical and motivated engineer to help build the next generation of open foundation models.
Main duties and responsibilities
The engineer will contribute to the development, execution, and evaluation of scalable post-training workflows for Apertus. Infrastructure and systems engineering

Main responsibilities

Main duties and responsibilities
The engineer will contribute to the development, execution, and evaluation of scalable post-training workflows for Apertus. Infrastructure and systems engineering
Build and maintain containerized environments for LLM post-training and RL workloads.
Adapt containers and dependencies for execution on Alps / CSCS infrastructure.
Run and monitor Slurm-based training and evaluation jobs.
Debug failures related to distributed execution, checkpointing, filesystem performance, networking, and GPU utilization.
Help maintain reproducible training recipes, configuration files, launch scripts, and documentation.
Work with researchers and CSCS engineers to improve the reliability and performance of large-scale experiments.
LLM post-training and Reinforcement Learning
Support SFT, preference optimization, and reinforcement learning workflows.

Additional details

The engineer will contribute to the development, execution, and evaluation of scalable post-training workflows for Apertus. Infrastructure and systems engineering
Debug common post-training issues, including optimization instability, reward hacking, regressions, and evaluation failures.
Strong collaboration and communication skills and ability to work across research and engineering teams. Strongly preferred
Experience with frameworks such as veRL, slime, Megatron-LM, DeepSpeed, TRL, vLLM, SGLang, or similar tools.
Experience with large-scale evaluation pipelines.

Notes and original content

The engineer will contribute to the development, execution, and evaluation of scalable post-training workflows for Apertus.
Infrastructure and systems engineering
Strong collaboration and communication skills and ability to work across research and engineering teams.
Strongly preferred
Nice to have

Apply now

Post-Training Engineer - Apertus — EPFL

Role overview

Main responsibilities

Additional details

Notes and original content

Related jobs

Articles for cross-border workers

Explore similar jobs