Lead Infrastructure Engineer - #2129079
Pallon
Date: vor 15 Stunden
Stadt: Berlin
Vertragstyp: Ganztags
Arbeitsplan: Volle Tag

About Pallon
At Pallon, a spin-off from ETH Zurich, we’re creating AI that automatically detects defects in sewer inspection videos and advises cities on when & how to fix them. By providing more precise, objective data, we aim to fix wastewater leaks, reduce CO2 emissions, and prevent urban flooding. Our mission is to make cities more sustainable and resilient.
The Role
We're looking for a seasoned infrastructure engineer to take full ownership of our infrastructure — from our high-performance GPU cluster to our cloud systems. You’ll be joining a small, deeply technical team building cutting-edge computer vision and deep learning systems.
This is a hands-on, high-impact role. You’ll lead critical decisions around architecture, performance, and scale, while also jumping in to solve real-world issues — whether that’s designing GPU scheduling strategies, tuning networking performance, or swapping out hardware.
You’ll collaborate closely with our platform and computer vision teams to make sure their tools run fast, reliably, and securely — and you'll have the autonomy to shape how that all comes together.
In this role, you might find yourself:
You don’t need to have experience with all of this — but here’s what we use today:
Benefits & Team Culture
As a part of Pallon, you will:
At Pallon, we highly value equality of opportunity and inclusivity, and we would like to particularly encourage women and candidates from under-represented backgrounds to apply, even if you don’t match with 100% of the requirements.
At Pallon, a spin-off from ETH Zurich, we’re creating AI that automatically detects defects in sewer inspection videos and advises cities on when & how to fix them. By providing more precise, objective data, we aim to fix wastewater leaks, reduce CO2 emissions, and prevent urban flooding. Our mission is to make cities more sustainable and resilient.
The Role
We're looking for a seasoned infrastructure engineer to take full ownership of our infrastructure — from our high-performance GPU cluster to our cloud systems. You’ll be joining a small, deeply technical team building cutting-edge computer vision and deep learning systems.
This is a hands-on, high-impact role. You’ll lead critical decisions around architecture, performance, and scale, while also jumping in to solve real-world issues — whether that’s designing GPU scheduling strategies, tuning networking performance, or swapping out hardware.
You’ll collaborate closely with our platform and computer vision teams to make sure their tools run fast, reliably, and securely — and you'll have the autonomy to shape how that all comes together.
In this role, you might find yourself:
- Designing and building a custom GPU cluster for deep learning workloads.
- Deciding how we manage and scale our infrastructure — both on-prem and in the cloud.
- Keeping systems running smoothly and securely — from data pipelines to distributed training jobs.
- Troubleshooting weird kernel errors, configuring systemd units, or debugging Kubernetes evictions.
- Making calls on when to script, when to automate, and when to just fix the thing.
- You’ve spent 5+ years owning infrastructure end-to-end, ideally in startup environments.
- You’re comfortable at every layer — from bare-metal servers and NVMe drives to container orchestration and cloud-native tools.
- You have strong Linux fundamentals, and you know your way around networking, storage, and distributed systems.
- You can code well enough to automate, debug, and build tooling across a variety of languages.
- You communicate clearly and collaborate well — especially with engineers who aren’t infra specialists.
- You thrive with autonomy and can manage your own priorities effectively.
- You’re curious and fast-learning, especially when tackling new tools or challenges.
- You have a university degree in Computer Science or a related field.
- Experience with machine learning infrastructure or HPC clusters.
- Familiarity with data engineering workflows and ETL pipelines.
You don’t need to have experience with all of this — but here’s what we use today:
- HPC Cluster (our hardware, colocated in a datacenter): Linux, Nvidia GPUs, Slurm, Infiniband
- Cloud: Google Cloud Platform, Kubernetes, Docker, GitLab CI/CD
- Data Analytics: DBT, BigQuery, Metabase
Benefits & Team Culture
As a part of Pallon, you will:
- Contribute to a positive impact on society and the environment.
- Develop a novel product that changes a whole industry.
- Be part of a motivated, smart, fun, and supportive team of software engineers and AI researchers.
- Own a part of Pallon and have a part in our success with our Employee Stock Option Plan (ESOP).
- Work for the Underworld, not the Devil: exploring sewers virtually and in real life during our Pallon offsites.
- Work from home or enjoy access to our beautiful office space located in Zürich.
At Pallon, we highly value equality of opportunity and inclusivity, and we would like to particularly encourage women and candidates from under-represented backgrounds to apply, even if you don’t match with 100% of the requirements.
Wie bewerbe ich mich?
Um sich für diesen Job zu bewerben, müssen Sie auf unserer Website autorisieren. Wenn Sie noch kein Konto haben, registrieren Sie sich bitte.
Veröffentlichen Sie einen LebenslaufÄhnliche Jobs
Electrical Engineer Internship in Canary Island
EX Venture Inc.,
vor 15 Stunden
Title: Electrical Engineer Intern Location: Canary Islands, Spain (Remote) Type: Internship (4-6 months) Erasmus+ funding available We Need People Who Can Join ASAP! About The Role EX Venture Academy is seeking a highly motivated Electrical Engineer Intern with a Master’s...

Informatiker (w/m/d) als IT Service Owner Basis-IT
Die Autobahn GmbH des Bundes,
€48,500
-
€60,000
/ Jahr
vor 19 Stunden
Informatiker (w/m/d) als IT Service Owner Basis-IT Standort(e): Berlin, DE, 10557 Unternehmen: Zentrale Fachbereich: IT Erfahrungsniveau: Berufserfahrene Entgeltgruppe: E13 Vertrag: Unbefristet Gemeinsam. Sicher. Mobil. Die Autobahn GmbH nimmt bei der Digitalisierung mit ihrer Cloud-First-Strategie eine Vorreiterrolle ein. Die Standorte der...

MTA / MTLA Medizinisch-technischer Laboratoriumsassistent
Labor Berlin,
€34,500
-
€47,500
/ Jahr
vor 19 Stunden
MTA / MTLA Medizinisch-technischer Laboratoriumsassistent (m/w/d) Labor Berlin wurde zum 1. Januar 2011 als Tochterunternehmen der Charité – Universitätsmedizin Berlin und Vivantes Netzwerk für Gesundheit GmbH gegründet. Über 750 Mitarbeitende versorgen mehr als 25.000 Krankenhausbetten in Berlin und dem gesamten...
