Site Reliability Engineer - Linux - On-prem / No-cloud / GPU's
Our client is a fast-growing Dutch start-up that develops innovative AI solutions for healthcare. They develop proprietary AI models / LLMs for this, which run on their own servers. To manage and optimize this (no-cloud) environment, they are looking for a Site Reliability Engineer.
This position does not offer work permit /visa sponsorship and is therefore only open to candidates with EU/EEA country citizenship, or otherwise not in need of sponsorship. Thanks for your understanding.
The role
As a Site Reliability Engineer, you are responsible for setting up and managing the on-premise infrastructure with the latest hardware, including NVIDIA GPUs. You work together with a small team that develops the software and AI models internally and ensures that the systems are optimized for performance and security.
Focus is primarily on network and server management (site reliability), but ideally you can also support the build pipelines. DevOps tasks will be limited and it’s a no-cloud environment. This means:
- Managing on-premise servers
- Configuring and maintaining the Kubernetes cluster according to a GitOps approach
- Ensuring that all resources are available reliably and securely
- Ensuring optimal stability, uptime and performance
What do we need?
Key knowledge and experience:
- At least 3 years of relevant work experience
- Knowledge of managing an on-premise, no-cloud, Linux environment
- Focus on servers, networking and storage (less DevOps)
- Experience with Kubernetes and Docker
- Proficient with Bash and Python
- You prefer to work on solutions with positive societal impact
- You are okay with working in a small startup/scale-up environment
- You are pragmatic, hands-on and do not get stuck in over-analyzing problems
- Fluent in English and open to work in a company with mainly Dutch colleagues
- You don’t need work permit/visa sponsorship and you preferably live in The Netherlands or will move here on short notice (not dependent on job offer).
Nice to have:
- Experience with monitoring tools (Prometheus, Grafana)
- Experience with managing GPU resources
- Experience with GitOps (FluxCD)
What can you expect?
- A good salary and travel allowance
- 31 vacation days
- Option to work from home up to 2 days a week
- Possibility to grow into a lead role when the company grows
- You will become part of a company that develops its own state-of-the-art AI models and has access to the latest hardware.
- The opportunity to make a meaningful contribution to improving healthcare
- Wine tastings on Friday afternoons and fun team outings
-
Geen categorie
-
5757 Keer bekeken
- Salaris 100,000.00€ Per jaar
- Land Netherlands
- Stad Leiden
- Solliciteer direct! Bezoek website
- Vacature link Bezoek website
- 0
- Per jaar
- Breda
- 0
- Per jaar
- Delft
- 0
- Per jaar
- Rotterdam
- 0
- Per jaar
- Halfweg
We schrijven zelden, maar alleen de beste inhoud.
Controleer uw e-mail voor een bevestigingsmail.
Pas nadat u uw e-mailadres heeft bevestigd, wordt u geabonneerd op onze nieuwsbrief.