Description
The role
Nebius operates complex, mission-critical infrastructure environments. As a Systems Administrator, you will be responsible for maintaining, troubleshooting, and integrating Linux-based systems that support our production platforms. This role sits at the intersection of systems, networking, and automation, with a strong focus on diagnosing issues and keeping services reliable.
You will work closely with infrastructure, networking, and data center teams, and will occasionally travel to data centers to support hands-on troubleshooting and systems integration when needed.
Your responsibilities will include:
- Administer, monitor, and maintain Linux-based systems in production environments
- Debug and resolve system, service, and network-related issues
- Operate and support core network services such as DNS, DHCP, NTP, and related infrastructure services
- Perform systems integration across hardware, operating systems, and internal platforms
- Build and maintain automation for routine operational tasks
- Support incident response, root cause analysis, and post-incident follow-up
- Work with data center and hardware teams during deployments and maintenance
- Document systems, procedures, and operational runbooks
- Participate in on-call or operational support rotations as required
What we expect you to have:
- Professional experience as a systems administrator or in a similar operations role
- Strong hands-on experience with Linux systems and troubleshooting
- Solid understanding of networking fundamentals and common network services
- Proven ability to debug complex issues spanning systems and networks
- Experience with systems integration and operational tooling
- Comfort working in production environments, with a strong sense of ownership
- Clear communication skills and ability to collaborate across teams
It will be an added bonus if you have:
- Programming or scripting experience (Python preferred)
- Experience with automation tools or configuration management systems
- Familiarity with data center operations or on-site infrastructure
- Experience supporting large-scale or distributed systems
- Exposure to cloud or hybrid infrastructure environments
Working conditions:
- Primarily remote or office-based role, depending on location
- Occasional travel to data centers may be required, especially if you are not located near one
- Collaboration with globally distributed engineering and operations teams
Key employee benefits:
- Health insurance: 100% company-paid medical, dental, and vision coverage for employees and families
- 401(k) plan: up to 4% company match with immediate vesting
- Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers
- Remote work reimbursement: up to $85/month for mobile and internet
- Disability & life insurance: company-paid short-term, long-term, and life insurance coverage
Compensation
We offer competitive salaries, ranging from $100k to $140k base, plus quarterly performance bonuses.
Join Nebius and help operate the systems that power next-generation AI infrastructure.
Company
Nebius provides an AI-focused cloud platform enabling scalable GPU clusters (from a single GPU to thousands of NVIDIA GPUs) with pre-configured drivers, InfiniBand networking, and orchestrators such as Kubernetes or Slurm. It offers fully managed services (MLflow, PostgreSQL, Apache Spark), cloud-native tooling (Terraform, API, CLI), ready-to-go solutions, and expert support. Nebius also runs data centers, is active in AI research collaborations and the open-source AI ecosystem (for example, vLLM and CRISPR-GPT), and partners with NVIDIA as a Reference Platform Cloud Partner.