Senior Hardware Engineer (R&D / GPU / AI)
Location: United States
The role
Nebius is looking for a System Engineer (Servers Hardware R&D Team) to support our expanding North American operations. This position requires occasional on-site presence at our data center locations.
Your responsibilities will include:
- Participate in the design, deployment, and maintenance of high-performance cloud systems optimized for AI workloads.
- Arrange and perform hardware R&D tests and experiments on-site in data center environments.
- Troubleshoot and resolve complex system issues related to GPUs, networking (InfiniBand, NVLink), PCIe, and server infrastructure.
- Conduct deep investigations into hardware, software, and networking issues to ensure optimal system performance and reliability.
- Develop and execute test plans and methodologies for advanced GPU, InfiniBand, and compute systems to benchmark and validate performance.
- Collaborate closely with cross-functional engineering and operations teams to improve system performance and reliability.
- Monitor system performance and continuously fine-tune configurations for maximum efficiency.
What we expect you to have:
- Strong knowledge of modern server architecture, particularly in high-performance, GPU-based environments.
- Hands-on experience with GPUs, networking, NVLink, and PCIe technologies.
- Proficiency in Linux systems, with experience using Python and Bash for automation and tooling.
- Demonstrated ability to troubleshoot complex hardware, software, and networking issues.
- Experience with deep problem investigation, root cause analysis, and performance optimization in cloud or high-performance computing environments.
- Strong analytical and problem-solving skills with a performance-first mindset.
- Basic electronics modification skills, including soldering and wiring.
It will be a bonus if you have:
- Knowledge of the Linux kernel and experience with kernel-level debugging or troubleshooting.
- Familiarity with electronic measurement equipment such as oscilloscopes and multimeters.
Key employee benefits:
- Health insurance: 100% company-paid medical, dental, and vision coverage for employees and families.
- 401(k) plan: up to 4% company match with immediate vesting.
- Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.
- Remote work reimbursement: up to $85/month for mobile and internet.
- Disability & life insurance: company-paid short-term disability, long-term disability, and life insurance coverage.
Compensation
- We offer competitive salaries, ranging from $150k to $200k base, plus quarterly performance bonuses.
Join Nebius Today!