Nebius

Data Engineer

Added 3 months ago

Description

The role 

The Data Engineering team is responsible for building and maintaining the robust data infrastructure that powers analytics and business intelligence across Nebius. We design and implement scalable data pipelines, optimize data storage and processing, and enable data-driven decision-making across the organization. The team works closely with product teams and business stakeholders to ensure alignment with company goals.

We are looking for a Data Engineer to design, build, and maintain our data infrastructure and pipelines. You will work on processing large-scale datasets, optimizing data workflows, and enabling analytics capabilities that support our rapidly growing cloud platform.

 

Your responsibilities: 

  • Design, develop, and maintain scalable data pipelines.
  • Build and optimize data infrastructure.
  • Implement data quality monitoring and validation frameworks.
  • Optimize data storage, processing, and query performance for large-scale datasets.
  • Design and implement data models for analytics and reporting use cases.
  • Develop tools and automation to improve data engineering workflows and productivity.
  • Ensure data governance, security, and compliance standards are met.
  • Participate in the on-call rotation to support production data systems.

Must-haves: 

  • 3+ years of experience in data engineering or related roles.
  • Experience building and maintaining data pipelines using orchestration tools (e.g., Airflow, Prefect, Dagster).
  • Strong proficiency in SQL and solid programming skills in Python.
  • Experience with distributed data processing frameworks (e.g., Apache Spark).
  • Knowledge of data modeling principles and best practices.
  • Understanding of data architectures and storage systems.

Nice-to-haves: 

  • Experience with real-time data streaming platforms.
  • Familiarity with Infrastructure as Code tools (e.g., Terraform).
  • Experience with containerization (Docker) and Kubernetes.
  • Knowledge of data governance and privacy frameworks (GDPR, SOC 2).
  • Knowledge of data quality and observability tools (e.g., Great Expectations).

We conduct coding interviews as part of the process.

Company

Nebius provides an AI-focused cloud platform with scalable GPU clusters (from a single GPU to thousands of NVIDIA GPUs), pre-configured drivers, InfiniBand networking, and orchestrators such as Kubernetes or Slurm. It offers fully managed services (MLflow, PostgreSQL, Apache Spark), cloud-native tooling (Terraform, API, CLI), ready-to-go solutions, and expert support. Nebius also runs data centers, takes part in AI research collaborations and the open-source AI ecosystem (for example, vLLM and CRISPR-GPT), and is an NVIDIA Reference Platform Cloud Partner.
