Description
We are looking for a Lead Data Engineer to own the event ingestion and identity layer that connects product instrumentation to downstream analytical systems. This role focuses on the operational reliability and correctness of event and identity data as it moves through the data platform. You will design and operate pipelines, schema validation, and replay workflows that ensure product events remain consistent and safe to use for analytics and customer-facing reporting. You will work closely with product engineering teams on instrumentation patterns, with the CDP team on event contracts and definitions, and with platform teams to ensure event infrastructure and analytical systems scale reliably. This role builds the foundational event and identity datasets required for reliable downstream modeling. Behavioral models, canonical entities, and business analytics datasets are owned by the analytics engineering team.
Responsibilities:
- Define event schemas, required fields, and compatibility rules in collaboration with the CDP team
- Implement automated validation and contract enforcement to prevent breaking schema changes
- Maintain versioning and compatibility guarantees for event producers and downstream consumers
- Build and maintain pipelines that ingest, validate, and process high-volume product events
- Ensure event streams are deduplicated, ordered correctly, and safe for downstream consumption
- Partner with platform teams to ensure ingestion pipelines scale with product growth
- Define and maintain identity stitching logic across anonymous and authenticated users
- Handle identity merges, splits, and corrections while preserving tenant boundaries
- Ensure identity resolution remains explainable, deterministic, and safe for downstream datasets
- Design workflows that allow event datasets and identity graphs to be replayed or rebuilt safely
- Build tooling for historical corrections, schema evolution, and dataset reprocessing
- Ensure downstream models can be rebuilt without manual intervention when definitions evolve
- Provide guidance and tooling that help product teams emit events consistently
- Maintain validation checks and schema enforcement that catch instrumentation issues early
- Collaborate with engineering teams to evolve instrumentation safely over time
- Ensure deletion and suppression requests propagate correctly through event and identity pipelines
- Partner with governance and security teams to support policy requirements
- Define requirements and interfaces for event infrastructure and downstream analytical systems
- Work with platform teams to ensure pipelines remain reliable, scalable, and observable.
Company
GoHighLevel provides an all-in-one AI-powered platform for business growth, including CRM, automation, websites, funnels, scheduling, invoicing, reviews, and marketing tools aimed at helping agencies grow their clients’ businesses.
Related postings
Weekday
Lead Data scientistIndia and 2 othersS&P Global
Lead Data ScientistIndia and 2 othersSutherland
Lead Data EngineerBengaluru, Karnataka, IndiaS&P Global
Lead Data EngineerMadhapur Rd, Rai Durg, Hyderabad, Telangana, India