Data Reliability Engineer
Ensure our ETL and orchestration systems are observable, reliable and fast.
Responsibilities
- Define and own SLIs/SLOs for pipelines and schedulers
- Build alerting and auto‑remediation playbooks
- Improve ingestion/transform performance and cost
- Lead on‑call and incident response with clear communication
- Instrument tracing and logs for root‑cause analysis
- Partner with product and solutions teams to harden runbooks
- Scale environments with infra‑as‑code
- Mentor engineers on reliability best practices
- Continuously raise the bar for quality
Requirements
Must have
- 3+ years of experience in SRE, platform or data engineering
- Strong SQL and proficiency in one programming language (Python or Go)
- Hands‑on with metrics, logs and traces
Bonus
- Warehouse internals (Snowflake/BigQuery), dbt
- Kafka/Kinesis, event‑driven architectures
- Kubernetes, Terraform, cloud networking
Benefits
- Competitive salary + equity
- Remote‑friendly, flexible hours
- Learning, hardware and wellness budgets
Apply
Send your resume and a brief description of a reliability project you have worked on.
