The Core Infrastructure organization builds and operates reliable, scalable and repeatable cloud infrastructure so that DoorDash developers can ship great products quickly.
The Observability team within Core Infrastructure builds and operates systems to provide introspection into the health and performance of all DoorDash systems and services. These observability systems are used by all DoorDash developers!
About the Role
Your responsibilities include:
Plan and execute infrastructure projects that improve observability tools and platforms, including metrics, logging, distributed tracing, dashboarding, alerting, and application performance management
Develop and maintain systems that enable our engineering colleagues to confidently run their services with high quality, and quickly understand and mitigate any technical issues
Partner with our observability vendors to improve their products and align with our roadmap needs
Work with engineering colleagues to contribute to our observability strategy with a strong emphasis on open source libraries, data formats and query languages

You’re excited about this opportunity because you will…
Join a growing company and grow right along with us
Take on significant technical challenges and have a large impact
Have the ability to shape and improve our engineering culture
Build upon an established foundation with a focus on innovation

We’re excited about you because…
You have at least 5 years of experience as a backend software engineer
You have experience building and operating infrastructure at scale
You apply a product mindset to infrastructure systems and feel accomplished enabling others
You strive for design simplicity and consistency above all else
You solve problems using software to automate or prevent toil
You love data, and identifying patterns and correlations that provide new insights is exciting to you

Nice to have...
Experience with these specific technologies or similar alternatives is not required but helpful.
Development using GoLang, Kotlin, and/or Python
Infrastructure: AWS, Kubernetes, Helm, Terraform, Service Mesh, Envoy
Observability: OpenTelemetry, Prometheus/PromQL, Grafana, Fluent Bit
Data: Kafka, ClickHouse, SQL, Redis
Previous experience running a production service from a DevOps or SRE perspective
Notice to Applicants for Jobs Located in NYC or Remote Jobs Associated With Office in NYC Only
We use Covey as part of our hiring and/or promotional process for jobs in NYC and certain features may qualify it as an AEDT in NYC. As part of the hiring and/or promotion process, we provide Covey with job requirements and candidate-submitted applications. We used Covey Scout for Inbound from August 21, 2023, through December 21, 2023, and resumed using Covey Scout for Inbound on June 29, 2024.
The Covey tool has been reviewed by an independent auditor. Results of the audit may be viewed here: Covey