Sr Software Development Engineer
What You’ll Do
- Develop and Maintain Core Platform Services – Design, build, and maintain highly scalable and resilient microservices supporting platform-wide capabilities. Ensure our cloud platform is modular, extensible, and meets the needs of multiple product teams.
- Enhance System Reliability & Observability – Implement and manage robust monitoring and alerting systems using Datadog to ensure operational visibility and proactive issue resolution. Drive best practices for logging, tracing, and monitoring across platform services.
- Infrastructure as Code & Cloud Automation – Utilize Infrastructure as Code (IaC) tools such as Terraform and Terragrunt to automate and streamline cloud infrastructure provisioning and management. Optimize deployment pipelines to improve reliability and developer efficiency.
- Technical Leadership & Cross-Team Collaboration – Provide technical leadership in system design, architecture, and best practices for building scalable services. Collaborate with product managers, engineers, and other teams to align platform capabilities with business needs. Act as a connector across teams, fostering collaboration and ensuring smooth integrations.
- Operational Excellence & Continuous Improvement – Participate in operational reviews, post-mortems, and reliability initiatives to enhance system stability. Create follow-up actions for incident resolution and continuously work to improve system reliability and scalability. Drive efforts to reduce technical debt and improve engineering efficiency through automation and best practices.
What You’ll Bring
- Software Development Experience – Proven track record of delivering enterprise-ready, cloud-based systems with a focus on performance, security, and scalability.
- Modern Software Practices – Strong proficiency in one or more programming languages: C#, Go, or TypeScript. Experience with API services, distributed systems, and microservice architectures.
- Cloud & Site Reliability Engineering (SRE) Skills – Deep understanding of AWS, Google Cloud, or Azure with hands-on experience designing for scalability, observability, and reliability. Knowledge of Kubernetes (EKS/GKE), Docker, and cloud-native application design.
- Infrastructure Automation & DevOps – Experience with Infrastructure as Code (Terraform, AWS CDK, Terragrunt). Proficiency in CI/CD tooling such as GitHub Actions, ArgoCD, or Jenkins.
- Observability & Monitoring – Hands-on experience with Datadog, Prometheus, or similar monitoring tools to drive operational excellence.
- Cross-Team Collaboration & Business Focus – Ability to work effectively across teams, communicate clearly, and drive alignment with multiple stakeholders. A strong understanding of customer needs and how technical solutions align with business objectives.
- Platform/Core Services Experience (Highly Desired) – Prior experience working on platform, core, or shared services teams is highly desirable. Experience building foundational services that support multiple product lines and teams.
Who You Are
- A strong team player who values open and constructive feedback and fosters a culture of learning.
- Someone who communicates value, risks, tradeoffs, and recommended direction effectively.
- Adaptable and flexible, ready to navigate changing priorities with a positive mindset.
- Proactive in identifying and solving problems, with persistence and coordination to overcome challenges.
- Able to balance speed with risk, making data-driven decisions while ensuring long-term sustainability.
- Committed to mentorship and lifting others up, fostering shared growth and excellence across the team.