- Department: Platform Engineering
- Employment Type: Full-time
- Experience Level: Mid-Level (3-7 years)
- Location: Remote
Company Overview
Join our innovative platform engineering team where we champion infrastructure as code, architect for high availability, and excel in fast-paced environments. We are building the next generation of platform solutions that power our diverse portfolio of products and services at enterprise scale.
Position Summary
We are seeking accomplished Platform Engineers to take ownership of critical infrastructure systems and drive technical excellence across our platform ecosystem. This role is ideal for experienced engineers ready to own mission-critical systems, mentor team members, and lead architectural initiatives. The successful candidate will demonstrate expertise in infrastructure automation, system reliability, and will play a key role in shaping our platform's future while supporting a diverse portfolio of products and services.
Key Responsibilities
Infrastructure Architecture & Management
- Own and architect comprehensive cloud infrastructure components across multiple environments
- Lead containerization initiatives and design scalable solutions for our global Kubernetes ecosystem
- Drive infrastructure optimization and capacity planning for high-availability systems
- Make critical architectural decisions that impact platform scalability and reliability
Automation & CI/CD Leadership
- Design, build, and maintain sophisticated automation scripts and CI/CD pipelines
- Take full ownership of pipeline reliability, performance optimization, and deployment strategies
- Establish automation standards and best practices across the engineering organization
- Lead initiatives to reduce manual processes and improve operational efficiency
Developer Experience & Platform Tools
- Own and enhance internal developer tools and platform services
- Lead troubleshooting efforts for complex platform-related issues
- Drive strategic improvements to developer productivity and platform usability
- Collaborate with development teams to understand requirements and deliver solutions
Monitoring, Observability & System Reliability
- Design and maintain comprehensive monitoring systems, dashboards, and alerting mechanisms
- Apply deep expertise in interpreting system metrics and proactively identifying performance issues
- Implement observability best practices and drive data-driven operational decisions
- Lead capacity planning and performance optimization initiatives
Technical Documentation & Standards
- Lead the creation of comprehensive technical documentation for platform processes and procedures
- Establish and maintain documentation standards across the platform engineering team
- Develop detailed troubleshooting guides, runbooks, and operational playbooks
- Drive knowledge transfer initiatives and technical training programs
Incident Response & On-Call Operations
- Participate in on-call rotations with accountability for system reliability and uptime
- Lead incident response efforts during critical system emergencies
- Conduct thorough post-incident reviews and implement preventive measures
- Drive continuous improvement in incident response processes and system resilience
Technical Leadership & Mentorship
- Mentor junior engineers and provide technical guidance on complex projects
- Lead technical discussions and architectural decision-making processes
- Drive adoption of new technologies and evaluate emerging platform solutions
- Contribute innovative ideas to refine and evolve our platform architecture
Technical Requirements
Core Platform Technologies
- Container Orchestration: Advanced Kubernetes ecosystem management and architecture
- Cloud Platforms: AWS, Google Cloud Platform (GCP), Microsoft Azure
- Infrastructure as Code: Terraform, Cloud Development Kit for Terraform (CDKTF)
- Operating Systems: Advanced Linux/Unix systems administration and performance tuning
- Monitoring & Observability: Prometheus, Quickwit, and enterprise monitoring solutions
- CI/CD: Advanced pipeline design, optimization, and enterprise deployment strategies
- Containerization: Docker, container security, and orchestration at scale
- Networking: Advanced networking concepts, security, and troubleshooting
- Version Control: Git workflows, branching strategies, and code review practices
- Programming Languages: Python, Rust, JavaScript with focus on automation and tooling
- Configuration Management: Enterprise-grade configuration and deployment automation
Required Qualifications
- Demonstrated expertise in 4-6 technologies from our core platform technology stack
- Proven ability to own and drive complex technical initiatives from conception through production deployment
- Extensive experience with incident response, on-call responsibilities, and system reliability engineering
- Strong problem-solving capabilities with meticulous attention to detail and demonstrated accountability
- Excellent communication skills with collaborative mindset and proven ability to mentor team members
- Track record of successfully delivering infrastructure projects in fast-paced, high-availability environments
- Bachelor's degree in Computer Science, Engineering, or related technical field, or equivalent professional experience
Preferred Qualifications
- 3-7 years of professional experience in software development or infrastructure engineering
- Advanced working knowledge of additional technologies from our core platform stack
- Deep understanding of infrastructure as code principles and provider-agnostic design patterns
- Proven experience leading technical projects and driving architectural decisions
- Previous experience with incident response, post-mortems, and site reliability engineering practices
- Demonstrated track record of mentoring junior engineers and leading technical initiatives
- Experience with enterprise-scale infrastructure and multi-region deployments
- Relevant industry certifications (AWS, GCP, Azure, Kubernetes, etc.)
What We Offer
Technical Leadership Opportunities
- Ownership over critical platform technologies and strategic architectural decisions
- Direct influence on platform roadmap and technology stack evolution
- Opportunity to lead technical initiatives that impact the entire engineering organization
- Access to cutting-edge infrastructure technologies and enterprise-scale challenges
Professional Growth & Development
- Comprehensive mentorship opportunities with junior engineers
- Leadership development programs and technical advancement pathways
- Clear career progression within the platform engineering discipline
- Conference attendance, training budgets, and certification support
Impact & Collaboration
- Direct opportunity to enhance developer productivity across the organization
- Collaboration with cross-functional teams on strategic platform initiatives
- Contribution to the evolution and scaling of our platform infrastructure
- Influence on engineering culture and best practices
Compensation & Benefits
- Competitive salary commensurate with experience and expertise
- Comprehensive benefits package including health, dental, and vision coverage
- Equity participation and performance-based compensation
- Flexible work arrangements and professional development budget
Application Process
Interested candidates should submit a resume and cover letter detailing their relevant experience and interest in platform engineering. Please include examples of any relevant projects, certifications, or contributions to open-source platforms.
Directus is an Equal Opportunity Employer. We're committed to building a diverse team of talented individuals who bring different perspectives to the company, and who feel a sense of inclusion and belonging when they join our team.
If you're unsure about some of the details above, feel free to reach out with questions. We would love to hear from you!