We are seeking a highly skilled and experienced Architecture & Operations Lead to drive the development of infrastructure for automation testing, internal DevOps, CI/CD, and deployment.
This role is critical in designing and maintaining scalable and high-performance infrastructure for software development, testing, and production environments. The ideal candidate has a strong background in cloud infrastructure, automation, microservices architecture, performance monitoring, and software best practices.
Key Responsibilities:
Infrastructure & Automation:
Design and implement infrastructure for automation testing and internal DevOps processes.
Develop and manage CI/CD pipelines, ensuring smooth and automated software deployment.
Architect and maintain scalable infrastructure on AWS, leveraging Terraform and infrastructure-as-code (IaC) best practices.
Define and enforce software best practices, ensuring reliability, maintainability, and security.
Operations & Performance Monitoring:
Lead performance monitoring and optimization efforts using Application Performance Monitoring (APM) tools such as New Relic.
Implement Site Reliability Engineering (SRE) principles to enhance system reliability and scalability.
Monitor and improve system performance, ensuring high availability and fault tolerance.
Collaboration Across Teams:
Work closely with development, product, and DevOps teams to align infrastructure strategies with system architecture.
Conduct design reviews and provide recommendations to optimize software and infrastructure performance.
Oversee GitHub Actions workflows for efficient automation and deployment processes.
Security & Compliance:
Ensure infrastructure meets industry security and compliance standards.
Collaborate with security teams to perform vulnerability assessments and implement secure deployment strategies.
Software Development & Best Practices:
Define and enforce best practices for software development and deployment.
Ensure backward compatibility across releases to prevent breaking API changes.
Drive automation initiatives to reduce manual effort and increase efficiency.
Requirements:
Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
8+ years of experience in software infrastructure, DevOps, or cloud architecture, including leadership roles.
Expertise in designing and managing CI/CD pipelines using GitHub Actions.
Strong experience with AWS, Terraform, and infrastructure-as-code (IaC) principles.
Proficiency in Python for automation and infrastructure management.
Strong understanding of microservices architecture and distributed systems.
Experience with Application Performance Monitoring (APM) tools such as New Relic.
Familiarity with containerization and orchestration (Docker, Kubernetes).
Hands-on experience with SRE methodologies and best practices.
Strong problem-solving skills with a focus on scalability and system-wide impact.
Preferred Skills:
Experience in high-availability system design and cloud-based infrastructure optimization.
Knowledge of compliance and security frameworks for cloud environments.
Strong analytical skills for performance tuning and optimization.
This position is open to all candidates.