Talentsearchpro · 6 hours ago
Site Reliability Engineer
Talentsearchpro is seeking a Site Reliability Engineer (SRE) who is passionate about ensuring the performance and reliability of complex systems. The role involves optimizing infrastructure, automating processes, and monitoring system health to ensure platforms are fast, stable, and available to users around the clock.
Human Resources
Responsibilities
Optimize infrastructure, automate processes, and monitor system health to ensure platforms are fast, stable, and available to users around the clock
Qualification
Required
MUST: Experience at trading shop/exchanges, avoid NYSE and NASDAQ
3+ years of site reliability engineering experience at a top rated financial firm (check target companies)
Experience designing and operating large-scale production systems
Experience working with AWS or other cloud providers
Preferred
Deep Kubernetes expertise: building operators, custom schedulers etc. Not required but would be super handy
Experience with Infrastructure-as-Code (e.g. Terraform, Ansible, AWS CDK, CDKTF would be a plus)
Experience with observability best practices and tooling (we use Prometheus/Loki/Tempo/Pyroscope)
Experience building deployment pipelines leveraging common CI/CD tools (we use ArgoCD and GitHub Actions)
A good understanding of software engineering principles and an ability to write clean code in any programming language (we use a mix of Python and Go)
A good understanding of web applications and architecture
Company
Talentsearchpro
Talent Search Pro, we understand that making the right hire is a critical action that you have to take as an organization.
Funding
Current Stage
Early StageCompany data provided by crunchbase