TRISTAR Insurance Group · 13 hours ago
Systems Engineer
TRISTAR Insurance Group is seeking a Senior Systems Engineer responsible for designing, building, and operating their core infrastructure platform. This role emphasizes Linux systems, Kubernetes, and automation, with a focus on improving reliability and supporting a DevOps operating model.
Financial ServicesInsuranceRisk Management
Responsibilities
Design, build, and operate Kubernetes clusters in production, including upgrades, patching, scaling, and reliability improvements
Establish platform standards and operating practices as the environment matures (cluster configuration, access patterns, resource governance, and runbooks)
Serve as the senior escalation point for Kubernetes platform issues and drive resolution through root-cause analysis and prevention
Design and implement Kubernetes storage patterns (StorageClasses, PV/PVC lifecycle, capacity planning) and support stateful workloads
Implement, test, and maintain Kubernetes-native backup/restore and recovery procedures
Integrate Kubernetes persistence needs with enterprise storage platforms, including Dell ObjectScale and existing virtualization/storage systems
Own Kubernetes traffic entry, including ingress controllers, load balancers, routing patterns, and TLS/certificate handling
Define repeatable patterns for exposing services and troubleshooting connectivity across platform components
Administer and harden Linux systems that support the platform, including patching, performance tuning, service reliability, logging, and baseline configuration
Troubleshoot system and platform issues across compute, storage, and network dependencies
Build automation to reduce manual work and increase consistency across infrastructure operations using Python/PowerShell/Bash and API-driven workflows
Evaluate, recommend, and help implement an automation / configuration management approach (tooling, patterns, and standards) to support repeatable tasks such as provisioning, configuration enforcement, patching, drift detection, and validation
Develop reusable automation assets (modules/playbooks/templates/scripts) and establish version-controlled workflows (Git), documentation, and operational handoff practices
Leverage RESTful APIs to integrate systems and create operational workflows (health checks, reporting, event-driven automations, and change validation)
Monitor alert sources and observability tooling (including SolarWinds on-prem), investigate events, and drive issues to completion
Document incidents, actions taken, and final resolutions contribute to improved alerting quality and operational visibility
Provide occasional on-site support as needed in the data center for infrastructure prep and troubleshooting (racking equipment, cabling, and physical connectivity verification)
Maintain working familiarity with server hardware and data center best practices to support rare hands-on needs
Partner with development and infrastructure teams to plan and progress TRISTAR’s long-term transition toward cloud-hosted deployments of the application stack
Contribute to cloud design discussions with a practical understanding of core cloud concepts (networking, identity/access, security, reliability, scalability, and cost considerations) across major providers (AWS/Azure/GCP)
Translate application and platform requirements into cloud-ready operational patterns (container orchestration in cloud, managed services vs self-managed tradeoffs, environment isolation per client, and deployment repeatability)
Support early-stage cloud initiatives such as proofs of concept, reference architectures, and migration planning, including identifying skill/tooling gaps and recommending realistic next steps
Apply Infrastructure-as-Code and automation principles to cloud readiness efforts to ensure future deployments are repeatable, supportable, and auditable
Create and maintain IT documentation, including platform runbooks, operational procedures, and architecture/standards documentation
Work with the Manager, Network Services and general IT staff to analyze and resolve technical issues affecting infrastructure and applications
Partner closely with development teams as part of TRISTAR’s DevOps transition to improve operability, deployment reliability, and platform usability
Work alongside the service desk to remedy end-user workstation issues; backfill and answer service desk calls when required
Perform night/day/weekend work as required to meet project objectives and support maintenance windows
Traveling to remote sites is rare, but possible and may be required as needed
Qualification
Required
Bachelor's degree in a related field (preferred); minimum of 7-year related experience; or equivalent combination of education and experience
7+ years of progressively responsible experience in systems/infrastructure engineering with strong production experience in Linux administration
Hands-on production experience with Kubernetes, including cluster build and lifecycle management (architecture, upgrades, patching, scaling, troubleshooting)
Strong understanding of Kubernetes storage and stateful workload operations, including troubleshooting PV/PVC and storage provisioning patterns
Experience implementing Kubernetes-native backup/restore practices and validating recovery procedures
Demonstrated automation experience using scripting (Python/PowerShell/Bash) and leveraging RESTful APIs for systems integration and automation
Experience with monitoring/observability platforms and operational alerting; SolarWinds experience strongly preferred
Strong troubleshooting skills across distributed systems, networking fundamentals, and infrastructure dependencies
Strong written and verbal communication skills, including documentation/runbooks/standards
Company
TRISTAR Insurance Group
Tristar Insurance Group provides excellent claims and risk management services to a variety of clients.
H1B Sponsorship
TRISTAR Insurance Group has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2)
2024 (3)
2021 (11)
Funding
Current Stage
Late StageCompany data provided by crunchbase