Senior Systems Engineer – Mechanical jobs in United States

Fleet Data Centers · 21 hours ago

Senior Systems Engineer – Mechanical

Fleet Data Centers designs and operates mega-scale data center campuses and is seeking a Senior Systems Engineer – Mechanical. This role focuses on the design validation and optimization of data center cooling systems, ensuring efficiency and reliability across various operating scenarios.

Data Center · Data Management · IT Infrastructure

Responsibilities

Develop and maintain a deep understanding of Fleet data center cooling topology, including:
Air-side systems: fan walls, CRAHs/CRACs, air handlers, ducting, containment, filters
Liquid-side systems: chillers, dry coolers, pumps, CDUs, heat exchangers, headers/manifolds, valve trains
Rack-level solutions: liquid-cooled cold plates, rear-door heat exchangers, in-rack manifolds, hybrid air/liquid configurations
Determine the air-to-liquid mix needed to support a given rack layout and density, considering rack SKUs, aisle configuration, containment, and site constraints
Ensure cooling topology and capacity at room/aisle level support current and forecast rack deployments and density targets
Understand the air and liquid cooling requirements for each rack SKU, including:
Inlet temperature and humidity ranges
Liquid flow, pressure, and temperature ranges for cold plates and rear-door heat exchangers
Maintain structured mapping from rack SKUs to:
Required airflow per rack/aisle
Required liquid flow per rack/manifold/loop
Special constraints (e.g., mixed air/liquid aisles, max ΔT)
Ensure specifications and counts for cooling components (fan wall modules, CRAHs/CRACs, CDUs, pumps, valves, manifolds, piping sizes, coils) are accurate, documented, and provided to capacity planning and procurement
Perform CFD analysis at room and aisle level to:
Validate that planned rack placement does not create hot spots
Confirm that airflow patterns, pressure profiles, and temperature distributions are within allowable limits
Identify and mitigate cooling stranding, where cooling capacity exists but cannot be effectively delivered to IT load because of placement or topology
Use CFD and thermal modeling tools to:
Evaluate different rack arrangements and containment strategies
Test sensitivity to changes in IT load, fan speeds, supply temperatures, and air-to-liquid mix
Quantify margin to thresholds (e.g., maximum rack inlet temperature, maximum component temperatures)
Translate CFD results into actionable design rules, placement constraints, and deployment guidelines for capacity planners and operations
Optimize cooling for each aisle based on:
Actual and forecasted IT load distribution
Air-to-liquid split for the racks in that aisle
Containment strategy (cold aisle, hot aisle, full containment, partial containment)
Recommend fan wall octet configurations (and other fan wall module configurations) per deployment to:
Deliver the required airflow and pressure
Maintain redundancy and margin for failure and maintenance scenarios
Minimize energy use while maintaining thermal headroom
Work with operations to tune setpoints (supply temperatures, fan speeds, differential pressure, chilled water temperatures) to support uptime SLAs and reduce cooling stranding and over-provisioning
Conduct failure mode simulations and analyses for mechanical systems, including at minimum:
CRAC/CRAH outage scenarios (single unit or multiple simultaneous failures)
Dry cooler outage or degraded performance scenarios
Pump failures, valve failures, and partial loss of liquid loops
For each scenario, evaluate:
Transient and steady-state temperature excursions at the rack and component level
Time-to-threshold (how long before violating safe temperature limits)
Impact on redundancy, load shedding requirements, and achievable uptime
Use the failure mode simulation results to:
Recommend design improvements (additional redundancy, loop segmentation, capacity rebalancing)
Define operational responses and MOPs (e.g., load shedding priorities, setpoint changes)
Optimize uptime SLAs while minimizing cooling stranding, especially in mixed air/liquid deployments and high-density aisles
Lead or support infrastructure upgrades and expansion impact analyses for cooling systems, including:
Adding or resizing fan walls, CRAHs/CRACs, dry coolers, chillers, pumps, CDUs, and distribution headers
Increasing liquid cooling fraction as AI-heavy racks grow in share
Changing setpoints or operating modes (e.g., different supply temperatures, economization strategies)
For each proposed change, quantify:
Effect on current and future thermal capacity and headroom
Changes in aisle-level and room-level airflow / liquid flow distribution
Impact on PUE, water usage, and operating costs
Provide mechanical engineering input into MOPs and risk assessments for any cooling system change that could impact live IT load
Partner with capacity planners, rack design teams, site operations, facilities engineering, and procurement to ensure:
Cooling design and capacity assumptions are aligned with rack deployment plans and SLAs
Air-to-liquid decisions are integrated into forecast models and program timelines
Produce and maintain clear design guides, reference one-lines, piping schematics, and airflow diagrams for:
Baseline Fleet data halls
High-density / AI-specific deployments
Contribute mechanical content to internal standards and playbooks covering:
Cooling topology design rules
CFD analysis methodologies and acceptance criteria
Failure mode simulation procedures and reporting standards

Qualifications

Mechanical Engineering · Data Center Cooling · CFD Analysis · Thermal Systems Engineering · Failure Mode Analysis · Analytical Skills · Effective Communication · Collaboration · Leadership

Required

Bachelor's degree in Mechanical Engineering or a closely related engineering discipline
6+ years of experience in data center mechanical engineering, mission-critical HVAC design, or thermal systems engineering for large industrial or technology facilities
Demonstrated deep understanding of data center cooling topologies, including both air-cooled and liquid-cooled architectures (fan walls, CRAHs/CRACs, chillers, dry coolers, pumps, heat exchangers, CDUs, manifolds, containment systems)
Hands-on experience performing and interpreting CFD analysis for data halls or similar mission-critical environments, with a track record of using CFD results to drive design changes and rack placement decisions
Proven ability to determine appropriate air-to-liquid mix for given rack layouts and densities
Proven ability to assess and optimize thermal performance at rack, aisle, and room levels
Proven ability to identify and remediate hot spots and cooling stranding
Experience designing or analyzing failure modes for cooling systems (e.g., CRAC/CRAH outage, dry cooler/chiller degradation, pump or valve failures) and translating results into design and operational mitigations
Strong analytical and problem-solving skills, with the ability to connect thermal and mechanical design decisions to uptime, SLA performance, and site efficiency (PUE, water usage)
Clear written and verbal communication skills, including the ability to document complex cooling concepts and present analyses to engineering and operations stakeholders

Preferred

Experience in hyperscale or colocation data centers, especially supporting high-density AI/GPU clusters and advanced liquid cooling (direct-to-chip, rear-door heat exchangers, in-rack manifolds)
Proficiency with industry-standard CFD and thermal analysis tools and familiarity with integrating results into DCIM/BMS or capacity planning workflows
Familiarity with data center efficiency metrics (e.g., PUE, WUE) and how cooling design decisions influence them
Experience with DCIM, BMS, and monitoring systems for tracking and optimizing thermal performance in production environments
Knowledge of relevant mechanical and building codes and standards as applied to mission-critical facilities
Prior experience conducting infrastructure upgrade or expansion impact analyses in live data centers, including development of MOPs and risk mitigations

Benefits

100% employer-covered medical, dental, and vision insurance
401(k) program
Standard paid holidays
Unlimited PTO

Company

Fleet Data Centers

Fleet Data Centers is a data infrastructure company that designs, constructs, and operates mega-scale data centers.

Funding

Current Stage: Growth Stage
Company data provided by Crunchbase