Catapult Federal Services · 2 days ago

Databricks Data Engineer

Catapult Federal Services is seeking a Databricks Data Engineer to develop and support data pipelines and analytics environments in Azure. The role involves translating business requirements into data engineering solutions, managing ETL operations, and ensuring data quality while collaborating with cross-functional teams.

Information Technology & Services
No H1B · U.S. Citizen Only
Hiring Manager
Heidi Duffin

Responsibilities

Design, build, and optimize scalable data solutions using Databricks and Medallion Architecture
Manage ingestion routines for processing multi-terabyte datasets efficiently for multiple projects simultaneously, where each project may have multiple Databricks workspaces
Integrate data from various structured and unstructured sources to enable high-quality business insights. Apply data analysis techniques to derive insights from large datasets
Implement effective data management strategies to ensure data integrity, availability, and accessibility. Identify opportunities for cost optimization in data storage, processing, and analytics operations
Monitor and support user requests, addressing platform or performance issues, cluster stability, Spark optimization, and configuration management
Collaborate with the team to enable advanced AI-driven analytics and data science workflows
Integrate with various Azure services including Azure Functions, Storage Services, Data Factory, Log Analytics, and User Management for seamless data workflows
Provision and manage infrastructure using Infrastructure-as-Code (IaC)
Apply best practices for data security, data governance, and compliance, ensuring support for federal regulations and public trust standards
Proactively collaborate with technical and non-technical teams to gather requirements and translate business needs into data solutions

Qualifications

Databricks, Azure cloud services, Python, Spark, Data governance, CI/CD automation, Infrastructure-as-Code, Agile methodology, R, .NET development

Required

BS degree in Computer Science or a related field with 3+ years of experience, or a Master's degree with 2+ years of experience
3+ years of experience designing and developing ingestion flows (structured, streaming, and unstructured data) using cloud platform services, with attention to data quality
Databricks Data Engineer certification and 2+ years of experience maintaining Databricks platform and development in Spark
Ability to work directly with clients and act as front-line support for incoming client requests. Clearly document and express solutions in the form of architecture and interface diagrams
Proficiency in Python, Spark, and R is essential
Knowledge and experience with data governance, including metadata management, enterprise data catalog, design standards, data quality governance, and data security
Experience with Agile process methodology, CI/CD automation, and cloud-based development (Azure, AWS)
Public Trust clearance (U.S. Citizenship Required)

Preferred

Experience with the above Azure services is a plus
.NET-based development is a plus
Additional education, certifications, or experience are a plus but not required, for example certifications in Azure cloud or knowledge of FinOps principles and cost management

Company

Catapult Federal Services

Catapult Federal Services is headquartered in Plano, TX, with its primary business operations office located in the greater Washington, D.C. area.

Funding

Current Stage
Growth Stage
Company data provided by Crunchbase