Data Architect (AI/ML) jobs in United States
cer-icon
Apply on Employer Site
company-logo

BALIN TECHNOLOGIES LLC · 3 weeks ago

Data Architect (AI/ML)

BALIN TECHNOLOGIES LLC is seeking a seasoned Data Architect with expertise in Databricks and AI/ML enablement to lead a critical data modernization initiative. The role involves transforming legacy data platforms into scalable, cloud-native architectures while driving the integration of AI/ML capabilities across data workflows.

Computer Software

Responsibilities

Lead the architectural modernization from an on-prem/legacy platform to a unified Databricks Lakehouse ecosystem
Architect and optimize data pipelines (batch and streaming) to support AI/ML and GenAI workloads on Databricks
Migrate and re-engineer existing Spark workloads to leverage Delta Lake, Unity Catalog, and advanced performance tuning in Databricks
Drive integration of AI/ML models (including GenAI use cases) into operational data pipelines for real-time decision-making
Design and implement robust orchestration using Apache Airflow or Databricks Workflows, with CI/CD integration
Establish data governance, security, and quality frameworks aligned with Unity Catalog and enterprise standards
Collaborate with data scientists, ML engineers, DevOps, and business teams to enable scalable and governed AI solutions

Qualification

DatabricksAI/ML enablementApache SparkDelta LakeApache AirflowCloud-native architectureStakeholder engagementCI/CD pipelinesInfrastructure-as-CodeCommunication skills

Required

12+ years in data engineering or architecture, with a strong focus on Databricks (at least 4-5 years) and AI/ML enablement
Deep hands-on experience with Apache Spark, Databricks (Azure/AWS), and Delta Lake
Strong knowledge of Apache Airflow, Databricks Jobs, and cloud-native orchestration patterns
Experience with structured streaming, Kafka, and real-time analytics frameworks
Proven ability to design and implement cloud-native data architectures
Solid understanding of data modeling, Lakehouse design principles, and lineage/tracking with Unity Catalog
Excellent communication and stakeholder engagement skills

Preferred

Certification in Databricks Data Engineering Professional is highly desirable
Experience transitioning from in house data platforms to Databricks or cloud-native environments
Hands-on experience with Delta Lake, Unity Catalog, and performance tuning in Databricks
Expertise in Apache Airflow DAG design, dynamic workflows, and production troubleshooting
Experience with CI/CD pipelines, Infrastructure-as-Code (Terraform, ARM templates), and DevOps practices
Exposure to AI/ML model integration within real-time or batch data pipelines
Experience with LLM/GenAI enablement, vectorized data, embedding storage, and integration with Databricks is an added advantage

Company

BALIN TECHNOLOGIES LLC

twitter
company-logo
At Balin Technologies LLC, we are driven by innovation, excellence, and a deep commitment to helping businesses transform through technology.

Funding

Current Stage
Growth Stage
Company data provided by crunchbase