Software Engineer, AI Data Platform jobs in United States
cer-icon
Apply on Employer Site
company-logo

Granica · 1 day ago

Software Engineer, AI Data Platform

Granica is redefining how enterprises prepare and optimize data at the fundamental layer of the AI stack. They are seeking a Software Engineer for their AI Data Platform to build next-generation infrastructure and optimize data processing for efficiency and performance.

Artificial Intelligence (AI)Information TechnologySoftware
check
H1B Sponsor Likelynote

Responsibilities

Own the ACID backbone. Design and harden transactional layers and metadata services so that petabyte-scale tables can time-travel in microseconds and schema evolution becomes a non-event
Turn metadata into rocket fuel. Build compaction, caching, and pruning services that keep millions of file pointers within 50 ms from lookup to plan
Squeeze more signal per byte. Optimize data layouts-from column ordering to dictionary and bit-packing, bloom filters, and zone-map indexes-to cut scan I/O by 10× on real-world workloads
Ship adaptive indexing with research. Co-invent machine-driven indexes that learn access patterns and automatically re-partition nightly—no more manual “analyze table” ever again
Scale the engine, not the babysitting. Write Spark, Flink, or batch pipelines that autoscale across S3, GCS, and ADLS; expose observability hooks; and survive chaos drills without triggering a pager storm
Code for longevity. Write clean, test-soaked Java, Scala, Go, or C++. Document key invariants so future teams can extend the system—instead of rewriting it
Measure success in human latency. If analysts see their dashboards refresh in blink-level time, you’ve won. Publish your breakthrough and mentor the next engineer to raise the bar again

Qualification

Distributed SystemsColumnar Storage OptimizationMetadataIndexing SystemsDistributed ComputeProgramming in Java/Scala/Go/C++Open Table FormatsCatalog ServicesOSS ContributionsResilienceObservability

Required

Distributed Systems and Storage Fundamentals — consistency, replication, sharding, durability, transactions
Columnar Storage Optimization — deep knowledge of Parquet or similar formats (column ordering, compression, zone maps)
Metadata and Indexing Systems — experience building metadata-driven services, compaction, caching, and adaptive indexing
Distributed Compute at Scale — production-grade Spark/Flink or equivalent pipeline development across S3, GCS, or ADLS
Programming for Scale and Longevity — strong coding in Java, Scala, Go, or C++, with clean testing and documentation practices
Resilient Systems and Observability — you've built systems that survive chaos drills and expose the right metrics

Preferred

Exposure to open table formats such as Apache Iceberg, Delta Lake, or Hudi
Experience with catalog services, query planning, or compaction frameworks
OSS contributions or published work in data infrastructure or distributed systems

Benefits

Competitive salary and meaningful equity
Unlimited PTO + quarterly recharge days
Premium health, vision, and dental
Team offsites, deep tech talks, and learning stipends

Company

Granica

twittertwittertwitter
company-logo
Granica is the world's first AI Data Readiness Platform, creating cutting-edge and enterprise-ready AI infrastructure services.

H1B Sponsorship

Granica has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (4)
2024 (7)
2023 (2)

Funding

Current Stage
Early Stage
Total Funding
$45M
Key Investors
New Enterprise Associates
2023-06-08Series A· $45M
2020-01-01Seed

Leadership Team

leader-logo
Rahul Ponnala
CEO & Co-Founder
linkedin

Recent News

Company data provided by crunchbase