Granica · 1 day ago
Software Engineer, AI Data Platform
Granica is redefining how enterprises prepare and optimize data at the fundamental layer of the AI stack. They are seeking a Software Engineer for their AI Data Platform to build next-generation infrastructure and optimize data processing for efficiency and performance.
Artificial Intelligence (AI)Information TechnologySoftware
Responsibilities
Own the ACID backbone. Design and harden transactional layers and metadata services so that petabyte-scale tables can time-travel in microseconds and schema evolution becomes a non-event
Turn metadata into rocket fuel. Build compaction, caching, and pruning services that keep millions of file pointers within 50 ms from lookup to plan
Squeeze more signal per byte. Optimize data layouts-from column ordering to dictionary and bit-packing, bloom filters, and zone-map indexes-to cut scan I/O by 10× on real-world workloads
Ship adaptive indexing with research. Co-invent machine-driven indexes that learn access patterns and automatically re-partition nightly—no more manual “analyze table” ever again
Scale the engine, not the babysitting. Write Spark, Flink, or batch pipelines that autoscale across S3, GCS, and ADLS; expose observability hooks; and survive chaos drills without triggering a pager storm
Code for longevity. Write clean, test-soaked Java, Scala, Go, or C++. Document key invariants so future teams can extend the system—instead of rewriting it
Measure success in human latency. If analysts see their dashboards refresh in blink-level time, you’ve won. Publish your breakthrough and mentor the next engineer to raise the bar again
Qualification
Required
Distributed Systems and Storage Fundamentals — consistency, replication, sharding, durability, transactions
Columnar Storage Optimization — deep knowledge of Parquet or similar formats (column ordering, compression, zone maps)
Metadata and Indexing Systems — experience building metadata-driven services, compaction, caching, and adaptive indexing
Distributed Compute at Scale — production-grade Spark/Flink or equivalent pipeline development across S3, GCS, or ADLS
Programming for Scale and Longevity — strong coding in Java, Scala, Go, or C++, with clean testing and documentation practices
Resilient Systems and Observability — you've built systems that survive chaos drills and expose the right metrics
Preferred
Exposure to open table formats such as Apache Iceberg, Delta Lake, or Hudi
Experience with catalog services, query planning, or compaction frameworks
OSS contributions or published work in data infrastructure or distributed systems
Benefits
Competitive salary and meaningful equity
Unlimited PTO + quarterly recharge days
Premium health, vision, and dental
Team offsites, deep tech talks, and learning stipends
Company
Granica
Granica is the world's first AI Data Readiness Platform, creating cutting-edge and enterprise-ready AI infrastructure services.
H1B Sponsorship
Granica has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (4)
2024 (7)
2023 (2)
Funding
Current Stage
Early StageTotal Funding
$45MKey Investors
New Enterprise Associates
2023-06-08Series A· $45M
2020-01-01Seed
Recent News
Google Patent
2025-02-07
Company data provided by crunchbase