SIGN IN
Cloudera Data Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Yahara Software · 16 hours ago

Cloudera Data Engineer

Yahara Software is seeking a full-time Cloudera Data Engineer to join their innovative Software Development team in Madison, Wisconsin. The role involves designing and maintaining enterprise-scale data pipelines using the Cloudera Data Platform and focuses on building scalable ETL/ELT workflows while collaborating with cross-functional Agile teams.
SoftwareAppsRoboticsInformation TechnologyMobile Apps
check
Growth Opportunities
badNo H1BnoteU.S. Citizen Onlynote

Responsibilities

Design and maintain enterprise-scale pipelines using CDP and big data tooling
Build scalable ETL/ELT workflows for structured and unstructured data
Develop distributed processing jobs using big data framework components
Design data storage solutions balancing performance and cost
Collaborate with analysts, scientists, and developers to deliver data solutions
Develop technical documentation for pipelines and architectures

Qualification

Cloudera Data PlatformETL/ELT workflowsDistributed data processingBig data technologiesPythonAdvanced SQLCloud experienceAgile developmentCollaborationProblem-solvingAttention to detail

Required

5–7 years in data engineering with big data or distributed systems
Experience with CDP, CDH, or similar enterprise big data platforms
Degree in CS, Data Science, Information Systems, or equivalent experience
Strong background in distributed data processing
Ability to obtain and maintain Public Trust clearance
Self‑starter with a passion for data engineering
Strong analytical and problem‑solving skills
Enthusiastic about big data technologies and performance optimization
Detail‑oriented with a commitment to accuracy and reliability
Ability to translate business requirements into effective solutions
Collaborative, able to recognize blockers and leverage team strengths
Experience with Agile development environments
Proven experience designing and implementing production pipelines
Specific Technical Qualifications: Cloudera ecosystem experience: CDP, HDFS, Hive/Impala, Spark
Programming: Python, Scala, or Java
Advanced SQL and distributed compute (Spark, MapReduce)
Shell scripting and version control (Git)
Data storage formats: Parquet, Avro, ORC
Workflow orchestration and scheduling
Cloud experience (Azure, AWS, or GCP) and understanding of hybrid patterns

Preferred

Experience in biohealth, laboratory, or scientific data environments is a plus
Familiarity with HIPAA, FDA, or GxP preferred but not required

Benefits

20+ days of PTO accruable in the first year!
Comprehensive health insurance (Medical, Dental, Vision) with HMO and PPO options
Health Savings Account (HSA) with annual employer contributions
401(k) with guaranteed company match (Traditional and Roth options)
100% company-paid short-term and long-term disability
100% company-paid life insurance with option to increase coverage
100% company-paid identity theft protection
On-site gym with basketball court
Hybrid/remote schedule with home office stipend
Fresh fruit, healthy snacks, and beverages provided daily
Bonus certification program (Microsoft, AWS, PMP, IIBA, etc.)
Employee Assistance Program (counseling, legal, financial services)
Monthly and Quarterly Recognition Awards with spot bonuses
Company-supported community outreach and volunteer opportunities
Employee-run committee involvement opportunities
Collaborative culture founded on realized values and incredible people

Company

Yahara Software

twittertwittertwitter
company-logo
Yahara Software is a computer software company offering application, product development, and interactive web solutions.

Funding

Current Stage
Growth Stage

Leadership Team

K
Kevin Meech
Chief Executive Officer
linkedin
Company data provided by crunchbase