Yahara Software · 16 hours ago
Cloudera Data Engineer
Yahara Software is seeking a full-time Cloudera Data Engineer to join their innovative Software Development team in Madison, Wisconsin. The role involves designing and maintaining enterprise-scale data pipelines using the Cloudera Data Platform and focuses on building scalable ETL/ELT workflows while collaborating with cross-functional Agile teams.
SoftwareAppsRoboticsInformation TechnologyMobile Apps
Responsibilities
Design and maintain enterprise-scale pipelines using CDP and big data tooling
Build scalable ETL/ELT workflows for structured and unstructured data
Develop distributed processing jobs using big data framework components
Design data storage solutions balancing performance and cost
Collaborate with analysts, scientists, and developers to deliver data solutions
Develop technical documentation for pipelines and architectures
Qualification
Required
5–7 years in data engineering with big data or distributed systems
Experience with CDP, CDH, or similar enterprise big data platforms
Degree in CS, Data Science, Information Systems, or equivalent experience
Strong background in distributed data processing
Ability to obtain and maintain Public Trust clearance
Self‑starter with a passion for data engineering
Strong analytical and problem‑solving skills
Enthusiastic about big data technologies and performance optimization
Detail‑oriented with a commitment to accuracy and reliability
Ability to translate business requirements into effective solutions
Collaborative, able to recognize blockers and leverage team strengths
Experience with Agile development environments
Proven experience designing and implementing production pipelines
Specific Technical Qualifications: Cloudera ecosystem experience: CDP, HDFS, Hive/Impala, Spark
Programming: Python, Scala, or Java
Advanced SQL and distributed compute (Spark, MapReduce)
Shell scripting and version control (Git)
Data storage formats: Parquet, Avro, ORC
Workflow orchestration and scheduling
Cloud experience (Azure, AWS, or GCP) and understanding of hybrid patterns
Preferred
Experience in biohealth, laboratory, or scientific data environments is a plus
Familiarity with HIPAA, FDA, or GxP preferred but not required
Benefits
20+ days of PTO accruable in the first year!
Comprehensive health insurance (Medical, Dental, Vision) with HMO and PPO options
Health Savings Account (HSA) with annual employer contributions
401(k) with guaranteed company match (Traditional and Roth options)
100% company-paid short-term and long-term disability
100% company-paid life insurance with option to increase coverage
100% company-paid identity theft protection
On-site gym with basketball court
Hybrid/remote schedule with home office stipend
Fresh fruit, healthy snacks, and beverages provided daily
Bonus certification program (Microsoft, AWS, PMP, IIBA, etc.)
Employee Assistance Program (counseling, legal, financial services)
Monthly and Quarterly Recognition Awards with spot bonuses
Company-supported community outreach and volunteer opportunities
Employee-run committee involvement opportunities
Collaborative culture founded on realized values and incredible people
Company
Yahara Software
Yahara Software is a computer software company offering application, product development, and interactive web solutions.
Funding
Current Stage
Growth StageRecent News
2024-10-21
Company data provided by crunchbase