Saxon Global · 10 hours ago
Data Engineer
Saxon Global is seeking a Senior Data Engineer to develop and implement data pipelines using AWS Glue and PySpark. The role involves sourcing and transforming data, delivering cleaned datasets, and managing external integrations while collaborating with the team for efficient solutions.
Responsibilities
Develop and Implement Data Pipelines: Design, build, and maintain robust data pipelines primarily using AWS Glue and PySpark
Data Sourcing and Transformation: Source data from various systems, including Redshift and Aurora, performing necessary streaming transformations and heavy data cleaning
Data Delivery: Push resulting, cleaned datasets into S3 buckets
External Integration: Manage the secure transfer of resulting files via SFTP to an external 3rd party company's server, adhering to non-negotiable external integration deadlines
Collaboration: Work closely with the team to consult on the best and most efficient solutions for achieving required data outputs, given the constraints of the AWS Glue/PySpark environment
Qualification
Required
Heavy expertise in the AWS ecosystem, specifically AWS Glue
Hands-on experience working with PySpark on complex application implementations is required
Heavy knowledge of both relational (e.g., Redshift, Aurora) and non-SQL databases, and how to leverage them within the AWS Glue/PySpark environment
Strong general knowledge of how to efficiently get, transform, and push out data
Company
Saxon Global
Saxon Global is an IT consulting and engineering solution company.
H1B Sponsorship
Saxon Global has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (43)
2024 (47)
2023 (81)
2022 (52)
2021 (39)
2020 (54)
Funding
Current Stage
Growth StageCompany data provided by crunchbase