Big Data Software Engineer jobs in United States

GIGATEC Engineering · 5 hours ago

Big Data Software Engineer

GIGATEC Engineering is seeking a Big Data Software Engineer to transform a Streamlit-built UI prototype into a polished, production-ready interface. The role involves developing and extending the interface so that it can later be made available externally, drawing on skills in software development and big data processing.

Cyber Security, Big Data, Architecture & Design, Analytics, Architecture

Qualifications

Python, Java, Apache Spark, Streamlit, TypeScript, HTML, CSS, JavaScript frameworks, Git, Bash scripting, Jupyter Notebook, Data wrangling, Data quality concepts, SQL technologies, HPC Job Scheduling, Atlassian Tool Suite, Sense of humor

Required

Experience using the Linux CLI and Linux tools
Experience developing Bash scripts to automate manual processes
Recent software development experience using Python and Java
Familiarity with Streamlit, a Python framework for building interactive web applications
Experience developing front-end applications using TypeScript, HTML, and CSS
Experience with JavaScript frameworks such as React, Angular, and Vue
Familiarity with distributed big data processing engines, including Apache Spark
Experience using Jupyter Notebook
Experience with data wrangling and preprocessing using tools such as pandas and NumPy
Experience working with structured, semi-structured, and unstructured data in formats such as Parquet, JSON, CSV, and XML
Familiarity with data quality concepts, data validation, and anomaly detection
Experience with the Git source control system

Preferred

Experience using Apache Airflow (DAG design, scheduling, operators, sensors) to orchestrate, schedule, and monitor complex workflows
Experience with SQL technologies such as MySQL, MariaDB, and PostgreSQL for querying, joining, and aggregating large datasets
Familiarity with HPC job scheduling tools, including Slurm
Experience using the Atlassian tool suite (Jira, Confluence)
Appreciates a sense of humor and the occasional well-timed joke. 😄

Company

GIGATEC Engineering

At GIGATEC, we specialize in engineering advancements across cybersecurity, software development, and systems integration.

Funding

Current Stage
Early Stage
Company data provided by Crunchbase