Photon · 7 hours ago
Software Engineer - Gen AI Inferencing | Onsite | Addison, TX / Charlotte
Photon is a company focused on innovative technology solutions, and they are seeking a Software Engineer specializing in Gen AI Inferencing. This role involves designing, building, and operating reusable toolkits for Gen AI capabilities, while ensuring software compliance and maintainability.
E-CommerceSoftwareAppsInformation TechnologyMobile AppsWeb DesignWeb Development
Responsibilities
Codes solutions and unit test to deliver a requirement/story per the defined acceptance criteria and compliance requirements
Designs, develops, and modifies architecture components, application interfaces, and solution enablers while ensuring principal architecture integrity is maintained
Mentors other software engineers and coach team on Continuous Integration and Continuous Development (CI-CD) practices and automating tool stack
Executes story refinement, definition of requirements, and estimating work necessary to realize a story through the delivery lifecycle
Performs spike/proof of concept as necessary to mitigate risk or implement new ideas
Automates manual release activities
Designs, develops, and maintains automated test suites (integration, regression, performance)
Utilizes multiple architectural components (across data, application, business) in design and development of client requirements
Manage multiple priorities, and simultaneously engage with multiple teams
Participates in estimating work necessary to realize a story/requirement through the delivery lifecycle
Be vocal and actively participate in all session with business stakeholders and agile teams
Collaborate with product teams, data analysts and data scientists to design and build solutions
Qualification
Required
5+ years OOP in Python/Scala/Java programming experience with expert level development skills
Experience with AI/ML/GenAI Lifecycle Management and Development and its Ecosystem. Hands on experience building frameworks using MLOps, Fine – Tuning techniques, Inference Frameworks
Experience with deploying models using vLLM/Triton Inference Server in containers in production with automation. Performs Continuous Integration and Continuous Development (CI-CD) activities. Performance Tuning those models and deployment to provide higher throughput
Track record of maintaining large scale Python/Unix based systems
Hands on experience and knowledge generative AI RAG process for various use cases, including chunking, embedding, retrieval, reranking and summarization
Hands-on experience in application development in one or more areas MongoDB, Redis, Angular/React Frameworks, Containerization, Building API based application leveraging FAST API services, JWT Integration, API Gateway
Develop efficient utilities, automation frameworks, data science platforms that can be utilized across multiple Data Science teams for AI/ML and GenAI work
Working in large sized teams that collaboratively develop on a shared multi-repo codebase using IDEs (e.g. VS Code rather than Jupyter Notebooks), Continuous Integration (CI), Continuous Deployment (CD) and Continuous Testing
Strong automation, scripting, and Python development skills. Hands-on DevOps experience with one or more of the following enterprise development tools: Version Control (GIT/Bitbucket), Build Orchestration (Jenkins), Code Quality (SonarQube and pytest Unit Testing), Artifact Management (Artifactory) and Deployment (Ansible)
Preferred
Experience building & deploying Gen AI inferencing platform with open-source toolsets, building inferencing & servicing capabilities (AI Gateway, Policy store, Observability) for RAG/ MCP use cases etc
Hands on experience on driving and maintaining a culture of quality, innovation, and experimentation
Research on new tools and capabilities for better UI and UX for advanced analytics platform, quick prototype and demonstrate the features and capabilities, and participate on various user forums
Benefits
Medical, vision, and dental benefits
401k retirement plan
Variable pay/incentives
Paid time off
Paid holidays
Company
Photon
Photon is a technology corporation that provides Strategy Consulting, Creative Design, and Technology Services to global enterprise.
H1B Sponsorship
Photon has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (233)
2024 (168)
2023 (236)
2022 (184)
2021 (157)
2020 (249)
Funding
Current Stage
Late StageRecent News
Bangkok Post
2024-05-20
2024-04-02
Company data provided by crunchbase