Knox Systems, Inc. · 2 weeks ago
Level 1 (L1) Cloud Operations Specialist
Knox Systems runs the largest Federal managed cloud, building and operating secure cloud and AI environments that support the U.S. government’s most critical missions. The Cloud Operations Specialist (L1) is responsible for first-line monitoring, triage, and rapid incident response across cloud environments, ensuring system availability, security, and compliance.
Computer · Cyber Security · Government
Responsibilities
Monitor infrastructure, applications, and network health using tools such as Grafana, Wiz, Datadog, and CrowdStrike Falcon
Detect, triage, and escalate alerts based on severity and business impact
Document incident timelines, actions, and resolutions in ticketing systems (ServiceNow, Jira Service Management)
Follow established FedRAMP incident handling and escalation procedures
Execute predefined runbooks for system checks, restarts, and health verifications
Validate post-maintenance and deployment health of systems and services
Assist with system patching coordination, log collection, and audit evidence gathering
Maintain situational awareness of system uptime, customer impact, and scheduled changes
Support basic troubleshooting for hosted applications
Validate API connectivity and assist in identifying failed integrations or logic errors
Collaborate with developers and CloudOps engineers to verify deployment health after releases
Escalate application-related issues with complete context: affected users, tenant ID, and integration dependencies
Ensure all activities follow change control, access management, and incident response procedures
Record detailed incident notes and maintain compliance-ready audit trails
Participate in Continuous Monitoring (ConMon) reporting and FedRAMP evidence collection
Qualifications
Required
1–3 years of experience in a NOC, SOC, application support, or support center environment, supporting production systems or customer-facing web applications
Experience supporting customer-facing web applications, including alert triage, incident documentation, and escalation to engineering or platform teams
Familiarity with Linux administration and command-line tools
Familiarity with AWS, Azure, or GCP infrastructure services
Understanding of network, compute, and application monitoring fundamentals
General application troubleshooting experience, including familiarity with web-based applications, common application architectures, and foundational web technologies such as HTTP, REST APIs, and JSON
Strong attention to detail, communication, and documentation skills
Due to the nature of our work with federal government clients and compliance with applicable regulations, this position requires U.S. citizenship. Candidates must be able to provide documentation verifying U.S. citizenship status as part of the background check process
Preferred
CompTIA Security+, Linux+, ITIL v4, AWS Cloud Practitioner, or Microsoft Fundamentals (AZ-900)
Benefits
Medical
Dental
Vision
Life & Disability
Unlimited PTO
Employee-funded 401(k) plan
Company
Knox Systems, Inc.
FedRAMP in 90 Days for 90% less.
Funding
Current Stage: Growth Stage
Total Funding: $6.5M
Key Investors: Felicis
2025-07-10 · Seed · $6.5M
Company data provided by Crunchbase