The Company
NorthMark Compute & Cloud (NMC²) is backed by dedicated leadership and investment and operates at the leading edge of technology. Its mission is to scale and enhance the high-performance computing (HPC) and cloud infrastructure that supports its clients' research, production, and delivery, enabling breakthroughs that shape the industries of tomorrow. Its engineers build critical infrastructure that removes friction from scientific research, simulation, analysis, and decision-making, accelerating discovery and innovation.
The Position
In this role, you will leverage your expertise in networking and software engineering to automate the provisioning, configuration, and observability of complex network environments. You will collaborate closely with network engineers, platform engineers, and security teams to ensure that our networking stack is reliable, secure, and efficient. This is a hands-on engineering role where you will design and implement automation solutions that support large-scale compute and research workflows, driving innovation in how modern networks are managed.
Responsibilities:
Design, build, and maintain automation frameworks that simplify network provisioning and lifecycle management
Develop software in Python and leverage tools such as Ansible, Terraform, and Jinja2 to implement Infrastructure-as-Code practices
Integrate network automation into CI/CD pipelines, enabling testable, repeatable, and reliable deployments
Build APIs and tooling to expose networking capabilities as self-service for engineering teams
Implement observability workflows that cover network performance, availability, and telemetry using modern monitoring stacks
Collaborate with Network Engineering, Security, and Platform teams to deliver resilient and scalable automation solutions
Participate in on-call rotations and incident response, contributing to the reliability of production systems
Contribute to architectural design discussions, ensuring automation is built for long-term maintainability and scalability
Requirements:
Proven experience in software engineering, network engineering, or a hybrid of both, with a focus on automation
Strong programming skills in Python and familiarity with software engineering practices (testing, version control, CI/CD)
Solid understanding of computer networking (routing, switching, firewalls, protocols) and network design principles
Experience with Infrastructure-as-Code, configuration management, and templating tools (Ansible, Terraform, Jinja2)
Familiarity with containerization and orchestration platforms (Docker, Kubernetes)
Exposure to observability, monitoring, or telemetry systems for distributed infrastructure
Strong problem-solving skills with the ability to navigate ambiguity and take ownership of projects
Comfortable working in collaborative, agile engineering teams
Preferred Qualifications:
Knowledge of event-driven or message-based architectures (Kafka, RabbitMQ)
Experience with API design, service integration, or platform engineering
Familiarity with service discovery, secrets management, or policy-as-code frameworks
Understanding of large-scale data center networking concepts, platforms, and technologies (Cisco, Arista, segment routing, multi-tenant designs)
Prior contributions to open-source networking or automation projects