Senior Infrastructure Engineer

2 weeks ago


Greater London, United Kingdom Cosine Full time

Job DescriptionAbout the RoleWe’re looking for a Senior Platform / Infra Engineer to own the core infrastructure that powers Cosine’s products — from Kubernetes and deployment pipelines to networking and platform services.You’ll design and run the “paved road” that our engineers, researchers, and customers build on: reliable Kubernetes clusters, fast and safe CI/CD, solid observability, and hardened environments for demanding enterprise and on-prem deployments. You’ll also wear a classic “DevOps/SRE” hat: thinking in SLOs, running incident response, and keeping us up even as we move quickly.This is a high-ownership role at a fast-paced, venture-backed Silicon Valley startup. You’ll work directly with founding engineers and leadership, and your decisions will materially shape how we build and ship products.What You’ll DoOwn core infrastructureDesign, operate, and evolve our Kubernetes-based platform (EKS or similar), including cluster topology, node groups, autoscaling, and multi-environment isolation.Manage supporting cloud resources: container registries, load balancers, queues, caches, and data infra needed to run our APIs and agents.Build the deployment & tooling layerDesign and maintain CI/CD pipelines for image builds and infra rollouts (e.g. Pulumi/Terraform + Helm/Docker).Implement safe rollout strategies (blue/green, canary, staged rollouts) and fast rollback paths.Build internal tools and abstractions that make it easy for product teams to self-serve infra safely.Own reliability & operations (SRE-ish)Define and track SLOs/SLIs for key services (latency, error rates, availability).Improve our observability stack (metrics, logs, traces, alerts) so issues are obvious, actionable, and debuggable.Participate in the on-call rotation, lead incident response when needed, and drive blameless post-mortems and fixes.Shape networking & securityDesign and maintain networking: VPCs, subnets, ingress/egress, service meshes / L7 routing, DNS, and TLS.Implement least-privilege access via IAM, secure secret management, and hardened configurations for multi-tenant and isolated customer environments.Help design patterns for secure enterprise and on-prem / regulated deployments.Partner with product & researchWork closely with application, ML, and research teams to understand their needs and translate them into reusable infra building blocks.Provide guidance on “how to run this in production” — capacity planning, failure modes, and operational readiness reviews.You Might Be a Great Fit If YouHave strong experience5+ years building and operating production infrastructure on a major cloud (AWS, GCP, or Azure).Significant hands-on experience running Kubernetes in production (EKS/GKE/AKS or self-managed):Cluster upgrades, autoscaling, node group design, and multi-env setups.Helm or similar for packaging services.Think in infrastructure-as-codeDeep experience with IaC tools (Pulumi, Terraform, CDK, or similar).Comfortable managing infra changes via code review, CI, and automated rollouts.Care deeply about reliabilityHave owned the uptime and performance of user-facing systems.Comfortable participating in (and improving) on-call rotations and incident management.Experience setting up / tuning observability (Prometheus, Grafana, CloudWatch, OpenTelemetry, etc.).Build great tooling & abstractionsYou’ve built internal tools, libraries, or platforms on top of cloud providers so product teams can move faster with fewer foot-guns.You think about developer experience and “golden paths,” not just raw infra.Are comfortable in codeStrong scripting and programming skills in at least one modern language (e.g. TypeScript, Go, Python).Happy to dive into app code when needed to debug a production issue or improve an integration.Have the startup mindsetEnjoy working in a fast-moving environment with evolving priorities and incomplete specs.Bias toward pragmatic solutions: ship something small, measure, iterate.Communicate clearly, give/receive direct feedback, and collaborate across functions.Nice to Have (Not Required)Experience with:AWS primitives like EKS, ECS/Fargate, ECR, SQS, ElastiCache/Redis.Argo CD or other GitOps tools for Kubernetes.On-prem, air-gapped, or regulated industry deployments (e.g. finance, healthcare).AI/ML infrastructure (GPU workloads, model hosting, feature stores).Prior experience as an early infra / platform hire at a startup.



  • Greater London, United Kingdom Rutherford Briant Full time

    Have you led complex Azure or infrastructure projects? Do you enjoy taking technical ownership while guiding others?Our client is seeking a Senior IT Infrastructure Engineer who can drive Azure deployments, lead infrastructure transformation, and support a high-performing operations team. This is an international professional services organisation recognised...


  • Greater London, United Kingdom Octavius Infrastructure Full time

    A leading civil engineering organization in the UK is seeking an Electrical Project Engineer to drive electrification projects. The role involves managing construction activities, ensuring compliance with safety standards, and collaborating with various teams. The ideal candidate will possess a degree in Electrical Engineering and experience in railway...


  • Greater London, United Kingdom Xpand Group Full time

    Senior Infrastructure EngineerPart time, 3 days per week2 to 3 month contract£500 per day inside IR35Immediate starta Council in esex is hiring a Senior Infrastructure Engineer on a temporary basis. This role sits onsite in Essex for two days each week and focuses on keeping core infrastructure stable, secure and fully supported while leading improvement...


  • Greater London, United Kingdom Vertex it Solutions Full time

    Senior Infrastructure Engineer Duration: 12-month fixed-term staff contract (not contract), with a strong potential for conversion to a permanent role.Location: This role is office based, located in Hammersmith, London + 1 day remote work per week We are looking for a highly motivated and experienced Senior Infrastructure Engineer to join our global IT team....


  • Greater London, United Kingdom Guillaume Masson Full time

    Overview Senior Infrastructure Engineer — Part time, 3 days per week; 2 to 3 month contract; £500 per day inside IR35; Immediate start. a Council in essex is hiring a Senior Infrastructure Engineer on a temporary basis. This role sits onsite in Essex for two days each week and focuses on keeping core infrastructure stable, secure and fully supported while...


  • Greater London, United Kingdom Prism Digital Full time

    Senior Infrastructure Engineer | AWS, Windows, Terraform | Media & TVSalary - £60,000Holborn / TCR office 3 days a weekMy client is a world-leading independent production and distribution group that produces some of the most popular TV programmes in the world.They're looking for a Senior Infrastructure Engineer who will support circa 30 internal companies...


  • Greater London, United Kingdom Prism Digital Full time

    Get AI-powered advice on this job and more exclusive features. This range is provided by Prism Digital. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range Salary - £60,000 Holborn / TCR office 3 days a week My client is a world-leading independent production and distribution group that...


  • Greater London, United Kingdom Rutherford Briant Full time

    Have you led complex Azure or infrastructure projects? Do you enjoy taking technical ownership while guiding others? Our client is seeking a Senior IT Infrastructure Engineer who can drive Azure deployments, lead infrastructure transformation, and support a high‑performing operations team. This is an international professional services organisation...


  • Greater London, United Kingdom Stack Infrastructure Full time

    Lead the drive for consistency, speed, and excellence in STACK's European delivery program.STACK Infrastructure is building Europe's next generation of data centers - scalable, sustainable, and ready to power the world's digital future. To accelerate our delivery performance, we're hiring a Director of Infrastructure Readiness / Commissioning - a strategic...


  • London Area, United Kingdom NST Recruitment Limited Full time

    Senior Infrastructure Engineer – Azure, AWS, Active Directory, Exchange, Group Policy, Meraki, Fortinet, SD-WAN, PowerShell, ITIL, Change Management, Hybrid (4 days p/w in London), Site TravelUp to £80,000 Basic + Benefits + Travel ExpensesThis is a fantastic Senior Infrastructure Engineer opportunity to work with a leading infrastructure services...