Software System Reliability Engineer

7 days ago


London, Greater London, United Kingdom HCLTech Full time
Job Description

We are seeking a highly skilled Software System Reliability Engineer to join our dynamic team at HCLTech. The ideal candidate will have a proven track record in implementing DevOps and SRE practices, with extensive experience in cloud platforms, containerization, and observability tools.

About Our Team

We are a dynamic team of experienced professionals working together to deliver industry-leading capabilities centered on digital, engineering and cloud, powered by a broad portfolio of technology services and products.

We work with clients across all major verticals, providing industry solutions for Financial Services, Manufacturing, Life Sciences and Healthcare, Technology and Services, Telecom and Media, Retail and CPG, and Public Services. Consolidated revenues as of $13 billion.

Responsibilities

  • Drive the implementation and evolution of DevOps and SRE practices across multi-disciplinary teams, fostering a culture of continuous improvement and collaboration.
  • Leverage advanced knowledge and hands-on experience with cloud platforms and Infrastructure as a Service (IaaS) offerings, preferably Amazon Web Services (AWS) or Microsoft Azure.
  • Utilize strong and proven Java skills to develop, maintain, and optimize application performance.
  • Apply expertise in Linux and networking fundamentals to ensure robust and secure infrastructure.
  • Deploy and manage containerization technologies, ideally using Docker and Kubernetes, to streamline application delivery and scalability.
  • Oversee the processes involved in release, integration, and deployment, ensuring efficient and reliable promotion pathways within these processes.
  • Implement and maintain observability principles and practices, including monitoring, logging, tracing, and alerting systems, using tools such as Dynatrace and Datadog, to provide transparency and actionable insights into system performance and health.
  • Address performance and optimization issues, demonstrating a capability to diagnose and resolve problems efficiently.
  • Operate across the entire stack, including hardware, application, and network layers, to ensure comprehensive system reliability and performance.
  • Champion agile software development methodologies and environments to enhance team productivity and project outcomes.


  • London, Greater London, United Kingdom Dabster Full time

    We are seeking an experienced Software Reliability Engineering Manager to join our team at Dabster. As a Software Reliability Engineer, you will be responsible for ensuring the reliability and performance of our systems, while also collaborating with cross-functional teams to drive business growth.The ideal candidate will have a strong background in software...


  • London, Greater London, United Kingdom Google Full time

    About the RoleThe Software Reliability Engineer position at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE, you will keep an ever-watchful eye on our systems' capacity and performance, optimizing existing systems, building infrastructure, and eliminating work through...


  • London, Greater London, United Kingdom ZipRecruiter Full time

    Job Summary: System Reliability EngineerWe're seeking a highly skilled System Reliability Engineer to join our team as our first dedicated SRE/DevOps hire. This role offers an exciting opportunity to design, implement, and manage our infrastructure, CI/CD pipelines, and production operations from the ground up. You'll have autonomy in shaping our tech stack,...


  • London, Greater London, United Kingdom Amazon Full time

    Veeqo is an innovative company that helps high-growth ecommerce businesses build efficient inventory and fulfillment operations. We're seeking a talented DevOps Engineer to join our team and help us improve our system's resilience and security.About the JobAs a DevOps Engineer, you will work closely with multiple teams to build tooling and infrastructure...


  • London, Greater London, United Kingdom Bumble Full time

    Job DescriptionWe are seeking a skilled System Reliability Architect to join our team at Bumble Inc. Estimated salary: $120,000 - $180,000 per year.About the RoleAs a System Reliability Architect, you will be responsible for ensuring the reliability, scalability, and performance of our software systems. This involves proactively managing, automating, and...


  • London, Greater London, United Kingdom Apple Inc. Full time

    At Apple Inc., we are revolutionizing entire industries by crafting experiences that inspire innovation. As a Software Reliability Engineer, you will be part of a diverse collection of talented individuals who work together to deliver world-class products and services.Job ResponsibilitiesYou will design, engineer, and run systems and infrastructure that will...


  • London, Greater London, United Kingdom AYS System Full time

    About AYS SystemAYS System is a leading provider of innovative urban solutions, committed to building stronger communities through quality infrastructure.About the RoleWe are seeking an experienced Electrical Systems Specialist to join our team. As an electrical engineer, you will play a crucial role in designing, developing, and maintaining electrical...


  • London, Greater London, United Kingdom ENGINEERINGUK Full time

    Job SummaryThe role of a Robotics Systems Engineer in the Reliability and Automation Engineering Team involves working with cross-functional teams to drive the implementation and continuous improvement of world-class maintenance, repair, and supportability solutions for Amazon Robotics portfolio. You will analyze large-scale data from databases, PLCs,...

  • Software Engineer

    4 weeks ago


    London, Greater London, United Kingdom Quantcast Full time

    Job DescriptionWe are seeking a highly skilled Software Engineer to join our team as a Platform Reliability Specialist. As a key member of our engineering team, you will be responsible for designing, developing, and maintaining large-scale distributed systems that respond to millions of real-time requests per second efficiently.The ideal candidate will have...


  • London, Greater London, United Kingdom xAI Full time

    About the RoleWe are seeking an experienced Site Reliability Engineer to join our dynamic team in London. The ideal candidate will have a strong background in software engineering and a passion for ensuring high system availability.The main responsibilities of this role include:Improving Observability: Design and implement monitoring systems to provide...


  • London, Greater London, United Kingdom Kosli Full time

    Kosli is seeking a Reliable Software Engineer to join our team. As a Reliable Software Engineer, you will play a key role in building and maintaining our large-scale data and compute cloud infrastructure.">Manage and evolve Kosli's cloud infrastructure using Terraform and AWSLead security implementation and compliance checks across our infrastructureOwn and...


  • London, Greater London, United Kingdom Palantir Technologies Full time

    About the RoleWe are seeking a skilled Reliable Software Architect to join our team at Palantir Technologies. In this role, you will play a critical part in ensuring the stability and reliability of our products.As a key member of our engineering team, you will collaborate with cross-functional teams to develop and deploy scalable, reliable software for our...


  • London, Greater London, United Kingdom IG Index Limited Full time

    Lead a team of engineers to ensure the stability and resilience of our cloud-based services.We are IG Index Limited, a global company that uses advanced technology to help ambitious people achieve financial freedom.Your role in the TeamYou will oversee the development of our cloud infrastructure, working closely with the architecture and service management...


  • London, Greater London, United Kingdom Spectrum IT Recruitment Full time £75,000 - £85,000

    Streamline Software Delivery and Enhance System ReliabilityWe are seeking a highly skilled Senior Site Reliability Engineer to join our client's engineering team. As a key member, you will play a critical role in streamlining software delivery pipelines, enhancing the reliability, performance, and scalability of systems, and driving continuous improvement...

  • Software Engineer

    4 weeks ago


    London, Greater London, United Kingdom Acre Software Full time

    Acre Software is revolutionizing the UK's mortgage market with its cutting-edge management system.OverviewThe company is building a fully digital platform that streamlines the home buying process, reducing unnecessary admin and friction for consumers. Acre's platform covers the entire journey, from determining what buyers can borrow to handing over...


  • London, Greater London, United Kingdom FactSet Full time

    Job Title:Chief System Reliability SpecialistAbout FactSetFactSet is a leading provider of financial data and software solutions for investment professionals. We offer instant access to financial data and analytics that investors use to make informed decisions.Job Description:We are seeking a highly motivated and talented Chief System Reliability Specialist...


  • London, Greater London, United Kingdom TRIA Full time £60,000 - £70,000

    TRIA is seeking a highly skilled System Reliability Engineer to join our team.Job Description:You will be responsible for designing, building, and maintaining scalable and reliable systems that meet the needs of our business.Develop and implement automation scripts using tools like Ansible or TerraformLiaise with the Platform team to ensure alignment with...


  • London, Greater London, United Kingdom Google Full time

    Job DescriptionAs a System Reliability Engineer at Google, you will play a critical role in ensuring the reliability and scalability of our systems. You will work closely with cross-functional teams to design, deploy, and operate large-scale systems that are fault-tolerant and highly available. Your expertise will help us build and maintain infrastructure...


  • London, Greater London, United Kingdom Sahaj Software Full time

    **Job Title:** Senior Software Engineering Manager**About Us:** At Sahaj Software, we're passionate about delivering exceptional software solutions. We're seeking a seasoned Senior Software Engineering Manager to lead our engineering team.**Estimated Salary:** $150,000 - $220,000 per yearAs a Senior Software Engineering Manager, you'll be responsible for...


  • London, Greater London, United Kingdom AYS System Full time

    Transforming Urban Landscapes with Electrical Engineering ExpertiseWe are seeking a highly skilled Electrical Systems Specialist to join our team at AYS System. As an essential member of our infrastructure development team, you will play a critical role in designing and implementing electrical control systems that drive the creation of quality urban...