Site Reliability Specialist

7 days ago


London, Greater London, United Kingdom Phaidon International Full time

Job Title: Site Reliability Specialist

Job Overview: Our client, a leading global prop-trading firm, is seeking a Site Reliability Specialist to join their London office.

About the Role: As a Site Reliability Specialist, you will assist with business development by identifying new trading opportunities, work with developers to design and implement new features, and build new tools to scale and understand the trading system.

Key Responsibilities:

  • Full Lifecycle Support: Oversee the full lifecycle of software components, including development, testing, deployment, and maintenance.
  • Troubleshooting: Troubleshoot and resolve complex technical issues, ensuring minimal downtime and maximum system availability.
  • Tool Development: Develop and maintain tools that enable non-technical teams to make changes, improving efficiency and productivity.
  • Automation: Build automation tools for configuration management, deployment, monitoring, and troubleshooting, streamlining processes and reducing manual effort.
  • Technical Assistance: Offer technical assistance to the trading support team for internal and external inquiries, providing expert guidance and support.
  • Data Analysis: Create tools to analyze data generated by the trading system, enabling data-driven decision making and optimization.
  • Long-Term Planning: Contribute to long-term planning for the trading systems, including capacity, tools, and feature development, ensuring alignment with business objectives.
  • Collaboration: Collaborate with developers to design and implement new trading system features, leveraging expertise and driving innovation.
  • Ongoing Responsibilities: Perform additional responsibilities as required or assigned, demonstrating adaptability and a commitment to excellence.

Requirements:

  • Experience: Over 5 years of experience in supporting production trading environments, with a strong track record of success.
  • Skills: Strong skills in Python and/or Go, with shell scripting; familiarity with C++ is a plus.
  • Knowledge: Solid knowledge of the FIX protocol, with a focus on automating manual tasks and improving system efficiency.
  • DevOps Tools: Proficient in DevOps and CI/CD tools such as Jenkins, Ansible, and Git, with experience in automating manual tasks and streamlining processes.


  • London, Greater London, United Kingdom Fourier Full time

    Key Responsibilities: We are seeking a skilled Site Reliability Engineering Specialist to join our team at Fourier. As a key member of our Site Reliability Engineering team, you will be responsible for developing tools for surveillance and enhancement of our production systems. You will work closely with our team to increase system resilience, investigate...


  • London, Greater London, United Kingdom Lorien Full time

    Key Responsibilities: Collaborate with the existing team to deliver a brand-new project. Work on a hybrid model with 1 day a week on-site in London. Develop and maintain reliable and efficient systems. Utilize experience with Java, Python, Splunk, ServiceNow, and MongoDB. Contribute to incident management and application monitoring. Ensure seamless interaction...


  • London, Greater London, United Kingdom Tunstall Healthcare Group Full time

    About Us: Tunstall Healthcare Group is a leading provider of health and care technology solutions. With over 3,000 colleagues across 18 countries, we strive to deliver exceptional services to millions of people worldwide. Job Overview: We are seeking a skilled Site Reliability Specialist - Electronics to join our team in the North London/Hertfordshire area. This role...


  • London, Greater London, United Kingdom Trilogy International, A Korn Ferry Company Full time

    Job Overview: We are seeking a highly skilled Site Reliability Engineer to join our team at Trilogy International, a Korn Ferry company. The successful candidate will be responsible for ensuring the reliability, scalability, and performance of our clients' systems and infrastructure. Key Responsibilities: Develop and implement automation scripts to improve system...


  • London, Greater London, United Kingdom Fourier Full time

    Job Description: We are seeking an experienced Site Reliability Engineering Automation Specialist to join our team at Fourier. The ideal candidate will have a strong background in Python scripting, version control, and configuration management. In this role, you will be responsible for developing tools to enhance the production systems, increasing their...


  • London, Greater London, United Kingdom Reliability Plus Full time

    About Reliability Plus: We are a leading provider of web-based applications for financial institutions. Our mission is to deliver innovative solutions that streamline complex workflows. Our team works closely with clients to understand their needs and develop tailored designs that meet their business objectives.


  • London, Greater London, United Kingdom AVT Reliability Ltd Full time

    About AVT Reliability Ltd: We are a leading company in the field of asset integrity and reliability. Our team is passionate about delivering high-quality services to our clients. Job Summary: This is an exciting opportunity for a talented engineering graduate to join our Asset Integrity Division as a specialist. You will be responsible for supporting a diverse...


  • London, Greater London, United Kingdom Maintech Recruitment Full time

    Due to continued growth, Maintech Recruitment is seeking a skilled Reliability Specialist to support the asset care team. The role is field-based, working on several client sites in the food & beverage industry, focusing on RCM Studies, condition monitoring reports, critical assessments, and root cause analysis assessments. The role covers a significant...


  • London, Greater London, United Kingdom Fourier Full time

    Key Responsibilities: As a Site Reliability Engineer at Fourier, you will be responsible for designing and implementing tools to enhance the reliability and resilience of our production systems. This includes investigating failures, improving system performance, and automating manual processes. Required Skills: Excellent Python scripting skills. Experience with...


  • London, Greater London, United Kingdom Preqin Full time

    Role Overview: Preqin is seeking an experienced Site Reliability Manager to join our Engineering team. As a Site Reliability Manager, you will play a crucial role in designing, operating, and supporting our infrastructure, middleware, and internal services. Key Responsibilities: Design and operate scalable and highly available services, while establishing...


  • London, Greater London, United Kingdom Techruiter Full time

    We are a pioneering tech company specialising in cutting-edge Language Models (LLM) and Machine Learning solutions. About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team and ensure the reliability, scalability, and performance of our LLM and Machine Learning infrastructure. As an SRE, you will play a critical role in...


  • London, Greater London, United Kingdom Wayve Full time

    At Wayve, we're committed to fostering a culture of innovation and excellence. We're seeking a skilled Site Reliability Engineer to join our team and help us drive the development of our Embodied AI technology. As a Site Reliability Engineer, you will play a critical role in ensuring the seamless operation of our autonomous vehicles on public roads. Your...


  • London, Greater London, United Kingdom GoCardless Full time

    About the Role: We are seeking an experienced Site Reliability Engineer to join our team at GoCardless. The successful candidate will be responsible for designing, building, and maintaining our global platform, ensuring it is scalable, reliable, and secure. Key Responsibilities: Design and implement infrastructure solutions using AWS, GCP, and Kubernetes. Develop...


  • London, Greater London, United Kingdom Kinetech Full time

    At Kinetech, we're seeking a talented Site Reliability Engineer to join our team. This role is responsible for ensuring the smooth operation of our software systems, with a focus on scalability, reliability, and performance. Key Responsibilities: Design and implement CI/CD pipelines to automate code integration, testing, and deployment. Automate repetitive...


  • London, Greater London, United Kingdom Preqin Full time

    About the Role: Preqin is seeking an experienced Site Reliability Engineer to join our team in London. As a Site Reliability Engineer, you will work across Preqin's full suite of services, supporting our clients around the world. You will be responsible for designing, building, and operating our infrastructure, middleware, and CI/CD systems to ensure our teams...


  • London, Greater London, United Kingdom J Bandy Consulting Full time

    Job Summary: J Bandy Consulting is seeking an experienced Site Reliability Engineer to join our team. The ideal candidate will have a strong background in software engineering and a passion for building scalable and reliable systems. Key Responsibilities: Develop and implement automation tools to improve the efficiency of our systems. Collaborate with...


  • London, Greater London, United Kingdom Spectrum IT Recruitment Full time

    Job Title: Senior Site Reliability Engineer. We are partnering with a leading company to help them scale their digital marketplace consumer services. This role is crucial in streamlining software delivery pipelines, enhancing reliability, performance, and scalability of systems, and driving continuous improvement across the software lifecycle. You will be...


  • London, Greater London, United Kingdom GoCardless Full time

    The Role: GoCardless is looking for a Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining the infrastructure and systems that support our payment and open banking products. Key Responsibilities: Design and implement scalable and efficient infrastructure solutions. Develop...


  • London, Greater London, United Kingdom Matchtech Full time

    Aircraft Reliability Management Role: At Matchtech, we are seeking a skilled Reliability Specialist to join our team. As a key member of our reliability function, you will be responsible for the maintenance of aircraft airworthiness and reliability programmes. Your expertise will ensure the safe operation of aircraft, engines, components, and equipment. Key...


  • London, Greater London, United Kingdom J Bandy Consulting Full time

    Job Overview: J Bandy Consulting seeks a skilled Site Reliability Engineer to ensure the reliability, scalability, and performance of our systems. The successful candidate will be responsible for developing the SRE culture, applying automation, and monitoring application performance. Key Responsibilities: Drive the evolution of the DevOps/GitOps toolchain to...