Site Reliability Engineer, Cloud Operations
2 months ago
This is what a career at River Island is like. And this is where yours starts.
What We Are Looking For...We are looking for someone to build, manage and support our website and cloud infrastructure. Contribute to the specification, delivery and support of new and existing public cloud hosted enterprise and microservice applications.
Key Accountabilities:
- Proactively monitor, maintain and support River island’s website and Cloud Infrastructure running Dev , UAT and production systems
- Work closely with all dev teams and other product teams to implement and support tools and services
- Participate in continuously improve all aspects of Operations
As a Site Reliability Engineer in Cloud and Infrastructure Practice, a primary goal is to support and continuously improve the reliability and maintainability of the platform running website and other products
- Build, manage and maintain River island’s website and existing IT estate hosted in AWS
- Develop and maintain CloudOps tools and services
- Increase observability and drive improvements in security, scalability, reliability, performance, cost optimisation and governance
- Triage, troubleshoot & resolve front-line Production Support alerts and tickets
- Participate in incident response, disaster recovery, and production support testing and implementations
- Work closely with our support and engineering teams to support and implement technical solutions
- Lead and execute projects initiated by operations team to improve all aspects of operations
- Work with project teams to ensure successful realisation of deliverables
- Work with 3rd party vendors to support existing products and delivery new solutions
- Provide out of hours cover for Digital Platforms in line with support rota
- Act as SME for all digital and website related incidents, projects or problems
- Participate in knowledge share session with team members. Keep internal documentation up to date.
Required:
- Solid system admin experience in supporting critical production system running Windows and Linux OS as well as familiarity with MS SQL server
- Strong knowledge of AWS and serverless technologies
- Strong scripting and automation experience
- Excellent working knowledge of popular monitoring/alerting/dashboarding solutions (CloudWatch, New Relic, Prometheus, Grafana)
- Terraform skills and experience maintaining reusable modules
- Good exposure to CI/CD practices and helping technical teams support their deployment pipelines
- Strong knowledge of core IT services like Active directory, DNS, DHCP, Networking and Storage
- Good exposure to configuration, change, release, incident, and problem management
- Excellent problem-solving skills and operational mindset.
- AWS - EC2/ECS/S3/ELB/ALB/NLB/RDS/IAM/VPN/Direct connect/Lambda/AWS networking
- Terraform (AWS as the provider)
- GitHub
- IIS / webservers
- MS SQL
- Awareness of ITIL Service Management & Agile delivery (Scrum/Kanban) in a DevOps culture
- Jira / Confluence
- Ability to translate technical issues into business language
- Trainer mentality to upskill co-workers
- ServiceNow
- Treats team members with consideration and respect
- Demonstrates a flexible and collaborative approach
- Sets high personal standards
- Builds positive and constructive relationships with colleagues
- Proactive and considered approach
- Demonstrates energy and resourcefulness in addressing business needs and requirements
- Focuses on delivering best possible outcome and value for River Island and for the customer
- Takes personal responsibility for team and departmental success
- Takes a positive and responsive approach to identifying, raising and resolving risks and issues
- Must have a real passion for modern enterprise tech.
This Is For You...
- Discount - Generous 50% staff discount so you can treat yourself & a bargain staff shop, all onsite
- RI Rewards - Reducing Islanders everyday expenses through discounts, benefits, financial advice, wellbeing solutions and more with Reward Gateway
- Island culture - We have a free onsite gym, subsidised restaurant & café & various social events throughout the year. We also work closely with the Retail Trust to create dedicated support for all our Islanders
- Work that stays at work - Flexible working is a given, on top of payday early finishes and Summer Fridays
- Family Hub - Every family is unique, we support Islanders with all different family setups enhanced maternity, paternity, adoption & fertility treatment
- Giver Island - Give as you earn scheme, a ‘Giver Island’ day each year and matched funding
- Training on the job - Support with upskilling skills through on the job training and qualifications
- Pension - A contributory private pension scheme
- Bonus - A generous bonus scheme
- Healthcare - With the choice to opt in for healthcare through our provider AXA
- Holiday - 25 days paid holiday, exclusive of Bank Holidays. With the added option to purchase additional holiday for whatever the need
Keeping You Safe...At River Island we are committed to the safeguarding of all of our employees regardless of age or job role. We will fulfil our obligation under the Prevent duty which seeks to stop extremism and extremist views from materialising in our business. We promote and encourage the belief in British Values- including democracy, the rule of law, individual liberty and mutual respect and tolerance of different faiths and beliefs. To find out more, please visit www.gov.uk
Every Islander Counts
Our Island is made up of a diverse community, where we all belong and feel part of something bigger. We are committed to equality of opportunity and welcome applications from individuals, regardless of age, gender, ethnicity, disability, sexual orientation, gender identity, socio-economic background, religion and/or belief. We will consider flexible working requests for all roles unless operational requirements prevent otherwise.
-
Site Reliability Engineer
2 weeks ago
London, Greater London, United Kingdom Preqin Full timeAbout the Role:Preqin is seeking an experienced Site Reliability Engineer to join our team in London. As a Site Reliability Engineer, you will work across Preqin's full suite of services, supporting our clients around the world.You will be responsible for designing, building, and operating our infrastructure, middleware, and CI/CD systems to ensure our teams...
-
Cloud Engineer and Site Reliability Leader
2 weeks ago
London, Greater London, United Kingdom Google Full timeAbout the RoleAt Google, we're looking for a talented Cloud Engineer and Site Reliability Leader to join our team. As a key member of our SRE organization, you'll be responsible for designing, building, and operating large-scale distributed systems that meet the high standards of reliability, scalability, and performance.We're seeking someone with 8+ years...
-
Site Reliability Engineering Lead
2 weeks ago
London, Greater London, United Kingdom BenevolentAI Full timeAbout the Role:BenevolentAI is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a Senior Site Reliability Engineer, you will be responsible for designing and implementing software solutions for cloud infrastructure, improving long-term infrastructure availability and reliability, and monitoring and handling incident response of...
-
Site Reliability Engineer
2 months ago
London, United Kingdom Switch Tech Talent Full timeRole: Site Reliability Engineer Location: London/Hybrid (3 days a week in office) Salary: £75,000 Key Skills: AWS, IaC, Docker, Scripting As a Site Reliability Engineer you will be at the forefront of maintaining robust, scalable, and secure cloud solutions that power this cutting-edge e-commerce platform. Your expertise will ensure seamless,...
-
Site Reliability Engineer
2 months ago
London, United Kingdom Bright Purple Full timeSite Reliability Engineer – London - Hybrid (3 Days onsite) Step into a role that promises not just a job, but a rewarding career with a leading tech unicorn. What is in it for you: Salary up to £75,000 including equity in the company Hybrid working arrangements with global offices Generous holiday allowance Private healthcare Professional and...
-
Cloud Reliability Engineer
2 weeks ago
London, Greater London, United Kingdom GoCardless Full timeAbout the RoleWe are seeking an experienced Cloud Reliability Engineer to join our distributed team at GoCardless. As a key member of our engineering team, you will be responsible for designing and implementing scalable and reliable infrastructure solutions.With a strong interest in infrastructure management and site reliability engineering, you will...
-
Site Reliability Engineering Leader
2 weeks ago
London, Greater London, United Kingdom Rewardgateway Full timeEngineering, LondonEarn a salary of £110,000 - £130,000 per year with Reward Gateway.We are seeking an experienced Site Reliability Engineer to lead our team and drive the transformation of our operational workloads to a Service Reliability Engineering (SRE) approach. The successful candidate will be responsible for establishing and managing our new SRE...
-
Site Reliability Engineer
2 days ago
London, United Kingdom HCLTech Full timeHCLTech is a global technology company, home to 219,000+ people across 54 countries, delivering industry-leading capabilities centered on digital, engineering and cloud, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Financial Services, Manufacturing, Life...
-
Site Reliability Engineer
2 days ago
London, United Kingdom HCLTech Full timeHCLTech is a global technology company, home to 219,000+ people across 54 countries, delivering industry-leading capabilities centered on digital, engineering and cloud, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Financial Services, Manufacturing, Life...
-
Site Reliability Engineer
2 weeks ago
London, United Kingdom HCLTech Full timeJob Description HCLTech is a global technology company, home to 219,000+ people across 54 countries, delivering industry-leading capabilities centered on digital, engineering and cloud, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Financial Services,...
-
Site Reliability Engineer
1 day ago
London, United Kingdom ZipRecruiter Full timeJob Description HCLTech is a global technology company, home to 219,000+ people across 54 countries, delivering industry-leading capabilities centered on digital, engineering and cloud, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Financial Services,...
-
Site Reliability Engineer
2 weeks ago
London, United Kingdom HCLTech Full timeHCLTech is a global technology company, home to 219,000+ people across 54 countries, delivering industry-leading capabilities centered on digital, engineering and cloud, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Financial Services, Manufacturing, Life...
-
Site Reliability Engineer
2 weeks ago
London, United Kingdom HCLTech Full timeHCLTech is a global technology company, home to 219,000+ people across 54 countries, delivering industry-leading capabilities centered on digital, engineering and cloud, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Financial Services, Manufacturing, Life...
-
Site Reliability Engineer
2 weeks ago
London,, UK, United Kingdom HCLTech Full timeHCLTech is a global technology company, home to 219,000+ people across 54 countries, delivering industry-leading capabilities centered on digital, engineering and cloud, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Financial Services, Manufacturing, Life...
-
Site Reliability Engineering Lead
2 weeks ago
London, Greater London, United Kingdom BenevolentAI Full time**Job Overview:**We are seeking a highly skilled Senior Site Reliability Engineer to join our team at BenevolentAI. As a key member of our squad, you will play a crucial role in ensuring the reliability and scalability of our cloud infrastructure.The ideal candidate will have a strong background in software development, with experience in implementing cloud...
-
Site Reliability Engineer
1 month ago
London, United Kingdom Switch Tech Talent Full timeRole: Site Reliability Engineer Location: London/Hybrid (3 days a week in office) Salary: £75,000 Key Skills: AWS, IaC, Docker, ScriptingAs a Site Reliability Engineer you will be at the forefront of maintaining robust, scalable, and secure cloud solutions that power this cutting-edge e-commerce platform. Your expertise will ensure seamless, reliable,...
-
Site Reliability Engineer
2 months ago
London, United Kingdom Switch Tech Talent Full time €75,000Role: Site Reliability Engineer Location: London/Hybrid (3 days a week in office) Salary: £75,000 Key Skills: AWS, IaC, Docker, Scripting As a Site Reliability Engineer you will be at the forefront of maintaining robust, scalable, and secure cloud solutions that power this cutting-edge e-commerce platform. Your expertise will ensure seamless,...
-
Site Reliability Engineer
2 months ago
London, United Kingdom Switch Tech Talent Full time €75,000Role: Site Reliability Engineer Location: London/Hybrid (3 days a week in office) Salary: £75,000 Key Skills: AWS, IaC, Docker, Scripting As a Site Reliability Engineer you will be at the forefront of maintaining robust, scalable, and secure cloud solutions that power this cutting-edge e-commerce platform. Your expertise will ensure seamless,...
-
Site Reliability Engineer
2 weeks ago
London Area, United Kingdom HCLTech Full timeHCLTech is a global technology company, home to 219,000+ people across 54 countries, delivering industry-leading capabilities centered on digital, engineering and cloud, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Financial Services, Manufacturing, Life...
-
Site Reliability Engineer
2 days ago
London Area, United Kingdom HCLTech Full timeHCLTech is a global technology company, home to 219,000+ people across 54 countries, delivering industry-leading capabilities centered on digital, engineering and cloud, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Financial Services, Manufacturing, Life...