Service Operation Center

6 months ago


London, United Kingdom VAST Data Full time

This is a great opportunity to be part of one of the fastest-growing infrastructure companies in history, an organization that is in the center of the hurricane being created by the revolution in artificial intelligence.

**"VAST's data management vision is the future of the market."- Forbes**

VAST Data is the data platform company for the AI era. We are building the enterprise software infrastructure to capture, catalog, refine, enrich, and protect massive datasets and make them available for real-time data analysis and AI training and inference. Designed from the ground up to make AI simple to deploy and manage, VAST takes the cost and complexity out of deploying enterprise and AI infrastructure across data center, edge, and cloud.

Our success has been built through intense innovation, a customer-first mentality and a team of fearless VASTronauts who leverage their skills & experiences to make real market impact. This is an opportunity to be a key contributor at a pivotal time in our company’s growth and at a pivotal point in computing history.

The Services Operations Center (SOC) is a team within Customer Success whose function is to monitor the quality of service of VAST Data clusters deployed in the field, and take the necessary actions in the case of service degradation or outage. The person that works in a SOC is a SOC Operator. This person will be looking at a dashboard of monitored items, and when green lights turn red, they will click on the red light, read a runbook, and endeavor to resolve the issue. If they can fix the problem in 30 mins, they will. If they can not fix the problem, they will declare an ‘incident’ and page in Support who will begin troubleshooting, and begin keeping a timeline of events. If Support can't fix the problem, the SOC Operator will page in R&D. The SOC Operator will 'run' the incident until the issue is resolved. Once resolution occurs, the SOC operator close out ticket, and publish a Preliminary Findings Report in the ticket. All the while, the SOC Operator will provide a play-by-play in the internal slack channel to keep everyone aware of what's happening.

**SOC Operator Job Description**

**Position Title**: Services Operations Center (SOC) Operator

**Job Summary**: As a SOC Operator, you will be responsible for monitoring and maintaining the health and performance of our fleet of installed clusters. You will work in a 24/7 operations environment, ensuring the availability, reliability, and security of services. This role involves real-time monitoring, incident detection, incident management, incident resolution, and clear written and verbal communication with other teams and stakeholders.

**Responsibilities**:

- Monitor clusters using internal monitoring tools to detect and troubleshoot issues promptly.
- Respond to alerts and incidents in a timely manner, following standard operating procedures (SOPs) and escalation processes.
- Perform initial investigation and diagnosis of problems, escalating complex issues to support or R&D.
- Document incidents, including their details, troubleshooting steps, and resolutions in the incident tracking system.
- Collaborate with other teams, including Support, R&D, Account teams, and customers to ensure effective incident resolution and communication.
- Conduct routine checks and audits to identify potential problems or vulnerabilities.
- Assist with the implementation of changes and updates to the infrastructure as directed by team leads.
- Participate in shift-based work schedules, including nights, weekends, and holidays, to provide 24/7 coverage in the SOC.
- Maintain up-to-date knowledge of storage technologies, industry trends, and best practices.
- Adhere to security protocols and ensure the confidentiality, integrity, and availability of network and system data.
- Contribute to the development and improvement of SOC processes and procedures.
- Provide excellent customer service to internal and external stakeholders during incident resolution and communication.

**Requirements**:

- High school diploma or equivalent; a degree or certification in information technology or a related field is a plus.
- Proven experience as a SOC Operator or in a similar network monitoring role is preferred.
- Strong understanding of networking concepts, protocols, and technologies (TCP/IP, SNMP, DHCP, DNS, etc.).
- Ability to work independently and collaboratively in a team-based environment.
- Excellent problem-solving and analytical skills, with the ability to multitask effectively.
- Good communication skills, both written and verbal, to interact with technical and non-technical stakeholders.
- Willingness to work in a 24/7 shift-based environment, including nights, weekends, and holidays.
- Detail-oriented and committed to maintaining accurate documentation.
- Demonstrated commitment to continuous learning and self-improvement.

**Note**: This job description is intended to provide a general overview of the responsibilities



  • London, Greater London, United Kingdom DiverseJobsMatter Full time

    We are seeking a Data Center Operations Expert to join our team in London. As a Data Center Operations Expert, you will play a key role in ensuring the smooth operation of our data centers and related infrastructure.About the JobThis is an exciting opportunity for a motivated and experienced engineer to join our team and contribute to the delivery of our...


  • London, Greater London, United Kingdom ENGINEERINGUK Full time

    Data Center Operations ManagerAbout the RoleAs a Data Center Operations Manager at ENGINEERINGUK, you will be responsible for managing the world's largest Cloud Computing Infrastructure. This role requires a unique blend of technical expertise, leadership skills, and problem-solving abilities. You will oversee the design, planning, delivery, and operation of...


  • London, Greater London, United Kingdom Amazon Full time

    Data Center Operations EngineerAmazon Web Services is seeking an experienced Data Center Operations Engineer to join our team in Korea. This critical role involves ensuring the physical infrastructure of our data centers operates at 100% availability while providing first-class customer service.This position is responsible for the on-site management of shift...


  • London, Greater London, United Kingdom Amazon Full time

    About the RoleWe are seeking a highly skilled Data Center Operations Manager to join our team at Amazon. As a key member of our data center operations team, you will be responsible for ensuring the smooth operation of our data centers, including managing a team of facilities managers and engineers, and overseeing the maintenance and repair of critical...


  • London, Greater London, United Kingdom Amazon Full time

    Data Center Operations ManagerAmazon is seeking a highly skilled Data Center Operations Manager to join our team. As a key member of our operations team, you will be responsible for ensuring the smooth operation of our data centers, including managing teams of engineers, maintaining existing operational facilities, and helping to build and bring online new...


  • London, Greater London, United Kingdom Amazon Full time

    About the RoleAs a Data Center Facility Manager at Amazon, you will play a crucial role in ensuring the smooth operation of our global data centers. Your primary focus will be on managing and developing teams of engineers, providing technical and leadership expertise to ensure the highest levels of performance and availability.Key ResponsibilitiesEnsure...


  • London, Greater London, United Kingdom One Avenue Group Full time

    Job ResponsibilitiesThe Assistant Center Director will be responsible for overseeing the daily operations of our center, including:Maintaining high standards of cleanliness and hospitalityManaging on-site staff, including Client Experience Assistants, cleaners, handymen, and contractorsCoordinating office decoration, client office moves, and health and...


  • London, Greater London, United Kingdom Amazon Data Services UK Limited Full time

    About the JobWe are seeking an experienced Data Center Facility Engineer to join our team at Amazon Data Services UK Limited.Key ResponsibilitiesOversee the operation and maintenance of electrical and mechanical infrastructure for Data Centers (DC) in Amazon Web Services (AWS) Cloud regions.Ensure the implementation of safety procedures and maintain the...


  • London, Greater London, United Kingdom Adainfrastructure Full time

    About Ada InfrastructureWe are a global data center business dedicated to making a positive impact on technology, people, and the planet. With a world-class team of industry leaders, we aim to lead the industry in reliable, safe, secure, and sustainable digital infrastructure.SummaryThis is an exciting opportunity to join our team as the Head of EMEA Data...


  • London, Greater London, United Kingdom Amazon Full time

    Data Center Engineering Operations Opportunity at AmazonAs an AWS Data Center Engineering Operations Engineer, you will play a critical role in ensuring the smooth operation of our data centers. Your primary responsibility will be to oversee the maintenance and upkeep of our critical infrastructure, including electrical, mechanical, and fire/life safety...


  • London, Greater London, United Kingdom Equinix Full time

    Job DescriptionAs a Data Center Operations Director at Equinix, you will be responsible for overseeing the day-to-day operations of our data centers in assigned metros. This role requires strong leadership and management skills to drive strategic planning, prioritize tasks, and ensure consistent customer experience across all facilities.Responsibilities:Data...


  • London, Greater London, United Kingdom Vantage Data Centers Full time

    About UsVantage Data Centers is a leader in providing mission-critical data center infrastructure. Our team is dedicated to delivering exceptional service and reliability, ensuring our customers' critical operations run smoothly.


  • London, Greater London, United Kingdom L&G Recruitment Full time

    About the RoleWe are seeking a highly skilled Data Center Operations Expert to join our team at L&G Recruitment. This role involves designing and implementing network infrastructure solutions that meet the needs of our clients.The ideal candidate will have expertise in IP protocols, SD-WAN, and routing & switching. You will work closely with our clients to...


  • London, United Kingdom Amazon Data Services UK Limited - E17 Full time

    High school diploma or equivalent education - 18 years or older - Experience with Microsoft Office Amazon Web Services (AWS) is growing rapidly, and we are looking for trainee technicians to join our Data Center team as part of our Work-Based Learning Program (WBLP). These trainees will participate in our 12-week work and training program in an AWS Data...


  • London, Greater London, United Kingdom ENGINEERINGUK Full time

    Data Center Facility Operations SpecialistWe are looking for a skilled Data Center Facility Operations Specialist to join our Engineering and Facilities team at ENGINEERINGUK. In this role, you will be responsible for ensuring the availability of our customers by maintaining the technical infrastructure and operating the DC facility.Key...


  • London, Greater London, United Kingdom Equinix Full time

    Job OverviewElevate your career as a Data Center Operations Manager at Equinix, where you'll oversee the day-to-day operations of our global data centers. With over 260 facilities worldwide, we're the world's digital infrastructure company.The ideal candidate will possess a proven track record in datacenter/infrastructure operations management and...


  • London, Greater London, United Kingdom Microsoft Full time

    About the RoleAs a Data Center Technician at Microsoft, you will play a critical role in our mission to empower every person and organization on the planet to achieve more. You will develop an understanding of standard processes and procedures for preparing, installing, performing diagnostics, troubleshooting, replacing, and/or decommissioning equipment...


  • London, Greater London, United Kingdom Amazon Full time

    Amazon is seeking a talented Data Center Operations Technician to join its team. The successful candidate will be responsible for maintaining high operational standards in supporting server and network hardware and software.Key Responsibilities7x24 roster duty in data center operationsIncident first responder and follow call leader's instruction for hands on...


  • London, United Kingdom Hamilton Barnes 🌳 Full time

    Job Description Do you want to lead in one of the world’s most advanced trading environments? Join an elite Quant trading firm renowned for its cutting-edge technology and large-scale HPC cluster. Our client is seeking an experienced Data Center Operations Manager to oversee and optimize their global data center infrastructure. Responsibilities: Lead...


  • London, United Kingdom Hamilton Barnes Full time

    Do you want to lead in one of the world’s most advanced trading environments?Join an elite Quant trading firm renowned for its cutting-edge technology and large-scale HPC cluster. Our client is seeking an experienced Data Center Operations Manager to oversee and optimize their global data center infrastructure.Responsibilities:Lead and manage a global team...