National Grid
Platform Owner AIOps SRE
Waltham, MA
Jan 11, 2025
Full Job Description

About us

Every day we deliver safe and secure energy to homes, communities, and businesses. We are there when people need us the most. We connect people to the energy they need for the lives they live. The pace of change in society and our industry is accelerating, and our expertise and track record put us in an unparalleled position to shape the sustainable future of our industry.

To be successful we must anticipate the needs of our customers, reducing the cost of energy delivery today and pioneering the flexible energy systems of tomorrow. This requires us to deliver on our promises and always look for new opportunities to grow, both ourselves and our business.

IT and Digital works in a harmonised partnership with the National Grid group of diverse energy businesses to deliver technology which revolutionises the way we operate. As we lead the charge towards a carbon-free future, our teams are embracing disruptive changes in our industry by working with Agile methodologies and adopting Digital mindsets to drive efficiency and bring new capabilities for our internal and external customers.

Our work here is critical. National Grid moves energy to millions of homes and businesses in the UK and US and the technology we utilise to complete that task is down to us. The successful applicant for this position will be an integral contributor towards this goal and we will support your professional development as part of our multi-cultural, customer-centric global team.

National Grid is hiring a Platform Owner, AIOps and SRE. This is a hybrid opportunity open to our offices in Waltham, MA; Brooklyn, NY; and Syracuse, NY.

Job Purpose

As a Platform Owner of AI Ops and SRE, your primary objective is to design and oversee the implementation of complex systems that meet functional and non-functional requirements. You will play a key role in developing system design policies, standards, and innovation processes specific to AI Ops and SRE. Additionally, you will actively monitor emerging technologies and assess their potential impact on the organization. Your responsibilities will include driving the strategic vision for AI Ops and SRE within the platform, ensuring alignment among stakeholders, and promoting a cohesive approach to AI Ops and SRE implementation.

Key Accountabilities

As a Platform Owner of AI Ops and SRE, your primary responsibility is to develop comprehensive strategies for implementing AI Ops and SRE practices within the organization. This involves understanding business requirements, assessing technical capabilities, and identifying areas where AI and automation can be leveraged to enhance reliability, performance, and operational efficiency.
* Strategic Leadership: Define and execute comprehensive strategies for implementing AIOps and SRE practices aligned with business objectives.

* Cloud Architecture solutions: Design scalable and resilient cloud architectures to support energy-sector-specific applications, leveraging AIOps for predictive monitoring and automated incident response.

* SRE Implementation: Establish and promote SRE principles, including reliability engineering, service-level objectives, and monitoring strategies tailored to energy systems.

* AIOps Integration: Oversee the implementation of AIOps platforms, ensuring the seamless integration of AI-driven insights into IT operations.

* Collaboration: You will partner closely with engineering and operations teams to provide technical guidance and ensure the successful implementation of AI Ops and SRE practices. This involves reviewing designs, providing recommendations, and promoting best practices for building and operating reliable and efficient cloud-based applications.

* Continuous Improvement: Monitor and enhance system performance through iterative refinement of AIOps and SRE strategies and practices across the data center and cloud domains. This involves understanding business requirements, assessing technical capabilities, and identifying opportunities to leverage AI and automation for improved reliability and performance.

* Implementing AI-Driven Monitoring and Analytics: You will implement AI-driven monitoring and analytics solutions within the cloud domain. This includes leveraging machine learning and data analysis techniques to identify and predict system anomalies, performance bottlenecks, and potential failures.

* Managing the infrastructure platform within budget guardrails to ensure alignment with company priorities and goals. Collaborating with Transversal Teams to align Non-Functional Requirements (NFRs) and prioritize them jointly.

Qualifications

* Bachelor's degree in a relevant discipline, or an equivalent combination of education, training, and experience.
* 5 - 7 years of related experience with cloud platforms such as Azure (preferred), Amazon Web Services (AWS), or Google Cloud Platform (GCP), which is essential for managing and optimizing cloud-based infrastructure.
* Containerization and Orchestration: Proficient in Docker and Kubernetes for deploying and managing containerized applications at scale.
* Infrastructure-as-Code (IaC): Knowledgeable in Terraform and AWS CloudFormation for automating infrastructure provisioning and management.
* Monitoring and Observability: Familiar with tools like Prometheus, Grafana, ServiceNow, ELK Stack, and Splunk for system performance monitoring and troubleshooting.
* Continuous Integration and Continuous Deployment (CI/CD): Experienced with CI/CD pipelines and tools such as GitHub and GitLab CI/CD.
* Configuration Management: Knowledge of configuration management tools like Ansible, Puppet, or Chef is valuable for managing and automating configuration changes across infrastructure and application environments.
* Proficiency in incident management tools like ServiceNow, PagerDuty, or VictorOps, as well as collaboration platforms like Slack or Microsoft Teams, is essential for effective incident response and coordination.
* Understanding of networking concepts, protocols, and security best practices is important for managing network infrastructure, implementing secure access controls, and ensuring system and data protection.
* Database Technologies: Knowledge of database technologies such as MySQL, PostgreSQL, MongoDB, or Redis is valuable for managing and optimizing database systems and ensuring data integrity and availability.

Your Rewards

Rewarding work and a collaborative, team-oriented culture are just the beginning. Review our digital benefit guide at ngbenefitslivebrighter.com for full details and descriptions.

More Information

#LI-RK1 #LI-HYBRID

Salary

Waltham: $179k - $211k a year

Brooklyn: $192k - $226k a year

Syracuse: $160k - $188k a year

This position has a career path that provides advancement opportunities within and across bands as you develop and evolve in the position by gaining experience and expertise and by acquiring and applying technical skills. Candidates will be assessed and provided offers against the minimum qualifications of this role and their individual experience.

National Grid is an equal opportunity employer that values a broad diversity of talent, knowledge, experience and expertise. We foster a culture of inclusion that drives employee engagement to deliver superior performance to the communities we serve. National Grid is proud to be an affirmative action employer. We encourage minorities, women, individuals with disabilities and protected veterans to join the National Grid team.

PDN-9df09146-837f-4d2c-8444-675ad514eea4
Job Information
Job Category: Skilled Labor