ABOUT MISO
The Midcontinent Independent System Operator, Inc. (MISO) ensures the reliable delivery of electricity, at the lowest cost, across high-voltage power lines in 15 U.S. states and the Canadian province of Manitoba. 42 million Americans depend on us to keep the power flowing. MISO also conducts transmission planning and manages the buying and selling of wholesale electricity in one of the world's largest energy markets. MISO is a not-for-profit, member-based organization governed by an independent board of directors and headquartered in Carmel, Indiana.
POSITION OVERVIEW
This position will be the technical expert for the ITOC monitoring tooling, helping the organization through a transformation of both process and technology. The individual hired for this role will partner across IT SME groups to ensure the reliability of MISO's most critical systems. It is a hands-on technical role that contributes to the success of MISO by managing enterprise monitoring of critical and non-critical system layers through the implementation of various measuring and monitoring systems. This position is a strong advocate and catalyst for continual process and technology improvements related to all aspects of monitoring, including network, system, and application performance, capacity, health, and availability. This role will support the enterprise IT monitoring capabilities by maintaining, managing, and improving various management tools and reports that proactively identify and escalate issues and problems across team boundaries.
ESSENTIAL RESPONSIBILITIES
- Design, re-engineer, implement, manage, and develop monitoring tools, such as Solarwinds and Traverse, that will be used to support business decisions for monitoring systems' heart rate and capability. Solarwinds expertise is given priority.
- Transition from the existing legacy monitoring tool (Traverse) to Solarwinds.
- Set up and use Solarwinds NPM, SDM, and Atlas to discover and monitor a large IT network for potential problems. Problems could include network performance, power, malware intrusion, server faults, bandwidth capacity, storage capacity, server disk utilization, middleware, and application performance, as well as memory and processor utilization.
- Monitor the performance and capacity of network and computer systems using a variety of monitoring tools, including Solarwinds, Traverse, and Team Quest.
- Work with the Network/Infrastructure/Monitoring teams to develop and advocate for standard procedures to respond to fault, power, capacity or utilization alerts.
- Ensure the monitoring systems operate efficiently and are kept at the most current stable version/release using vendor-supplied updates and patches. Perform research and testing to verify the impact of installing all updates. Coordinate vendor support and ensure positive relationships are maintained.
- Develop robust performance analysis reporting from various performance reports for internal and external distribution.
- Proactively identify system deficiencies and assist in root cause analysis of system issues to minimize impact and future occurrence. Escalate issues as warranted.
- Review performance and capacity data and perform trend analyses to detect present and potential problems.
- Assist in designing and establishing standard SLAs and system/application thresholds.
- Understand system technical architecture and identify the performance implications for different system layers based on design discussions or architecture documents.
- Perform analysis and maintenance of system data and analysis of opportunities for technical and operational improvements.
- Execute initiatives to reduce failures and defects and to improve overall performance.
- Utilize industry resources to identify new and innovative techniques and best practices.
- Serve as champion for new techniques as appropriate.
- Contribute to technical presentations to educate teams on how to improve performance and capacity.
- Provide capacity performance information to support technology refresh projects.
- Ability to make timely recommendations to effectively solve problems, using independent judgment consistent with standards, practices, policies, procedures, regulations, and/or law.
- Ability to work in a team/group setting and collaborate by providing transparency in performance results.
- Ability to work in an organization that is experiencing extreme change.
- Must be available 24x7 for network emergencies or Major Incidents. Some evening and/or weekend work may be necessary, depending on workload.
QUALIFICATIONS
- Bachelor's degree in a technical field, or 5+ years of equivalent relevant work experience, required
- Minimum of five years' experience working in complex communication environments
Appropriate level will be determined based upon experience and knowledge
TECHNICAL CAPABILITIES
- Demonstrated experience in NOC environments
- Network and System documentation and mapping experience, using Visio
- Creating standard and customized alerts for Enterprise IT Monitoring solutions.
- Building and developing application performance monitoring
- Demonstrated ability to deliver threshold guidance for alerting
- Advanced experience with Solarwinds monitoring software, preferred
- Advanced experience with Traverse monitoring software, preferred
- Developing, designing, and continuously improving enterprise monitoring
- Able to apply mastery of hardware and software monitoring principles and technologies
- Working knowledge of network protocols and routing, network, server, and host operating systems
- Proficient with scripting and relevant coding languages
- Ability to present complex data to groups of internal and external customers in a clear and concise manner
- Proficient in planning, designing, and supporting enterprise monitoring solutions
- Must have the ability to understand the application, infrastructure, and critical facility portfolio at MISO in order to evaluate performance and capacity risk
- Advanced network performance analysis and debugging in complex networks
Midcontinent Independent System Operator, Inc. (MISO)
1125 Energy Drive
St. Paul
Minnesota United States
www.midwestiso.org