Job Title: Senior Monitoring Tool Engineer
Location: REMOTE
Duration: 6 Months+
MOI: Skype
Job Summary:
We are seeking an experienced Senior Monitoring Tool Administrator to join our infrastructure support team, specializing in Microsoft System Center Operations Manager (SCOM) in an environment with a strong automation focus. The successful candidate will be responsible for designing, implementing, and maintaining our monitoring and observability solutions, ensuring the smooth operation of our hybrid environment, which includes Windows, Linux, and other legacy resources. Team members will collaborate with other teams to clearly define monitoring standards for different IT products, capture those standards in code (e.g., SCOM management packs and related configuration/infrastructure as code), and automate the lifecycle of SCOM objects, dashboards, maintenance windows, and similar resources via orchestration and DevOps-style methodologies. Team members will also assist consumers with setting up queries and dashboards that display the health of monitored systems and services, and candidates should be familiar with the KPIs and SLOs/SLIs used to measure service quality.
This is a senior role that requires strong technical expertise, excellent problem-solving skills, and the ability to work collaboratively with cross-functional teams in a fast-paced, agile environment.
Required Technology Skills:
- In-depth knowledge of Microsoft System Center Operations Manager (SCOM) and its components
- Experience with SCOM management best practices, including customization of management packs (collection, rules, actions, etc.), integrations, and automation
- Strong PowerShell experience and development skills
- Familiarity with Windows and Linux operating systems
- Basic knowledge of cloud and hybrid environments, including Azure and AWS
- Experience with automation tools, such as PowerShell and Jenkins
- Understanding of IT service management frameworks, such as ITIL
- Familiarity with other monitoring and observability tools, such as AppDynamics, ThousandEyes, Nagios, Prometheus, or Grafana
- Strong understanding of network protocols and architecture
- Experience with database management systems, such as SQL Server (to support SCOM dependencies)
- Familiarity with related DevOps tools and methodologies, including:
  - Bash scripting
  - Python development
  - HTTP-based APIs
  - Git source control (Azure DevOps, GitHub, GitLab, etc.)
  - CI/CD (Jenkins, Azure DevOps, GitHub Actions, GitLab)
  - Vault or Secrets Management
  - PKI and TLS certificates
  - Ansible
- SCOM component dependencies (Windows Server, Windows Clusters, MS SQL with Always On replicas)
- TCP/IP Networking, DNS, Firewalls, etc.
Desired Skills:
- Excellent communication and interpersonal skills, with the ability to work with technical and non-technical stakeholders
- Strong problem-solving and analytical skills, with the ability to troubleshoot complex issues
- Experience with task planning and execution, including project management methodologies such as Agile and Waterfall
- Ability to create and maintain technical documentation, including knowledge base articles, build documents, and runbooks
- Good understanding of IT security and compliance frameworks
- Ability to work in a fast-paced environment, with a focus on continuous improvement and innovation
- Strong customer service skills, with a focus on delivering high-quality support to internal and external customers
- Familiarity with containerization concepts (e.g., Docker) and container orchestration concepts (e.g., Kubernetes)
Experience:
- Minimum 5 years of experience in a monitoring and observability role, with a focus on Microsoft SCOM (or equivalent experience with a similar tool, basic experience with SCOM, and the ability and willingness to actually read the entire SCOM manual and master the tool quickly because you’re just that good)
- Preferred 7+ years of experience in a senior monitoring and observability role, with a focus on Microsoft SCOM
- Relevant industry certifications, such as MCSA or MCSE, are desirable
From:
Sankhi Tudu,
Vyze Inc
studu@vyzeinc.com
Reply to: studu@vyzeinc.com