Senior Site Reliability Engineer
$54.50 - $72.25/hr
Strongproficiencywith observability platforms (e.g., Datadog, Prometheus/Grafana, ELK/OpenSearch, Nagios, Nimsoft,etc). * Strong hands-on AWS experience building and operating production systems.
$54.50 - $72.25/hr
Strongproficiencywith observability platforms (e.g., Datadog, Prometheus/Grafana, ELK/OpenSearch, Nagios, Nimsoft,etc). * Strong hands-on AWS experience building and operating production systems.
$54.50 - $72.25/hr
Strongproficiencywith observability platforms (e.g., Datadog, Prometheus/Grafana, ELK/OpenSearch, Nagios, Nimsoft,etc). * Strong hands-on AWS experience building and operating production systems.
Midvale, UT · Hybrid
Monitoring and Logging Technologies (Nimsoft, SPLUNK, AppDynamics, Grafana) * DevOps principles (ADO, Kanban, Git) * Knowledge and/or experience of any scripting language like Bash, Perl, Python ...
Midvale, UT · Hybrid
Monitoring and Logging Technologies (Nimsoft, SPLUNK, AppDynamics, Grafana) * DevOps principles (ADO, Kanban, Git) * Knowledge and/or experience of any scripting language like Bash, Perl, Python ...
Midvale, UT · On-site
Monitoring and Logging Technologies (Nimsoft, SPLUNK, AppDynamics, Grafana) * DevOps principles (ADO, Kanban, Git) * Knowledge and/or experience of any scripting language like Bash, Perl, Python ...
Midvale, UT · On-site
Monitoring and Logging Technologies (Nimsoft, SPLUNK, AppDynamics, Grafana) * DevOps principles (ADO, Kanban, Git) * Knowledge and/or experience of any scripting language like Bash, Perl, Python ...
$60.75 - $80.75/hr
Strongproficiencywith observability platforms (e.g., Datadog, Prometheus/Grafana, ELK/OpenSearch, Nagios, Nimsoft,etc). * Strong hands-on AWS experience building and operating production systems.
$60.75 - $80.75/hr
Strongproficiencywith observability platforms (e.g., Datadog, Prometheus/Grafana, ELK/OpenSearch, Nagios, Nimsoft,etc). * Strong hands-on AWS experience building and operating production systems.
Midvale, UT · On-site
Monitoring and Logging Technologies (Nimsoft, SPLUNK, AppDynamics, Grafana) * DevOps principles (ADO, Kanban, Git) * Knowledge and/or experience of any scripting language like Bash, Perl, Python ...
Midvale, UT · On-site
Monitoring and Logging Technologies (Nimsoft, SPLUNK, AppDynamics, Grafana) * DevOps principles (ADO, Kanban, Git) * Knowledge and/or experience of any scripting language like Bash, Perl, Python ...
$18 - $22/hr
ServiceNow (or similar ticketing system tool) and Nimsoft (or a similar IT monitoring system tool). * Proficient use of all Microsoft products and operating systems. Certifications Relative and ...
$18 - $22/hr
ServiceNow (or similar ticketing system tool) and Nimsoft (or a similar IT monitoring system tool). * Proficient use of all Microsoft products and operating systems. Certifications Relative and ...
Stamford, CT · On-site
$60.75 - $80.75/hr
... Nimsoft, etc). • Strong hands-on AWS experience building and operating production systems. • Proven expertise with Infrastructure as Code (Terraform and/or CloudFormation/CDK). • Strong CI/CD ...
Stamford, CT · On-site
$60.75 - $80.75/hr
... Nimsoft, etc). • Strong hands-on AWS experience building and operating production systems. • Proven expertise with Infrastructure as Code (Terraform and/or CloudFormation/CDK). • Strong CI/CD ...
$7.45 - $8.76
2% of jobs
$8.76 - $10.07
0% of jobs
$10.07 - $11.39
0% of jobs
$11.39 - $12.70
0% of jobs
$12.70 - $14.01
5% of jobs
$14.01 - $15.32
11% of jobs
$16.27 is the 25th percentile. Wages below this are outliers.
$15.32 - $16.63
10% of jobs
The median wage is $17.73 / hr.
$16.63 - $17.94
27% of jobs
$19.01 is the 75th percentile. Wages above this are outliers.
$17.94 - $19.25
26% of jobs
$19.25 - $20.56
11% of jobs
$20.56 - $21.88
10% of jobs
$7
$17
$21
| Aspect | Nimsoft | Network Monitoring Engineer |
|---|---|---|
| Required Certifications | ITIL, Network+, SNMP certifications | CCNA, Network+, CompTIA certifications |
| Work Environment | IT service management, monitoring tools, data centers | Network infrastructure, enterprise networks, data centers |
| Employer & Industry Usage | IT service providers, managed service providers, large enterprises | Telecommunications, IT departments, network service providers |
Both Nimsoft and Network Monitoring Engineers focus on network performance and reliability. Nimsoft is a monitoring platform used by IT teams to oversee systems and applications, while Network Monitoring Engineers actively manage and troubleshoot network infrastructure. Understanding their roles helps organizations choose the right tools and personnel for network health and performance management.

$54.50 - $72.25/hr
Full-time
Medical, Dental, Life, Retirement, PTO
Posted 24 days ago
Own and improve service reliability through SLO/SLI definition, error budgets, and operational best practices.
Design, implement, and maintain observability (monitoring, logging, tracing, alerting) to reduce MTTR and improve proactive detection.
Design and operate highly available, fault-tolerant cloud architectures and implement resilient patterns across compute, storage, networking, and managed services.
Responsibilities:
Reliability Engineering & Operations
Own and improve service reliability through SLO/SLI definition, error budgets, and operational best practices.
Design, implement, andmaintainobservability (monitoring, logging, tracing, alerting) to reduce MTTR and improve proactive detection.
Lead incident response practices including on-call improvements, runbooks, post-incident reviews (RCA), and preventative actions.
Partner with application teams to improve performance, capacity planning, and resiliency under failure scenarios.
Infrastructure & Cloud Architecture
Design andoperatehighly available, fault-tolerantCloudarchitectures (multi-AZ and, whererequired, multi-region).
Implementresilient patterns across compute, storage, networking, and managed services (e.g., autoscaling, load balancing, backups, replication).
Drive cloud governance best practices (tagging, account/landing zone patterns, least privilege, guardrails) in partnership with security and platform teams.
Infrastructure as Code (IaC) & DevOps Enablement
Build and maintainIaCmodules and standards (e.g., Terraform, CloudFormation, CDK) for repeatable, auditable infrastructure delivery.
Develop, standardize, andoptimizeCI/CD pipelines to enable safe, automated deployments (e.g., GitHub Actions, GitLab CI, Jenkins, AWS CodePipeline).
Promote DevOps practices: version-controlled infrastructure, automated testing, immutable deployments, and progressive delivery patterns.
Establish environment consistency across dev/test/stage/prod and ensure infrastructure drift detection and remediation.
BCP/DR, RTO/RPO Definition & Testing
Collaborate with stakeholders to evaluate and define service-level RTO and RPO targets based on business and technical requirements.
Design and implement BCP/DR architectures and procedures (backups, restore workflows, replication, failover/failback, data integrity validation).
Coordinate and execute structured DR tests (tabletop, simulation, partial failover, full failover) and document outcomes.
Maintain DR runbooks, dependency maps, and recovery checklists; drive remediation of gapsidentifiedduring testing.
Produce metrics and reporting on DR readiness, test results, and continuous improvement actions.
Qualifications:
7+ years of experience in SRE, DevOps, Platform Engineering, or Systems Engineering roles supporting production environments.
Strongproficiencywith observability platforms (e.g., Datadog, Prometheus/Grafana, ELK/OpenSearch, Nagios, Nimsoft,etc).
Strong hands-on AWS experience building and operating production systems.
Provenexpertisewith Infrastructure as Code (Terraform and/or CloudFormation/CDK).
Strong CI/CD and automation background (pipeline design, deployment strategies, testing automation).
Experience defining and validating RTO/RPO, andimplementing BCP/DR plans with structured testing.
Experience with Kubernetes andauto-scalingcontainer platforms (EKS, ECS, or Kubernetes on-prem).
Strong Linux fundamentals, networking concepts (DNS, TCP/IP, load balancing), and troubleshooting skills.
Proficiencyin at least one scripting/programming language (Python, Go, Bash, or similar).
Ability to write clear operational documentation, runbooks, and post-incident reports.
Ability to work effectively in a fast-paced, dynamic and high-intensity environment including open-floor plan if applicable to the position, with timely responsiveness and the ability to work beyond normal business hours when required.
Preferred Qualifications:
Familiarity with Azure and/or Oracle Cloud (OCI).
Familiarity with Service Mesh, API Gateways, and distributed tracing tooling.
Familiarity withOpenTelemetry, client instrumentations and collector configurations.
Security and compliance familiarity in cloud environments (IAM design,secretsmanagement, audit logging).
Experience implementing progressive delivery (blue/green, canary), feature flags, and automated rollback.
Relevant certifications (AWS Solutions Architect/DevOps Engineer, Kubernetes CKA/CKAD).
Experience with ArgoCD & Karpenter.
Employee Programs & Benefits:
CCI offers competitive benefits and programs to support our employees, their families and local communities. These include:
Competitive comprehensive medical, dental, retirement and life insurance benefits
Employee assistance & wellness programs
Parental and family leave policies
CCI in the Community: Each office has a Charity Committee and as a part of this program employees are allocated 2 days annually to volunteer at the selected charities.
Charitable contribution match program
Tuition assistance & reimbursement
Quarterly Innovation & Collaboration Awards
Employee discount program, including access to fitness facilities
Competitive paid time off
Continued learning opportunities
Visit https://www.cci.com/careers/life-at-cci/# to learn more!
#LI-CD1
Sourced by ZipRecruiter
Oil and gas extraction
501 - 1,000 Employees
Stamford, CT, US
1997