We are looking for a Site Reliability Engineer (SRE) with an interest in distributed systems, cloud computing, and the architecture of large-scale systems. The SRE will ensure our InsightIDR services have the high reliability and uptime necessary to meet our customers’ needs.
Our InsightIDR product helps identify and address key cybersecurity risks to our customers. We apply AI, ML, threat intelligence, and BI to event sources, including desktops, servers, network switches, firewalls, cloud services, directory servers, DHCP servers, and SIEMs, in order to distill hundreds or thousands of daily events per customer into the few real, high-priority threats that need attention. Our systems ingest large amounts of data that need to be highly available and performant at all times.
As an SRE, you will work closely with our engineering team and partner teams throughout Rapid7 to help solve challenging problems at a massive scale.
In this role, you will:
Support services before they go live through activities such as design, deployment, migration strategy, monitoring, and playbook reviews
Maintain services once they are live by measuring and monitoring availability, latency, and overall system health
Ensure systems are scalable, apply automation, and drive service and infrastructure improvements
Troubleshoot production issues and liaise with relevant Engineering or Infrastructure teams to find a resolution
Participate in on-call support and incident response follow-ups such as post-mortems
Work closely with Engineering, Architecture, Infrastructure, and Product teams to improve the lifecycle of the InsightIDR services, from inception and design through deployment, operations, monitoring, security, upgrades, and maintenance
Continuously develop and refine your own skill set
Requirements:
3+ years of experience with Unix/Linux systems, IP networking, database operations, and performance optimization
Experience programming in Python or shell scripting, and knowledge of Git
Knowledge of AWS and its services such as EC2, RDS, and S3, and experience in cloud operations
Knowledge of monitoring systems
Strong written and verbal communication skills
Bachelor’s degree in Computer Science or a STEM-related field, or 3+ years of industry experience
Strong plus if you are familiar with:
Kubernetes and related technologies such as Docker
IaC, preferably Terraform
Observability approaches
ELK stack
RESTful architectures
CI/CD
SRE mindset
We know that the best ideas and solutions come from multi-dimensional teams. That’s because these teams reflect a variety of backgrounds and professional experiences. If you feel you have the required experience, please don’t be shy: apply today.
At Rapid7, we are on a mission to create a secure digital world for our customers, our industry, and our communities. We do this by embracing tenacity, passion, and collaboration to challenge what’s possible and drive extraordinary impact.
Here, we’re building a dynamic workplace where everyone can have the career experience of a lifetime. We challenge ourselves to grow to our full potential. We learn from our missteps and celebrate our victories. We come to work every day to push boundaries in cybersecurity and keep our 10,000 global customers ahead of whatever’s next.
Join us and bring your unique experiences and perspectives to tackle some of the world’s biggest security challenges.
Security and Compliance
Rapid7 is committed to keeping customers secure. As a first line of defense, all employees are expected to uphold the highest standards of security and privacy, ensuring the protection of sensitive information and compliance with relevant regulations.