Yesterday
Site Reliability Engineer
Linus Health
GenOps Responsibility Profile
Runtime Ownership
✓ Yes
Control Plane
✓ Yes
Governance / Policy
✓ Yes
Observability / Telemetry
✓ Yes
Incident / Reliability
✓ Yes
Regulated Context
✓ Yes
AI / Model Runtime
✓ Yes
Primary Domain
cloud platform
Production Environment
prod
Classification Confidence
95%
Rationale: This is a clear GenOps role with ownership of production infrastructure via IaC (Terraform), 24/7 on-call rotation for incident response, and responsibility for security/compliance in a regulated healthcare context. The role combines control plane work (CI/CD, containerized services, networking) with governance scope (security, compliance) and reliability ownership (SRE practices, actionable alerting).
Job Description
Linus Health is a Boston-based digital health company transforming brain health worldwide. We combine cutting-edge neuroscience, clinical expertise, and AI to advance early detection and intervention for cognitive and brain disordersâempowering people to live longer, healthier lives. With 100+ team members and growing, we're entering a phase of accelerated growth and looking for top talent to help shape our future. Currently, we are looking for a Mid-level SRE to join our small but mighty team. This role will report to our Director of IT, Cloud & Security and work closely with our Staff SRE as well as other Engineering team members and cross functional team members. Please note that while this role is remote, you must be based in the US to be considered for this position. Unfortunately, we are not able to provide sponsorship at this time. What You'll Do: - Leverage infrastructure as code (Terraform) to build and maintain complex production and analytics workflows including networking and containerized services. - Rapidly diagnose and resolve faults in system services as part of a 24/7 on-call rotation focused on actionable alerting and eliminating toil. - Improve speed of delivery by developing and maintaining CI/CD pipelines. - Develop infrastructure automation leveraging Terraform, Python and Typescript. - Improve system availability, security, compliance, cost effectiveness and performance. - Estimate work, prioritize tasks, track dependencies, report progress, highlight blockers - Participate in continuous improvement initiatives, advocate for SRE best practices, and stay current with emerging technologies and trends. - Be part of a team where your focus will be on building, measuring, and refining the systems infrastructure that runs ouPlease mention the word **REALISTIC** and tag RMTA4LjE0LjI0My41OQ== when applying to show you read the job post completely (#RMTA4LjE0LjI0My41OQ==). This is a beta feature to avoid spam applicants. Companies can search these words to find applicants that read this and see they're human.