Embark Veterinary Site Reliability Engineer in Boston, Massachusetts
Discover your dog more than fur deep with the most comprehensive DNA test on the market. Designed by world leaders in dog genetics, in partnership with Cornell University, the Embark DNA Test tells owners what breeds make up their pets, how to prevent possible future health problems, and what features and traits their pet might have. Help us end preventable disease in dogs and improve the lives of pets and their people through genomics.
Embark is the only dog DNA test using a research-grade DNA microarray, letting us give our customers the most accurate and comprehensive results on the market. More importantly, it allows us to do ongoing research into the genetics of dogs, which are a fantastic population for genetic discovery due to selective breeding over time. Our research focuses on mapping new traits and diseases, improving personalized veterinary medicine, and developing new breeding programs to eliminate preventable diseases in pets.
Interested in joining? We're looking for highly motivated and driven employees who will help us stay on the cutting edge of creativity and innovation in the fast-growing consumer genetics space.
The Site Reliability Engineer at Embark is a software engineer who focuses on ensuring the reliability and uptime of Embark's critical systems through monitoring, automation, strong DevOps principles, and collaboration with system owners. This includes optimizing existing systems as well as building reliability into the design of new systems.
Review and improve the design and implementation of existing and new systems to ensure they are built for high availability
Implement logging, monitoring, and alerting across websites, batch processing systems, and other AWS-based services to detect and prevent production issues
Design and implement disaster recovery architecture, systems, and protocols
Optimize delivery of releases to production to balance reliability, safety, and speed
Continuously improve Embark's incident response process and systems
Automate all the things whenever possible, and document manual, repeatable actions so we can automate them when the time is right
Design, build and maintain core infrastructure as code
Be a technology and DevOps evangelist for the rest of the company
At least a Bachelor's degree in Computer Science or equivalent practical experience
Recent experience in an SRE role or equivalent
In-depth, hands-on professional experience designing for reliability and uptime on core AWS services, including:
EC2, S3, CloudWatch, RDS/Aurora (PostgreSQL), Route 53, Elastic Beanstalk, ElastiCache, Lambda, VPC/networking
Expertise with automation tools, including:
Infra-as-code, e.g. Terraform or CloudFormation
CI/CD pipelines, e.g. CircleCI, Jenkins, or GitHub Actions
Experience optimizing Linux-based systems for reliability and uptime
Expertise with log analysis, monitoring, and alerting, e.g. with CloudWatch
Expertise with performance monitoring, preferably with New Relic
Systems administration, automation and scripting with bash
Experience contributing to modern Python 3 applications
Mypy types, pytest, packaging and deployment, versioning
Knowledge of professional software engineering best practices for the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
Excellent written and verbal communication skills and an ability to work with peers and customers
Curiosity to troubleshoot systems and an eagerness to make it easy for others to troubleshoot
Willingness to participate in an on-call rotation, including some weekends and holidays
What We Offer
Dog-friendly office near South Station, Boston
Perks tailored for dog lovers including subsidized dog-walking services and paw-ternity leave
Startup perks with big-company benefits
Competitive salaries, all-inclusive health care, and equity participation
A flexible vacation policy along with paid maternal and paternal leave
Fully-stocked office snack bar and regular office events
New iMacs and MacBook Pros, or laptops running Linux
Continuing education including attending conferences
Embark is an equal opportunity workplace and values diversity at our company. We are committed to equal employment opportunities regardless of race, color, ancestry, religion, sex, national origin, citizenship status, sexual orientation, age, disability status, marital status, gender identity or expression, veteran status, or any other characteristics protected by federal, state or local laws. See also EEO is the Law (https://www.eeoc.gov/sites/default/files/migratedfiles/employers/posterscreenreaderoptimized.pdf).