Data Engineer

Company:  Probably Genetic
Location: San Francisco
Closing Date: 18/10/2024
Salary: £150 - £200 Per Annum
Hours: Full Time
Type: Permanent
Job Requirements / Description

About Probably Genetic

Probably Genetic is changing the lives of patients living with severe, complex diseases. Our data platform is used by drug developers and patient advocacy groups to develop and launch treatments for these patients. Our technology discovers undiagnosed patients online, analyzes their disease state using machine learning and at-home testing, and enables compliant communication with patients. In doing so, we help patients access diagnoses, clinical trials, and treatments as early as possible.

We are a tight-knit group of hard-working, ambitious problem solvers united by a mission greater than ourselves.

We do well by doing right by patients. Our annually recurring revenue is growing >6x year over year, we’re profitable, and our roadmap is packed with innovations in bioinformatics, machine learning, and drug development. We are building an all-star team to help us bring our vision to life, and we want you to be a part of it.

Probably Genetic has raised multiple rounds of funding from Silicon Valley’s best investors, including Threshold, Khosla, and Y Combinator, giving us the ability to pay competitive salaries, offer great benefits, and provide meaningful equity. We’re dedicated to ensuring your journey with us is unforgettable, with incredible team retreats to places like Barbados, the Alps, Mexico, Costa Rica, and Portugal, just to name a few.

About The Role

We are looking for a Data Engineer to build data pipelines and visualizations for key business metrics, machine learning (ML) models, and growth metrics to enable our data-driven organization.

What you will do

In this role, you will have the unique opportunity to use data to provide life-changing services, including genetic tests, to patients living with severe, complex diseases. This involves building data infrastructure to assess patient acquisition channel quality, ML training pipelines, and dashboards for business metrics driving all company decisions. You will own pipelines end-to-end while building beautiful visualizations and dashboards. You will play a pivotal role in enabling Probably Genetic to use our data to grow the business.

You will drive the Probably Genetic data infrastructure by:

  • Collaborating with stakeholders in Business Development, Growth, Product, ML and Finance to design data visualizations and pipelines
  • Developing ETL pipelines from a PostgreSQL database using Python Django and AWS services such as S3, Glue, and Athena
  • Integrating with external data sources such as Meta and Google Ad platforms, product analytics platforms, and financial platforms to extract valuable data and insights
  • Developing visualizations in a business intelligence (BI) tool of your choice to create reports and dashboards

You will collaborate internally to achieve data and company goals by:

  • Collaborating directly with the CEO and executive team on investor updates using data from BI reports and dashboards
  • Identifying missing data points which should be collected and working with the engineering team to collect all valuable data
  • Writing documentation to onboard team members to BI tooling and ensuring that datasets are available for ad-hoc analysis

Who you are

What will help you succeed in this role:

  • 5+ years of relevant experience in data engineering
  • Experience using Python and SQL for ETL pipelines and data analysis
  • Experience in data reporting and dashboarding using a variety of BI tools
  • Track record of solving extremely difficult, ambiguous problems

Some things that are not required, but you will learn on the job:

  • Experience in contributing to a Python Django backend
  • Expertise with healthcare data, bioinformatics, ML infrastructure, or growth marketing

As with all new hires at Probably Genetic, you will also need to be:

  • A good person. We work with some of the most marginalized populations on the planet and empathy is key
  • Patient-focused and motivated to have a lasting, positive impact on humanity
  • Comfortable in a fast-paced, often ambiguous environment with rapid change
  • Action-oriented and excited to build a company from the ground up

The salary range for this role is $127,000-$195,000 annually. Actual compensation offered will depend on several factors including but not limited to: work experience, education, skill level, and/or other business and organizational needs.

What we offer at Probably Genetic:

  • An engaging and supportive team
  • 30 days of vacation a year
  • Hybrid, flexible work
  • A “work from anywhere” policy, up to 4 weeks a year
  • Competitive equity grants
  • All-expenses paid quarterly team retreats
  • Benefits including medical and vision

This is a hybrid role that will require working on-site 3 days a week in San Francisco. Local candidates only. Relocation is not offered for this role.

How to apply

Please send your resume and anything else you'd like to share to with “Data Engineer application” as the subject line. We will get back to you as soon as we can. We can’t wait to meet you!

Probably Genetic is committed to fostering a welcoming and inclusive work environment for people of all genders, sexuality, ethnicity, socioeconomic background and life experiences. We urge candidates of all backgrounds to apply. If you require specific accommodations as you interview or consider working with us, please let us know.

Compensation Range: $127K - $195K

#J-18808-Ljbffr
Apply Now
An error has occurred. This application may no longer respond until reloaded. Reload 🗙