Staff Data Engineer

Company:  Karius Inc
Location: Redwood City
Closing Date: 24/10/2024
Salary: £150 - £200 Per Annum
Hours: Full Time
Type: Permanent
Job Requirements / Description

About Karius

Karius is a venture-backed life science startup that is transforming the way pathogens and other microbes are observed throughout the body. By unlocking the information present in microbial cell-free DNA, we're helping doctors quickly solve their most challenging cases, providing industry partners with access to 1000’s of biomarkers to accelerate clinical trials, discovering new microbes, and reducing patient suffering worldwide.

Position Summary

We are seeking a seasoned Staff Data Engineer to drive data platform initiatives across the data value chain at Karius. We develop and operate AI-driven data analytics pipelines to deliver life-saving results in the highly complex infectious disease landscape. In this role, you will have the opportunity to develop and optimize the data platform to enable our users to extract insights from large amounts of commercial, operational, genomic, and clinical data, ultimately providing actionable insights that serve the business and patients. You will be able to incorporate the next generation of AI technologies and tools (e.g. Generative AI) into the Karius data platform to significantly increase delivered value to internal stakeholders and our customers.

Reports to: Sr. Manager, Analytical Systems & Data Insights

Location: Redwood City, CA (Hybrid)

Primary Responsibilities

  1. Data Platform Development: Design, develop, test, deploy, and maintain production-grade platforms and tooling that add value across the data lifecycle (ingest, transform, serve) for various use cases such as reporting, data analytics, machine learning, and bioinformatics.
  2. Data Architecture: Ensure centralized, standardized, and secure data access across business domains.
  3. Data Pipelines and Reporting: Combine internal (including PHI) and external data sources via ETL/ELT tooling to calculate operational and commercial KPIs for data analysis and reporting use-cases.
  4. ML/AI Data Platform Capabilities: Provide support for computational teams to drive value from clinical and genomic data assets through the ML lifecycle and data science tooling.
  5. Generative AI Integrations: Implement generative AI solutions to increase the value and usability of Karius data assets.
  6. Project Management: Proactively interface with cross-functional technical and non-technical stakeholders to identify unmet needs, ensure alignment with data initiatives, and communicate results and outcomes.
  7. Collaboration: Coordinate with the engineering and IT domains to understand, interface with, and extend production data and software systems using engineering best practices.
  8. Data Governance: Inform, implement, and follow data governance best practices and policies in conjunction with the Security and Compliance team.
  9. Continuous Improvement: Foster a growth mindset and self-starter attitude, continually seeking opportunities for process and system improvements.

Position Requirements

We are seeking a data engineer with exceptional system thinking. Critical to this role is the ability to grasp business needs, identify the complexity and interconnections of data elements, and determine the desired insights to extract. The ideal candidate will excel in translating business requirements into a technical roadmap and developing remarkable solutions to satisfy those needs.

Educational Background

• B.S. degree in Computer Science, Software Engineering, Electrical Engineering, Bioengineering, or related technical fields involving algorithms or coding.

Professional Experience

• 10+ years of data engineering / software development experience with at least 5 years of relevant experience in building enterprise-scale data platforms.

Technical Skills

  1. Data Platforms and Cloud Services: Hands-on experience with data platforms (e.g. Databricks - strongly preferred, Snowflake) and cloud services (e.g AWS - strongly preferred, GCP, Azure).
  2. Data Integration and Pipelines: Experience with ETL/ELT tools (e.g. Fivetran, Stitch, Airbyte) for integrating internal and third-party data sources.
  3. Batch and Stream Processing: Experience in building scalable infrastructure for batch processing (e.g., Spark, Hadoop) and stream processing (e.g., Kafka, Kinesis).
  4. Developer Toolset: Proficiency in programming languages for data engineering (i.e. Python and SQL) applied in conjunction with SDLC principles.
  5. Data Modeling and Architecture: Strong conceptual understanding of data modeling and practical experience with enterprise data models.
  6. Reporting and Visualization: Experience with reporting and dashboard tools (e.g. Looker, Streamlit, Tableau, PowerBI).
  7. ML Tooling: Familiarity with data science tooling such as notebooks and standard data processing/visualization libraries.
  8. Generative AI: Working knowledge of generative AI concepts and hands-on experience with frameworks and tooling.
  9. Data Governance and Compliance: Demonstrated experience in implementing and maintaining data governance and compliance frameworks.

Non-Technical Skills

  1. Ability to work in a fast-paced, dynamic startup environment.
  2. Strong organizational and time management abilities.
  3. Excellent communication and collaboration skills.
  4. Attention to detail and commitment to delivering high-quality solutions.

Nice-to-haves

• Experience working in the healthcare or life sciences industries, especially within a diagnostics setting.

Disclaimer

The above job description is intended to describe the general nature and level of work being performed by individuals assigned to this position. Responsibilities and duties may change or be adjusted to meet the needs of the company.

#J-18808-Ljbffr
Apply Now
An error has occurred. This application may no longer respond until reloaded. Reload 🗙