Company:
Panda Intelligence
Location: Boston
Closing Date: 23/10/2024
Hours: Full Time
Type: Permanent
Job Requirements / Description
A boutique consultancy specializing in Data Science support and strategy is seeking a Senior Data Engineer to support a large drug discovery project with a leading commercial Biopharma firm in Boston. Candidates are required to have 7+ years of experience in Python, experience with Nextflow and previous experience partnering with or working in Life Sciences firms is preferred.
Key Responsibilities:
- Design and implement scalable data pipelines for drug discovery and development using Python, Nextflow, and AWS.
- Collaborate with cross-functional teams, including biologists, bioinformaticians, and computational scientists, to develop workflows for analyzing large datasets (e.g., genomic, proteomic).
- Optimize and automate bioinformatics workflows in the cloud to handle complex datasets.
- Ensure performance, reliability, and cost-effectiveness of cloud infrastructure (AWS) for computational processes.
- Use modern data engineering practices to solve complex scientific problems in a fast-paced biotech environment.
- Mentor and guide junior team members in best practices for scalable cloud computing and workflow automation.
Required Skills & Qualifications:
- 7+ years of professional experience in data science or software engineering, with a focus on biotech or pharma projects.
- Full proficiency in Python for scientific computing, data manipulation, and workflow automation.
- Experience with Nextflow (or similar workflow management tools like Snakemake, Cromwell) for building complex bioinformatics pipelines.
- Hands-on experience with AWS (e.g., EC2, S3, Lambda, Batch, CloudFormation) for deploying and managing scalable workflows in the cloud.
- Familiarity with containerization tools like Docker and orchestration tools like Kubernetes is a plus.
- Solid understanding of bioinformatics tools and computational methods in genomics, proteomics, or other omics fields is a plus.
Preferred Qualifications:
- Familiarity with machine learning or AI tools used in drug discovery is a strong advantage.
- Experience with distributed computing systems (HPC, AWS Batch, or Lambda).
- Knowledge of compliance and security standards in cloud computing for regulated industries like pharma.
If interested, please apply below and share with your network.
Share this job
Panda Intelligence