Bioinformatics Scientist
Develops and maintains bioinformatics pipelines for NGS data analysis, including variant calling, gene expression analysis, and genome assembly. Provides statistical modeling, scripting support, and experimental design consulting for NIAID researchers; requires 4+ years of experience and expertise in Python/R and pipeline frameworks.
Responsibilities
- Address bioinformatics, scientific computing, and data analysis needs of users within the Division of Intramural Research (DIR) at NIAID and collaborators by providing expertise in statistical design of biological experiments involving high-throughput genome-scale technologies, data mining, knowledge discovery, automated workflow development, and experimental design consulting.
- Provide bioinformatics support including somatic and germline variant calling and analysis, single-cell and bulk gene expression analysis, de novo assembly of genomes and transcriptomes, comparative genomics of model and non-model organisms, proteomics, ChIP-seq, ATAC-seq, and development of high-throughput pipelines.
- Provide programming and troubleshooting support to the Research Technologies Branch (RTB) of NIAID and collaborating institutes for research data dissemination.
- Perform computational data analysis on genomic and clinical research data.
- Work with staff on scientific programming and experimental design; provide statistical support/analysis on research data.
- Contribute to and co-author scientific publications in peer-reviewed journals.
- Contribute to the development of a scalable and flexible resource that provides NGS bioinformatics, structural biology, computational, and machine learning data support to NIAID scientists.
Requirements
- NGS analysis: whole-exome, whole-genome, RNA-seq, ChIP-seq (long- and short-read technologies).
- Statistical modeling.
- Scripting.
- Biological interpretation.
Specific Qualifications
- Minimum Bachelor's degree in a related field (Master's or Ph.D. preferred).
- 4+ years of experience in bioinformatics.
- 2+ years of experience developing and maintaining production bioinformatics pipelines.
- Expert in 2 of: Python, R, C/C++, sh/bash.
- Experience building images using 1 of: Docker, Podman, Singularity/Apptainer.
- Expert in 1 pipeline framework: Snakemake, Nextflow, CWL/WDL.
- Expertise in writing technical and user documentation for tools, pipelines, and applications.
- Experience building pipelines on-premises using HPC infrastructure.
- Plus: cloud experience (AWS/GCP/Azure), application frameworks (Django, Flask, RShiny, Electron, Flutter, React), experience in a laboratory setting.
Senior Data Scientist, Causal Inference
Leads causal inference and marketing mix modeling efforts to measure and optimize marketing investments for Lyft's Growth Products team. Requires 4+ years of experience, an advanced degree, and expertise in Python, SQL, and causal methods.
Senior Data Scientist, Trust & Safety
Drives Trust & Safety product decisions at a fintech company through experimentation, statistical modeling, and cross-functional collaboration. Requires 5-7 years of experience and expert SQL skills.