Sr. Genomics Software Engineer

About TileDB


TileDB is the database designed for discovery, built by scientists to unlock innovation. TileDB structures all data types, including data that does not easily fit into relational databases. Built on a powerful shape-shifting array database, TileDB handles the complexities of non-traditional “unstructured” multimodal data, such as genomic variants, bulk and single-cell transcriptomics, proteomics, biomedical imaging, as well as the frontier data of the future.


Used by big pharma and biotechs to power their multiomic data platforms, TileDB is the destination for scientific breakthroughs where frontier multimodal data is driving drug discovery.

About the role

  • TileDB is hiring an experienced (Sr.+ level) genomics software engineer. TileDB is a Series B company empowering scientists and data teams to organize, structure, collaborate, and analyze all of their data to accelerate breakthroughs.
  • We are a high-trust, high-ownership environment with colleagues who bring decades of experience at companies including Arrikto, Amazon Web Services, Cloudant, Cockroach Labs, Hashicorp, Intel, Mesosphere, Meta Platforms, Puppet Labs, Raytheon, Sourcegraph, Vertica, and more.
  • We want to actively encourage anyone to apply if they are passionate about the mission of TileDB! Application rates can vary significantly among qualified candidates, so please consider applying even if you do not have experience in every single area/skill listed below (add a note in your cover letter identifying other particular areas of strength you might contribute).

What you'll do

  • ​​We are looking for an engineer with experience building high-performance bioinformatics software to join our team and drive the development tools for large-scale genomics analysis. You will be responsible for implementing new features in our platform, including in the open source TileDB-VCF and TileDB-SOMA libraries, owning features from design to implementation.

Qualifications

Required

  • 3+ years of software development experience
  • Strong experience with a JVM-based language (Java or Groovy preferred)
  • Experience with C++ and Python
    • ideally Python FFI tools, such as pybind11 or Cython
  • Experience with pipeline frameworks (e.g. Nextflow, WDL, Snakemake, or similar)
  • Experience with the analysis of high-throughput sequencing data (e.g., quality control, mapping, variant calling)

Nice to have

  • Experience with systems built on at least one cloud provider (AWS preferred)
  • Familiar with htslib and common data formats for genomics data (e.g., FASTQ, SAM/BAM/CRAM, GTF/GFF/GFF3, VCF/BCF, BED)
  • Experience with toolkits or file formats related to single-cell biology (e.g.: AnnData, Seurat).
  • Experience with any of R, Go, or Rust

Additional Details

  • Competitive salary (depending on location and experience)
  • Stock options in Series B company ($34m fund raise in Oct. 2023)
  • 100% medical and dental coverage for employee and any dependents
  • Paid time off (vacation, sick, and public holidays)
  • Flexible time off and flexible work hours
  • Fully remote within continental US timezones (GMT -4 to -7)
    • Exceptions may be made for outstanding candidates in EU timezones (GMT-3 to GMT+2)

Engineering

Remote (United States)

Share on:

Terms of servicePrivacyCookiesPowered by Rippling