Backend Bioinformatics Engineer

Design, evolve, and industrialize production bioinformatics pipelines and the associated backend services to improve reliability, turnaround time (TAT), traceability, and operability of analyses.

Key Responsibilities

1) Pipeline Development & Industrialization

  • Develop and maintain production bioinformatics pipelines (e.g., FASTQ → BAM, BAM → VCF, VCF → annotated VCF) with a high standard of quality, reproducibility, and robustness.
  • Contribute to major refactoring initiatives (e.g., modularization, standardization, pipeline tracing, and performance optimization).
  • Build and maintain testing and validation capabilities (test datasets, QA reports, regression testing, automation).

2) Backend & APIs (Platform / Service Orientation)

  • Design and implement orchestration and backend components for distributed analysis execution: job scheduling, dependency management, launching, tracking, and exposing outputs (e.g., APIs/endpoints, internal services, asynchronous jobs).
  • Improve observability (logs, Prometheus metrics, Grafana dashboards, trace collection) and maintainability(clean architecture, managing technical debt).
  • Collaborate with DevOps/Infra on CI/CD, Cloud execution, and resource management.

3) Interfaces & Coordination (Subsystem Responsibilities)

  • Work closely with stream-aligned teams and other subsystems (e.g., Knowledge, AI Core, BioScience) on interface contracts: inputs/outputs, SLAs, formats, versioning.
  • Maintain a clear technical backlog (e.g., Notion/GitLab) with concise, pragmatic specs and consistent prioritization.

Expected Deliverables (Examples)

  • Stable, tested, observable pipelines with controlled turnaround time (TAT).
  • Reusable validation assets: test datasets + QA reports + validation plans.
  • Documented, versioned backend APIs/services with monitoring in place.
  • High-quality documentation and traces aligned with quality/regulatory expectations.

Required Skills

  • Bioinformatics Core
    • Strong mastery of NGS concepts and formats (FASTQ/BAM/VCF, annotation, QC, reproducibility).
    • Ability to investigate discrepancies (data, algorithm, implementation) and deliver robust fixes.
    • Experience with Nextflow or similar workflow engines.
  • Backend Engineering (Key Target of the Role)
    • Experience designing backend services (APIs, async processing, orchestration, queues/jobs).
    • Solid software engineering practices (testing, CI, refactoring, pragmatic design).
    • Strong production mindset: performance, observability, reliability.
    • Familiarity with database optimization (e.g., Cassandra, PostgreSQL) and time-series data, metrics storage, or observability data pipelines.
  • Collaboration & Organization
    • Clear communication across interfaces, dependencies, and trade-offs.

Profile

  • Degree in bioinformatics / computer science / related field.
  • Significant experience in production environments (pipelines + software engineering).
  • Hybrid profile: scientific rigor in approach, engineering excellence in execution.

Nice to Have

  • Knowledge of regulatory/RAQA requirements in production systems.

Job Type: Full-time, Permanent

Pay: From 50,000.00€ per year

Work Location: Hybrid/Remote. Company based in Montpellier, France (34000).

Apply for this position