Pharma & Biotech

Vanda Pharmaceuticals

Genome Analysis Data Platform

1 unified web platform

6x faster data processing

The client needed a scalable genome analytics tool to investigate the role of genetic mutations in disease vulnerability.

Industry: Pharma & Biotech
Location: United States
Budget: $200,000–$500,000
Services: CI/CD deployment, Warehouse design, Web portal

Challenge

The project involved processing hundreds of terabytes of sequencing data from COVID-19 patients.

Tech Stack

To deliver a scalable genome analysis platform, Blackthorn AI applied:

Angular
CloudWatch
Apache Airflow
Lambda
Roadmap

Project duration

01 Month

Business goal validation & architecture

Defined core goals and designed a scalable solution to support high-throughput genomics workflows.

02 Month

Pipeline design & genotyping workflows

Implemented raw data ingestion, quality checks, and genotyping processes using AWS Batch and Docker.
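
As an illustration of the AWS Batch step, here is a minimal sketch of how one genotyping job per sample might be described. The queue name, job definition, script name, and bucket path are hypothetical placeholders, not the project's actual configuration; only the parameter keys follow the AWS Batch SubmitJob request shape.

```python
import re

# Hypothetical names: the queue, job definition, and command below are
# placeholders for illustration, not the project's real configuration.
JOB_QUEUE = "genomics-queue"
JOB_DEFINITION = "genotyping-job:1"


def build_genotyping_job(sample_id: str, fastq_uri: str) -> dict:
    """Build keyword arguments for an AWS Batch SubmitJob request."""
    # Batch job names allow only letters, digits, hyphens, and underscores,
    # so any other character in the sample ID is replaced with a hyphen.
    safe_id = re.sub(r"[^A-Za-z0-9_-]", "-", sample_id)
    return {
        "jobName": f"genotype-{safe_id}",
        "jobQueue": JOB_QUEUE,
        "jobDefinition": JOB_DEFINITION,
        "containerOverrides": {
            # The Docker image is fixed by the job definition; only the
            # per-sample command and environment vary here.
            "command": ["run_genotyping.sh", fastq_uri],
            "environment": [{"name": "SAMPLE_ID", "value": sample_id}],
        },
    }


params = build_genotyping_job(
    "sample 001", "s3://example-bucket/raw/sample_001.fastq.gz"
)
print(params["jobName"])  # genotype-sample-001
```

With boto3 installed and AWS credentials configured, a dictionary like this could be passed as `boto3.client("batch").submit_job(**params)`.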

03 Month

Annotation & batch compute setup

Deployed variant calling and annotation using GATK, VEP, and LOFTEE in containerized GPU environments.

04 Month

Web portal development

Built researcher-facing UI with secure data access, variant query tools, and result visualization.

05 Month

Analytics engine setup

Designed an analytical data warehouse (Redshift) and implemented fast SQL for complex genomic queries.
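
To give a sense of what a genomic query against such a warehouse could look like, the sketch below counts distinct carriers of high-impact variants per gene. It is an assumption-laden illustration: the `variant_calls` table, its columns, and the rows are synthetic, and Python's built-in SQLite stands in for Redshift so the example runs anywhere. The `HIGH` and `LOW` labels mirror the impact ratings that VEP assigns to variants.

```python
import sqlite3

# In-memory SQLite as a stand-in for the warehouse; the schema and the
# rows are invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE variant_calls (sample_id TEXT, gene TEXT, impact TEXT)"
)
conn.executemany(
    "INSERT INTO variant_calls VALUES (?, ?, ?)",
    [
        ("S1", "GENE_A", "HIGH"),
        ("S2", "GENE_A", "HIGH"),
        ("S2", "GENE_B", "LOW"),
        ("S3", "GENE_B", "HIGH"),
        ("S3", "GENE_B", "HIGH"),  # second high-impact variant, same sample
    ],
)


def high_impact_carriers(connection: sqlite3.Connection) -> list:
    """Return (gene, carrier_count) pairs for high-impact variants."""
    query = """
        SELECT gene, COUNT(DISTINCT sample_id) AS carriers
        FROM variant_calls
        WHERE impact = 'HIGH'
        GROUP BY gene
        ORDER BY carriers DESC, gene
    """
    return connection.execute(query).fetchall()


print(high_impact_carriers(conn))  # [('GENE_A', 2), ('GENE_B', 1)]
```

Counting distinct samples rather than rows keeps a sample with several variants in the same gene from being counted more than once.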

06 Month

CI/CD autoscaling & deployment

Automated full deployment, lifecycle management for hot/cold genomic data, and fault-tolerant scaling.
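
One common way to manage hot and cold data on AWS is an S3 lifecycle rule that moves objects to cheaper storage classes as they age. The sketch below assumes the genomic data lives in S3, which the case study does not state; the prefix and day thresholds are hypothetical. The dictionary follows the shape expected by S3's lifecycle configuration API.

```python
def build_lifecycle_config(prefix: str, warm_after_days: int = 30,
                           cold_after_days: int = 90) -> dict:
    """Build an S3 lifecycle configuration that tiers objects by age."""
    # The thresholds are illustrative assumptions, not project settings.
    if not 0 < warm_after_days < cold_after_days:
        raise ValueError("expected 0 < warm_after_days < cold_after_days")
    return {
        "Rules": [
            {
                "ID": f"tier-{prefix.strip('/').replace('/', '-')}",
                "Filter": {"Prefix": prefix},
                "Status": "Enabled",
                "Transitions": [
                    # Infrequently accessed data moves to a cheaper tier first...
                    {"Days": warm_after_days, "StorageClass": "STANDARD_IA"},
                    # ...and later to archival storage.
                    {"Days": cold_after_days, "StorageClass": "GLACIER"},
                ],
            }
        ]
    }


config = build_lifecycle_config("raw/fastq/")
print(config["Rules"][0]["ID"])  # tier-raw-fastq
```

With boto3, a configuration like this could be applied through `put_bucket_lifecycle_configuration(Bucket=..., LifecycleConfiguration=config)` on an S3 client.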

Team Size

6 Qualified AI Experts
Bioinformatics Engineer
Backend Developer
Frontend Developer
AWS Cloud Engineer
Data Engineer
DevOps / Airflow Engineer

Delivering Impact

Beyond the values already highlighted, there’s even more to discover. Our commitment to innovation, client success, and impactful results sets us apart.

1000+

WGS samples processed

Enabled high-throughput sequencing workflows for COVID-19 susceptibility screening

100%

automation of genomics workflows

Fully containerized and orchestrated pipeline across GATK, VEP, FastQC, and LOFTEE

6x

faster data processing

Reduced sample analysis time from 4 days to under 6 hours via batch orchestration and autoscaling compute
