Naila Srivastava Naila-Srivastava

👋 Hello, I’m Naila Srivastava

Bioinformatician - Genomics & Pipelines

MSc Bioinformatics & Systems Biology | University of Manchester 🇬🇧
Open to global opportunities

About Me

I am a Bioinformatician specialising in the analysis of high-dimensional genomic data, including NGS (WES/WGS) and Single-cell transcriptomics. I focus on developing reproducible, scalable pipelines using Python, R, and Nextflow to interpret complex biological datasets and prioritise variants for precision medicine. Currently, I am leading the design of high-throughput analytics pipelines that bridge the gap between large-scale data engineering and biological discovery.

Currently building & exploring

Scalable WES Pipelines: Deploying reproducible variant calling workflows on HPC environments using SLURM to identify pathogenic hits in rare disease cohorts.
Cancer Transcriptomics: Utilising TCGAbiolinks and DESeq2 to identify differentially expressed genes (DEGs) and validating them via OncoKB.
Multi-Omics Annotation: Designing frameworks for GWAS SNP prioritization using FUMA, eQTL mapping, and BioMart.
Single-Cell Atlases: Analysing scRNA-seq datasets to identify cell-type-specific regulatory factors and transcription factors across developmental stages.
Reproducible Research: Building research-grade pipelines that follow Ensembl best-practice guidelines.

Skills Overview

Bioinformatics & Pipeline Engineering

Omics Expertise: WES/WGS, bulk/single-cell RNA-seq, Spatial Transcriptomics, GWAS, and Multi-omics Integration.
Programming: Python (Pandas, Scikit-learn, TensorFlow), R (Bioconductor, Tidyverse), Bash, SQL, and Git/GitHub.
Pipeline Orchestration: Nextflow, Snakemake, and Linux/Unix environments.
NGS Tool Stack:
- Alignment & Processing: STAR, BWA, Bowtie, SAMtools, Picard, and BEDTools.
- Variant Calling & Annotation: GATK, bcftools, VEP, GTEx, and GWAS Catalog.
- Differential Expression & GWAS: DESeq2 and PLINK.
- Computation & Cloud: SLURM, HPC environments, and AWS (EC2, S3, Lambda).

Data Science & Analytics

Machine Learning: Supervised/Unsupervised Learning, Deep Learning for omics, and Statistical Validation of ML Models.
Product Analytics: Amplitude tracking, Looker Studio, and Google Analytics.
Visualisation: R (ggplot2, Shiny), Python (Matplotlib, Plotly), Tableau, and Power BI.

Writing & Contributions

Medium Publications

I write about trends and deep-dive workflows and tutorials in bioinformatics, translating biological complexity into analytical clarity.

📖 Visit My Medium Profile

Kaggle Datasets & Notebooks

I actively contribute curated datasets and code with real-world relevance:

Drugs, Conditions & Side Effects: (5.5K+ views | 1.2K+ downloads).
Life Expectancy Analysis: (6K+ views | 1.5K+ downloads).

🔗 Visit My Kaggle Profile

🌐 Let’s Connect

I am open to collaborations and research roles in Bioinformatics, Genomics, and Precision Medicine.

"In a world full of noise, I turn complex biological data into meaningful signals."

Provide feedback

Saved searches

Use saved searches to filter your results more quickly