Skip to content
View Naila-Srivastava's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Naila-Srivastava

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Naila-Srivastava/README.md

👋 Hello, I’m Naila Srivastava

Bioinformatician - Genomics & Pipelines

MSc Bioinformatics & Systems Biology | University of Manchester 🇬🇧
Open to global opportunities


About Me

I am a Bioinformatician specialising in the analysis of high-dimensional genomic data, including NGS (WES/WGS) and Single-cell transcriptomics. I focus on developing reproducible, scalable pipelines using Python, R, and Nextflow to interpret complex biological datasets and prioritise variants for precision medicine. Currently, I am leading the design of high-throughput analytics pipelines that bridge the gap between large-scale data engineering and biological discovery.

Currently building & exploring

  • Scalable WES Pipelines: Deploying reproducible variant calling workflows on HPC environments using SLURM to identify pathogenic hits in rare disease cohorts.
  • Cancer Transcriptomics: Utilising TCGAbiolinks and DESeq2 to identify differentially expressed genes (DEGs) and validating them via OncoKB.
  • Multi-Omics Annotation: Designing frameworks for GWAS SNP prioritization using FUMA, eQTL mapping, and BioMart.
  • Single-Cell Atlases: Analysing scRNA-seq datasets to identify cell-type-specific regulatory factors and transcription factors across developmental stages.
  • Reproducible Research: Building research-grade pipelines that follow Ensembl best-practice guidelines.

Skills Overview

Bioinformatics & Pipeline Engineering

  • Omics Expertise: WES/WGS, bulk/single-cell RNA-seq, Spatial Transcriptomics, GWAS, and Multi-omics Integration.

  • Programming: Python (Pandas, Scikit-learn, TensorFlow), R (Bioconductor, Tidyverse), Bash, SQL, and Git/GitHub.

  • Pipeline Orchestration: Nextflow, Snakemake, and Linux/Unix environments.

  • NGS Tool Stack:

    • Alignment & Processing: STAR, BWA, Bowtie, SAMtools, Picard, and BEDTools.
    • Variant Calling & Annotation: GATK, bcftools, VEP, GTEx, and GWAS Catalog.
    • Differential Expression & GWAS: DESeq2 and PLINK.
    • Computation & Cloud: SLURM, HPC environments, and AWS (EC2, S3, Lambda).

Data Science & Analytics

  • Machine Learning: Supervised/Unsupervised Learning, Deep Learning for omics, and Statistical Validation of ML Models.
  • Product Analytics: Amplitude tracking, Looker Studio, and Google Analytics.
  • Visualisation: R (ggplot2, Shiny), Python (Matplotlib, Plotly), Tableau, and Power BI.

Writing & Contributions

Medium Publications

I write about trends and deep-dive workflows and tutorials in bioinformatics, translating biological complexity into analytical clarity.

📖 Visit My Medium Profile

Kaggle Datasets & Notebooks

I actively contribute curated datasets and code with real-world relevance:

  • Drugs, Conditions & Side Effects: (5.5K+ views | 1.2K+ downloads).
  • Life Expectancy Analysis: (6K+ views | 1.5K+ downloads).

🔗 Visit My Kaggle Profile

🌐 Let’s Connect

I am open to collaborations and research roles in Bioinformatics, Genomics, and Precision Medicine.

"In a world full of noise, I turn complex biological data into meaningful signals."


Pinned Loading

  1. Variant_Calling-WES-RareDisease Variant_Calling-WES-RareDisease Public

    Variant calling pipeline for WES data of a MOPD-II affected neonate, as part of a rare disease case study. This project demonstrates a reproducible analysis framework for rare disease genomics rese…

    Shell 1

  2. Cancer_Transcriptomics_Analysis Cancer_Transcriptomics_Analysis Public

    RNA-Seq differential expression analysis of TCGA-COAD (Colorectal Adenocarcinoma) using DESeq2, clusterProfiler, and OncoKB. Includes data preprocessing, normalization, DEG identification, function…

    R 1

  3. GWAS-SNPs-Annotation-Prioritization GWAS-SNPs-Annotation-Prioritization Public

    This repository contains a fully reproducible pipeline for prioritising candidate genes and pathways from GWAS summary statistics using integrative pGWAS functional annotation to identify the most …

    R 1

  4. Life-Expectancy-Analysis Life-Expectancy-Analysis Public

    Advanced ML & DL project analyzing global health and socio-economic indicators to predict life expectancy.

    Jupyter Notebook 1

  5. Drugs_Side-Effects_Conditions_Analysis Drugs_Side-Effects_Conditions_Analysis Public

    A data analysis project on real-world drug data exploring drug effectiveness and side effects. Includes visualizations and an interactive lookup tool ready for use.

    Jupyter Notebook

  6. Covid19-Clinical-Trials-Analysis Covid19-Clinical-Trials-Analysis Public

    Tracking the global fight against COVID-19 through data-driven visualizations

    Jupyter Notebook