Siddharth Singh SidSin0809

Hello, I’m Siddharth Singh

Bio-data scientist | Multi-omics pipeline architect | Computational microbiome neurobiologist

I build fault-tolerant, production-ready pipelines that turn terabytes of raw omics data into testable biological insight. Recent work spans:

Streaming multi-GB+ HMDB XML files on low-RAM machines using Python iterparse and multithreading
Automating ligand preparation with Open Babel and AutoDockTools for structure-based drug discovery
Dissecting the molecular dialogue between the human microbiome and brain through Human Accelerated Regions (HARs)
Designing standardized, reproducible multi-omics workflows for large‐scale microbiome projects
Mapping metabolomic signatures that stratify COVID-19 severity
Using Mendelian randomization to link host genetics, gut microbes, and neuro-psychiatric risk

Category	Toolkit
Languages	Python 3 (asyncio, pandas), R (tidyverse), Bash/Batch
Data Engineering	multiprocessing, `xml.etree.ElementTree`, Airflow
Bio-/Cheminformatics	HMDB, Open Babel, AutoDock Vina, PDB
Multi-omics Analytics	MetaboAnalystR, NMDC EDGE, Cytoscape
Statistics	Mendelian Randomization, multivariate metabolomics

I’m keen to collaborate on multi-omics standardization, resource-constrained bio-computing, and translational microbiome research.*

When I’m not wrangling cells and code, you’ll find me sketching data-driven sci-fi storyboards.