Bio-data scientist | Multi-omics pipeline architect | Computational microbiome neurobiologist
I build fault-tolerant, production-ready pipelines that turn terabytes of raw omics data into testable biological insight. Recent work spans:
- Streaming multi-GB+ HMDB XML files on low-RAM machines using Python
iterparseand multithreading - Automating ligand preparation with Open Babel and AutoDockTools for structure-based drug discovery
- Dissecting the molecular dialogue between the human microbiome and brain through Human Accelerated Regions (HARs)
- Designing standardized, reproducible multi-omics workflows for large‐scale microbiome projects
- Mapping metabolomic signatures that stratify COVID-19 severity
- Using Mendelian randomization to link host genetics, gut microbes, and neuro-psychiatric risk
| Category | Toolkit |
|---|---|
| Languages | Python 3 (asyncio, pandas), R (tidyverse), Bash/Batch |
| Data Engineering | multiprocessing, xml.etree.ElementTree, Airflow |
| Bio-/Cheminformatics | HMDB, Open Babel, AutoDock Vina, PDB |
| Multi-omics Analytics | MetaboAnalystR, NMDC EDGE, Cytoscape |
| Statistics | Mendelian Randomization, multivariate metabolomics |
Full list → Professional portfolio
- I’m keen to collaborate on multi-omics standardization, resource-constrained bio-computing, and translational microbiome research.*
| Channel | Handle |
|---|---|
mscphd2301171011@iiti.ac.in |
|
| /in/sidsin0809 | |
| ORCID | 0000-0002-1388-6221 |
When I’m not wrangling cells and code, you’ll find me sketching data-driven sci-fi storyboards.