Skip to content
View SyedIkram's full-sized avatar

Block or report SyedIkram

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
SyedIkram/README.md

Hey, I'm Ikram

Lead Data Engineer with 10+ years across data engineering, machine learning, and software engineering. I build data platforms from the ground up and scale them.

Currently exploring RAG pipelines and LLM-powered data systems. Open to Lead/Senior Data Engineering roles in Toronto.


What I Do

  • Design and build end-to-end data platforms (batch + streaming, medallion architecture, facts/dimensions)
  • Lead data engineering teams — hiring, mentoring, and shipping
  • Bridge data engineering and ML — from pipelines to production models

Tech I Work With

Data Engineering: Apache Spark, PySpark, Airflow (MWAA), AWS Glue, Databricks, Snowflake, Delta Lake, Azure Data Factory, Synapse

Cloud: AWS (S3, Redshift, RDS, DynamoDB, Lambda) · Azure (Data Lake Gen2, SQL Server, CosmosDB)

ML & AI: PyTorch, Keras, scikit-learn, SparkML, NLP

Languages: Python, SQL, JavaScript


Professional Experience

Lead Data Engineer at TalkShopLive — Oct 2023 – Mar 2026
  • Founded the data engineering function from zero — built the entire data platform, pipelines, and analytics infrastructure as the first data hire.
  • Architected a real-time video analytics system processing millions of events across iOS, Android, Web, and SDK platforms.
  • Designed and implemented a custom clickstream collection system (build vs. buy decision that saved significant vendor costs).
Lead Data Engineer at Best Buy Canada — Sep 2020 – Sep 2023
  • Led a team of 14 data engineers (onshore + offshore) delivering data products across the organization.
  • Built an ML-driven Margin Based Bidding system that generated $1M+ in incremental revenue.
  • Established data governance practices and engineering standards across the data platform.
Machine Learning Intern at Flipboard — Sep 2019 – Dec 2019
  • Developed and shipped AutoGen — a recommendation engine generating top articles monthly for each topic in Flipboard's database.
  • Built and productionized a listicle classifier for news articles using Python/PyTorch/AWS, integrated into the news feed pipeline.
Software Engineer II at Oracle — Jul 2014 – Aug 2017
  • Interop specialist for ZFS Storage products — qualified OS/protocol/kernel combinations across Linux, Solaris, AIX, Windows, and Mac OS.
  • Built API modules and automated testing for ZFS test infrastructure.
  • Led an agile project simulating high-priority customer-facing failure scenarios.

Education

MSc Computer Science — Simon Fraser University (Big Data & Machine Learning, GPA 3.74)

BEng Computer Science — National Institute of Engineering, India (9.2/10)


Featured Projects


Let's Connect

LinkedIn Email

Pinned Loading

  1. CryptoIntel CryptoIntel Public

    CryptoIntel is a one stop dashboard which gives all the information about cryptocurrencies. All the inquisitive users can get their answers related to cryptocurrencies from cryptointel.

    HTML 2

  2. Analysis-and-Recommendations-on-YELP-Dataset Analysis-and-Recommendations-on-YELP-Dataset Public

    Analysis and Recommendations on YELP Dataset

    Python 1 1

  3. Defence-Mechanisms-Against-One-Pixel-Attack Defence-Mechanisms-Against-One-Pixel-Attack Public

    By understanding the One-Pixel attack and how it affects the neural network, it is clear that the location of the pixel plays a significant role for the image to be perturbed. Performing changes on…

    Jupyter Notebook 2

  4. Natural-Language-Processing Natural-Language-Processing Public

    DeepLearning AI Course - NLP Specialization

    Jupyter Notebook

  5. Predict-Happiness-Source Predict-Happiness-Source Public

    Happiness Source Predictor coding challenge given by HackerEarth

    Python

  6. Semantic-Search-for-speeches-in-Audio Semantic-Search-for-speeches-in-Audio Public

    Semantic search is the ability to search for documents by understanding the overall meaning of the query rather than using simple keyword matches.

    Jupyter Notebook 1 1