Arjun-M-101/README.md

Hello, I'm Arjun

Aspiring Data Engineer


🙋‍♂️ About Me

  • 👨‍💻 I’m currently working as a Database Administrator, building strong foundations in data management and reliability
  • 🌱 Transitioning into Data Engineering by designing end‑to‑end batch and streaming pipelines
  • 🛠️ Passionate about building scalable, reliable data pipelines that turn raw data into actionable insights
  • 👯 Open to collaborating on Data Engineering & Open Source projects
  • 👨‍💻 Explore my work here: My Portfolio
  • 📫 Reach me at arjunmpec101@gmail.com
  • ⚡ Fun fact: I debug pipelines the way I play games — with persistence and strategy

🚀 Tech Stack

pandas Spark Kafka Airflow MySQL Streamlit HTML CSS JavaScript Bootstrap Postman


📂 Featured Projects

  • 🗄️ YouTube Data Engineering Pipeline (Batch Processing)
    End‑to‑end batch ETL pipeline implementing the Medallion Architecture (Bronze → Silver → Gold).

    • Orchestrated with Apache Airflow (3.x)
    • Transformations with Apache Spark
    • Data lake layers on local filesystem (Bronze/Silver/Gold)
    • Serving layer in Postgres (analytics‑ready tables)
    • Interactive Streamlit + Altair dashboard via SQLAlchemy
    • Ingests raw YouTube trending data (CSV/JSON), cleans, enriches, and computes derived metrics for BI
  • 📊 StockPulse (Streaming Pipeline)
    Real‑time streaming pipeline simulating stock ticks and processing them end‑to‑end.

    • Ingestion via Kafka producer publishing to stock_ticks topic
    • Processing with Spark Structured Streaming (schema enforcement + derived metrics)
    • Dual sinks: Postgres (serving layer) + Parquet (partitioned by index/date)
    • Interactive Streamlit + Altair dashboard for real‑time visualization
    • Fully orchestrated with Apache Airflow
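As a rough illustration of the "schema enforcement + derived metrics" step and the index/date partitioning described for StockPulse, here is a minimal plain-Python sketch. It is not the project's Spark Structured Streaming code. The field names (`index`, `price`, `prev_close`, `ts`), the `pct_change` metric, and the helper names are assumptions made for this example.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional

# Hypothetical tick schema: the actual StockPulse field names are not listed in this README.
REQUIRED_FIELDS = {
    "index": str,
    "price": (int, float),
    "prev_close": (int, float),
    "ts": str,  # ISO 8601 timestamp, e.g. "2024-01-15T09:30:00+00:00"
}


@dataclass(frozen=True)
class EnrichedTick:
    index: str
    price: float
    pct_change: float  # derived metric: percent move versus previous close
    date: str          # calendar date taken from the tick timestamp

    @property
    def partition_path(self) -> str:
        # Hive-style layout, one directory per index and date.
        return f"index={self.index}/date={self.date}"


def enforce_schema(raw: dict) -> Optional[dict]:
    """Return the record if every required field has the expected type, else None."""
    for name, expected in REQUIRED_FIELDS.items():
        value = raw.get(name)
        # bool is a subclass of int, so reject it explicitly for numeric fields.
        if isinstance(value, bool) or not isinstance(value, expected):
            return None
    return raw


def enrich(raw: dict) -> Optional[EnrichedTick]:
    """Validate one tick and compute its derived fields; malformed ticks yield None."""
    rec = enforce_schema(raw)
    if rec is None or rec["prev_close"] == 0:
        return None
    try:
        ts = datetime.fromisoformat(rec["ts"])
    except ValueError:
        return None
    pct = (rec["price"] - rec["prev_close"]) / rec["prev_close"] * 100
    return EnrichedTick(
        index=rec["index"],
        price=float(rec["price"]),
        pct_change=round(pct, 4),
        date=ts.date().isoformat(),
    )


if __name__ == "__main__":
    tick = {"index": "NIFTY50", "price": 101.0, "prev_close": 100.0,
            "ts": "2024-01-15T09:30:00+00:00"}
    enriched = enrich(tick)
    print(enriched, enriched.partition_path)
```

In a streaming job, the same validate-then-derive logic would typically be expressed as a declared schema plus column expressions, with malformed records dropped or routed elsewhere instead of returned as `None`.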

📊 My GitHub Stats

Arjun's streak

Arjun M's GitHub Stats
Arjun M's Top Languages

Note: Top Languages only measures which languages my public code consists of; it doesn't reflect experience or skill level.


Arjun's Graph

🌐 Connect with Me


❤ Views and Followers

GitHub Badge

Pinned

  1. Weather-App (Public)

    A simple weather app built with React and the OpenWeather API

    CSS