Skip to content
View DanielBBrasileiro's full-sized avatar
  • Brasília

Block or report DanielBBrasileiro

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
DanielBBrasileiro/README.md
Typing SVG


About Me

I am a data professional transitioning from Data Analysis to Data Engineering. My background in extracting insights gives me a unique perspective: I build pipelines not just to move data, but to ensure it arrives clean, reliable, and ready for business impact.

Currently, I am focused on mastering the Modern Data Stack (MDS), implementing Data Lakehouses, and adhering to software engineering best practices like CI/CD and containerization.

🔭 Current Focus: Architecting a scalable ETL pipeline for Brazilian Corporate Data (CNPJ) processing terabytes of public information using Spark, Docker, and MinIO.


The "100 Days of Code" Challenge

I have publicly committed to coding every single day to accelerate my transition to Data Engineering.

Current Project Stack Status
CNPJ Analytics Pipeline Python, Docker, MinIO, PostgreSQL, dbt 🚧 In Progress (Building Architecture)

Check my daily progress and code commits in my repositories!


Technical Arsenal

I organize my stack by function to demonstrate architectural understanding.

Core & Compute Infrastructure & Storage Orchestration & Transformation Visualization
Python Docker dbt Power Bi
SQL MinIO Apache Spark Tableau
Git AWS Airflow Matplotlib

Engineering Philosophy

Automation as a Baseline: Repetition is a signal. If I do it twice, I script it. If I do it a third time, I automate it end-to-end and turn it into a maintainable pipeline.

Data Quality by Design: Data quality is not optional. I enforce validation at ingestion using tools like Pydantic and Great Expectations to ensure that every downstream system receives clean, trustworthy data.

Documentation with Purpose: Code explains how things work; documentation explains why. I prioritize clear, human-friendly READMEs that accelerate onboarding and decision-making.


"Talk is cheap. Show me the code." — Linus Torvalds

Popular repositories Loading

  1. DanielBBrasileiro DanielBBrasileiro Public

  2. matplotlib-numpy matplotlib-numpy Public

    Python studies and libraries ;)

    Jupyter Notebook

  3. Python Python Public

    Jupyter Notebook

  4. cnpj-analytics-pipeline cnpj-analytics-pipeline Public

    Python

  5. data-engineering-cheatsheets data-engineering-cheatsheets Public

    Practical cheatsheets for Data Engineers

    HTML

  6. free-programming-books free-programming-books Public

    Forked from EbookFoundation/free-programming-books

    📚 Freely available programming books

    Python