Data Projects

Here you can find my main data-related projects. Many of them are grouped in the "Data Engineering" repository on GitHub, but for didactic and better presentation purposes, I believe that the format below is a more elegant way to showcase the projects. As I have availability, I will create the pages, in addition to the pages that can also be checked in the respective GitHub repositories.

anac-project

Django + Django Ninja API Project

This project was the result of a request from a company I worked for, asking us to create a study project using Django + Django Ninja for API development. We needed to assist with maintenance and feature creation in one of their products, and this project served to demonstrate our basic knowledge. It was a very enjoyable experience.

anac-project

ANAC Data Project

This project involved working with public data from ANAC (Brazil’s National Civil Aviation Agency). The project documentation is currently available only in Portuguese, including a video explanation. I used technologies such as Airflow and Spark running in Docker, along with Jupyter Notebooks. I also created dashboards using Power BI. Moreover, I leveraged GPT-4o to accelerate the entire process, showcasing how AI tools can enhance our workflows, assist with challenges, and lead to faster deliveries.

i-hate-tech-interviews

I Hate Tech Interviews!

Calm down, don't be sad about the title if you're a Tech Recruiter! I'll always be willing to take on your technical challenges and, being honest, I enjoy trying them. I just confess it's uncomfortable when it's something like Live Coding, because the nervousness is high and, being honest, we don't know everything by heart, on the contrary, research and mental reflection are the biggest daily tools for tech professionals (shhh! don't tell them I revealed the secret, hehe). Anyway, here are some cases I've done in the past in other selection processes. I hope it helps colleagues who need to go through this (and Tech Recruiters, so you can already get an idea of my competence, if it will be aligned with what you want, hahaha).

dbt-project

dbt Project

I created this personal project to practice basic dbt fundamentals since we started using it at our company.

hive-impala-course

Hive/Impala Course and Project

This was a course with instructor Fernando Amaral, an expert in the field of Data. The focus was on studying Hive and Impala, with the creation of a Data Warehouse in Hive and using HDFS on a Cloudera Red Hat virtual machine.

advanced-sql-server-course

SQL Server Master Class – Advanced SQL

This was a course with the brilliant instructor Travis Cuzick, who brought excellent exercises and theoretical content on intermediate and advanced SQL using Microsoft's SQL Server. The course covered everything from Windows Functions like OVER, PARTITION BY, RANK, DENSE RANK, to CTEs, PIVOT, TEMP TABLES, STORED PROCEDURES, and DYNAMIC SQL.

palmeira-vale

Palmeira do Vale – My First Project!

Is it fantastic? Is it amazing? Is it perfect? No, I highly doubt it. But this was the first project that marked the beginning of my journey in the data world back in 2020. It's a very short and straightforward project about a sports club that decided to have a system for member registration and an analytical system to later understand its data.

ellas-eshop

Ellas E-Shop – The Second Project!

My second project was bolder, and although it was simple, it was a significant challenge at the time. In late 2020 and early 2021, this project was inspired by a real need: my sister's dream of setting up an e-shop for women's ballet flats. Leveraging her business rules and daring to create something closer to real life, I invested months in this more robust project. All the data was created by me, and everything was done manually, without fake Python or random Excel. 😉

budget-thumbail

Budget Management – A Power BI Professional Course

Course taught by instructor André Rosa with a focus on learning to master the main DAX formulas to solve challenges in creating a dashboard in Power BI. The project was to create a budget management dashboard.

python-ibge-thumbail

Automation in Python

This was an automation I created while working for a logistics and maritime transportation client. One of their analysts used to manually retrieve a PDF file every month, read it, and finally enter the data into the dashboard. This automation uses Python and Pandas to transform unstructured data into semi-structured data, which is then further processed in the ETL pipeline (in the real project, it was done using Apache Hop and Python, but what I replicated here for educational purposes was in pure Python).

dio-github-thumbail

DIO's GitHub Certification Formation

This was a course I took to review my knowledge in Git and GitHub. It was a very well-planned course, although basic, that covered everything from Git language, common uses of GitHub, to less explored parts in other basic courses, such as GitHub administration, plans, Codespaces, etc. The training also included 3 project challenges and one code challenge.