πΎ Data Engineer focused on building reliable data platforms, turning raw data into meaningful insights, and designing scalable solutions in the cloud.
I enjoy solving complex problems, learning continuously, and transforming ideas into data-driven products π
- π Degree in Systems Analysis and Development
- π§° Experience designing and implementing data pipelines, ETL/ELT workflows, and analytics architectures
- βοΈ Hands-on with Azure, Databricks, and Apache Spark in production environments
- π Passionate about Python, SQL, and end-to-end automation
- π Interested in data governance, quality, and best practices for scalable data platforms
- π― I believe data should not only describe the past, but also support decisions and drive innovation
| Project | Description | Stack |
|---|---|---|
| ETL Automation Pipeline | End-to-end automated ETL/ELT workflow on Databricks and Airflow, orchestrating large-scale batch jobs with monitoring and logging for reliability. | Python, Airflow, Databricks, Spark |
| Sales Analytics Dashboard | Interactive analytics solution providing real-time sales KPIs, trends, and drill-downs for business stakeholders. | SQL, Power BI , Python, PySpark |
| Azure Data Lake Project | Designed a scalable data lake architecture for analytics and reporting, with structured zones and standardized ingestion patterns. | Azure, Spark, Python |
I share content about Data Engineering, PySpark, and Cloud Technologies on Medium:
π Recent topics include:
- PySpark UDFs and performance optimization
- Working with broadcast variables in distributed environments
- Building efficient, maintainable data pipelines on Databricks
π Read my articles here: medium.com/@luciana.sampaio84
π« Iβm always happy to exchange ideas about data, analytics, and technology:
β¨ βGood data tells a story β great data drives change.β


