Portfolio

Jonhnatta Augusto Data Engineer

Professional with over 9 years of experience in Networks, Infrastructure and Information Technology. My career began at the intersection between I.T. and Digital Marketing, where I used advanced techniques data analysis to optimize campaigns and maximize return on investment. I'm looking for constantly developing professional and personal, always looking for new opportunities to apply my technical skills and positively impact projects.

Featured Projects

Projects that demonstrate my technical skills and problem-solving ability.

Real-time Data Processing with AWS Lambda and Kafka

Real-time Data Processing with AWS Lambda and Kafka

Developed a serverless webhook using AWS Lambda, Python, and Flask to capture and stream real-time data to a Kafka cluster. The data was then consumed by a real-time application for continuous processing.

Kafka
Python
Flask
Docker
AWS Lambda
Argument Retrieval Using the Retrieve and Generate (RAG) Technique

Argument Retrieval Using the Retrieve and Generate (RAG) Technique

This project implements a system for argument retrieval using the 'Retrieve and Generate' (RAG) technique. The code can extract information from .docx documents and generate answers based on user queries. The goal is to streamline information retrieval, enabling users to obtain answers quickly and efficiently.

Python
Openai
Langchain
ChromaDB
Image Description Generation with Computer Vision and AI

Image Description Generation with Computer Vision and AI

This project is a web application developed with Streamlit that uses OpenAI's GPT-4o-mini model to automatically generate image descriptions.

Python
Openai
Streamlit
Image Generation with DALL-E and Streamlit

Image Generation with DALL-E and Streamlit

This project features an interactive web application developed with Streamlit, utilizing OpenAI's DALL-E model to generate images from user-provided textual descriptions. The user-friendly interface enables users to interact with the model and view the generated images in real-time.

Python
Openai
Streamlit
DALL-E
ETL Pipeline for Processing Multiple JSON Files with Output in Parquet or CSV

ETL Pipeline for Processing Multiple JSON Files with Output in Parquet or CSV

This project features an ETL pipeline developed in Python to process multiple JSON files. By leveraging Pandas for data manipulation and transformation, and Pandera for data validation and quality assurance, the pipeline enables users to select the desired output format—either Parquet or CSV—upon completion of the processing.

Python
Pandas
Pandera

Technical Skills

Technologies and tools I'm proficient with

Python
JavaScript
TypeScript
SQL
Node.js
HTML
AWS
Docker
Apache Spark
Git
Airflow
PostgreSQL
MongoDB

Get in Touch

Do you have a project in mind or want to discuss opportunities and technology? I'd love to hear from you.

Contact Information

Feel free to reach out through any of these channels

Email

jonhnatta.augusto@gmail.com

GitHub

github.com/jonhnatta

LinkedIn

linkedin.com/in/jonhnata

I am currently open to full-time projects and opportunities.

    Ï