hello, i'm

 ██████╗██╗███╗   ██╗████████╗██╗ █████╗     ██╗   
██╔════╝██║████╗  ██║╚══██╔══╝██║██╔══██╗    ██║   
██║     ██║██╔██╗ ██║   ██║   ██║███████║    ██║   
██║     ██║██║╚██╗██║   ██║   ██║██╔══██║    ██║   
╚██████╗██║██║ ╚████║   ██║   ██║██║  ██║    ██║██╗
 ╚═════╝╚═╝╚═╝  ╚═══╝   ╚═╝   ╚═╝╚═╝  ╚═╝    ╚═╝╚═╝

data_science · ml_engineering · ai_agents

Data scientist working at the intersection of applied machine learning, database systems, and AI agents



about

About

I'm a data scientist based in São Paulo, Brazil. My work centres on modelling complex systems as networks, machine learning, and — increasingly — computer vision and retrieval-augmented AI agents.

I build end-to-end, reproducible data pipelines (ingestion → transformation → analysis → interactive apps) and reapply the same architectural patterns across domains: public health, financial derivatives, real estate, and astronomy.

Currently completing a BSc in Data Science alongside specializations in Robotics and applied mathematics. Open to remote roles in ML engineering and applied AI.

locationSão Paulo, BR
focusML · Graphs · Agents
stackPython · DuckDB · NetworkX
degreeBSc Data Science '26
statusopen to remote
projects

Projects

Each project is an end-to-end pipeline with a live demo, dataset, or reproducible code. Click through for the full case study.

sptrans_network_resilience

python·graph_theory·case_study

Graph-theory analysis of São Paulo's transit network on SPTrans GTFS data — 22K+ nodes, 29K+ edges. Finds critical bottlenecks and tests robustness under targeted vs random failure.

NetworkXGeoPandasSciPyCentrality

legislative_sentiment_analysis

python·nlp·case_study

Sentiment analysis of São Paulo city-council speeches using transformer models over a full NLP pipeline — ingestion, cleaning, inference, and visual reporting.

TransformersNLPHuggingFace

astronomia_rag

python·llm/rag·case_study

A retrieval-augmented FAQ system for observational astronomy — modular Mistral-7B + FAISS pipeline over a curated Portuguese corpus, with a full evaluation report.

Mistral-7BFAISSLangChainRAG
technique_matrix

Technique Emphasis Across Projects

Where each project leans, at a glance. Cells are shaded against the per-row maximum — greener = stronger emphasis, pinker = lighter touch — same idea as the autolab detail-score grid.

lighterheavier scale 0.0 → 1.0 · self-rated emphasis
Project Data
Engineering
ML /
Modeling
Graphs /
Networks
NLP LLM /
Agents
Vision /
Perception
Robotics /
Control
Applied
Math
Deploy /
Infra
Viz /
App
Testing /
evals
Networks & Graphs
sp_public_transit
NLP & Language
legislative_nlp
LLM & Agents
astronomia_rag

Values are a self-rated qualitative measure of where each project's effort concentrated — illustrative, not a measured benchmark. The shading engine (data-v → pink↔green) is the reusable part: drop real metrics in on any project page and the colours follow.

experience

Experience & Education

analytics

Analytics Analyst

HandsOn.TV — Mountain View, USA
  • Owned analytics for a content platform across web, Facebook, and YouTube to inform content and distribution strategy.
  • Surfaced content patterns that shaped editorial direction.
data_engineering

Database Developer

E-VAL Tecnologia — São Paulo, Brazil
  • Modelled the database infrastructure behind products for Hospital Israelita Albert Einstein and Central Nacional Unimed (diabetes-risk model).
  • Designed ETL pipelines and documented schemas across the development lifecycle.
2022 — today

BSc in Data Science

UNIVESP
  • Final project: topological analysis of São Paulo's transit-network vulnerability & resilience via graph theory.
2024 — 2025

Specialization · Applied Mathematical Methods

UTFPR
  • Mathematics · Python · Operations Research.
2025 — today

Specialization · Robotics

UFV
  • Computer Vision · ROS2 · Control Systems · Path Planning.
tech_stack

Tech Stack

languages & data

PythonRSQLPolarsDuckDBParquet

ml & ai

NetworkXscikit-learnLangChainOllamaFAISS

apps & viz

StreamlitPlotlyMatplotlibSeaborn

scientific computing

NumPySciPyGeoPandasPandera
contact

Get in touch

Open to remote opportunities in ML engineering and applied AI. The fastest way to reach me is GitHub or LinkedIn.