Cintia I. Shinoda — Data Scientist

██████╗██╗███╗ ██╗████████╗██╗ █████╗ ██╗ ██╔════╝██║████╗ ██║╚══██╔══╝██║██╔══██╗ ██║ ██║ ██║██╔██╗ ██║ ██║ ██║███████║ ██║ ██║ ██║██║╚██╗██║ ██║ ██║██╔══██║ ██║ ╚██████╗██║██║ ╚████║ ██║ ██║██║ ██║ ██║██╗ ╚═════╝╚═╝╚═╝ ╚═══╝ ╚═╝ ╚═╝╚═╝ ╚═╝ ╚═╝╚═╝

███████╗██╗ ██╗██╗███╗ ██╗ ██████╗ ██████╗ █████╗ ██╔════╝██║ ██║██║████╗ ██║██╔═══██╗██╔══██╗██╔══██╗ ███████╗███████║██║██╔██╗ ██║██║ ██║██║ ██║███████║ ╚════██║██╔══██║██║██║╚██╗██║██║ ██║██║ ██║██╔══██║ ███████║██║ ██║██║██║ ╚████║╚██████╔╝██████╔╝██║ ██║ ╚══════╝╚═╝ ╚═╝╚═╝╚═╝ ╚═══╝ ╚═════╝ ╚═════╝ ╚═╝ ╚═╝

About

I'm a data scientist based in São Paulo, Brazil. My work centres on modelling complex systems as networks, machine learning, and — increasingly — computer vision and retrieval-augmented AI agents.

I build end-to-end, reproducible data pipelines (ingestion → transformation → analysis → interactive apps) and reapply the same architectural patterns across domains: public health, financial derivatives, real estate, and astronomy.

Currently completing a BSc in Data Science alongside specializations in Robotics and applied mathematics. Open to remote roles in ML engineering and applied AI.

locationSão Paulo, BR

focusML · Graphs · Agents

stackPython · DuckDB · NetworkX

degreeBSc Data Science '26

statusopen to remote

Projects

Each project is an end-to-end pipeline with a live demo, dataset, or reproducible code. Click through for the full case study.

sptrans_network_resilience

python·graph_theory·case_study

Graph-theory analysis of São Paulo's transit network on SPTrans GTFS data — 22K+ nodes, 29K+ edges. Finds critical bottlenecks and tests robustness under targeted vs random failure.

NetworkXGeoPandasSciPyCentrality

> case_study > live_demo > code

legislative_sentiment_analysis

python·nlp·case_study

Sentiment analysis of São Paulo city-council speeches using transformer models over a full NLP pipeline — ingestion, cleaning, inference, and visual reporting.

TransformersNLPHuggingFace

> case_study > code

astronomia_rag

python·llm/rag·case_study

A retrieval-augmented FAQ system for observational astronomy — modular Mistral-7B + FAISS pipeline over a curated Portuguese corpus, with a full evaluation report.

Mistral-7BFAISSLangChainRAG

> case_study > code

Technique Emphasis Across Projects

Where each project leans, at a glance. Cells are shaded against the per-row maximum — greener = stronger emphasis, pinker = lighter touch — same idea as the autolab detail-score grid.

lighterheavier scale 0.0 → 1.0 · self-rated emphasis

Project	Data Engineering	ML / Modeling	Graphs / Networks	NLP	LLM / Agents	Vision / Perception	Robotics / Control	Applied Math	Deploy / Infra	Viz / App	Testing / evals
Networks & Graphs
sp_public_transit
NLP & Language
legislative_nlp
LLM & Agents
astronomia_rag

Values are a self-rated qualitative measure of where each project's effort concentrated — illustrative, not a measured benchmark. The shading engine (data-v → pink↔green) is the reusable part: drop real metrics in on any project page and the colours follow.

Experience & Education

analytics

Analytics Analyst

HandsOn.TV — Mountain View, USA

Owned analytics for a content platform across web, Facebook, and YouTube to inform content and distribution strategy.
Surfaced content patterns that shaped editorial direction.

data_engineering

Database Developer

E-VAL Tecnologia — São Paulo, Brazil

Modelled the database infrastructure behind products for Hospital Israelita Albert Einstein and Central Nacional Unimed (diabetes-risk model).
Designed ETL pipelines and documented schemas across the development lifecycle.

2022 — today

BSc in Data Science

UNIVESP

Final project: topological analysis of São Paulo's transit-network vulnerability & resilience via graph theory.

2024 — 2025

Specialization · Applied Mathematical Methods

UTFPR

Mathematics · Python · Operations Research.

2025 — today

Specialization · Robotics

UFV

Computer Vision · ROS2 · Control Systems · Path Planning.

About

Projects

sptrans_network_resilience

legislative_sentiment_analysis

astronomia_rag

Technique Emphasis Across Projects

Experience & Education

Analytics Analyst

Database Developer

BSc in Data Science

Specialization · Applied Mathematical Methods

Specialization · Robotics

Tech Stack

languages & data

ml & ai

apps & viz

scientific computing

Get in touch