Skip to content
View thalicsouza's full-sized avatar

Block or report thalicsouza

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
thalicsouza/README.md

Hello humans 👋

I'm Thalita Souza (she/her)

Data Scientist based in Panamá 🇵🇦 with 6+ ys of experience. I'm currently working as Media Data Scientist @P&G.

I work with AI/ML, statistical experimentation, NLP, data pipelines and I occasionally combine data with my love for CS 🎮.

🔭 Currently Learning: Statistical inference and Non-parametric tests.


🧠 What I work on

  • Experimentation & Statistics — A/B test frameworks with rigorous statistical significance testing
  • Machine Learning — Churn prediction, sales forecasting, marketing segmentation, sentiment analysis
  • NLP — Sentiment analysis (thesis project @ ICMC-USP)
  • Data Dashboards — Real-time data pipelines and visualizations (CS2 match stats, scraping)
  • Exploratory Analysis — Notebooks covering a wide range of domains

🛠️ Tech Stack

Python SQL Pandas NumPy Scikit-learn Databricks GCP


📌 Featured Projects

Project Description
poke-gen Generates brand new fictional Pokémon by learning patterns from the real ones
ab-tests-experimentation A/B test framework with statistical significance
sentiment-analysis NLP sentiment analysis for my undergraduate thesis (ICMC-USP)
cs2-dash Dashboard for CS2 data
thalis-public-projects Churn prediction, NLP brand sentiment analysys, etc.
previsao_vendas Time series sales forecasting
marketing_segmentation Customer segmentation case study

🌍 Languages

  • 🇧🇷 Portuguese — Native
  • 🇺🇸 English — Fluent
  • 🇪🇸 Spanish — Fluent

📫 Let's connect

Email LinkedIn GitHub

Pinned Loading

  1. poke-gen poke-gen Public

    Pokémon generator system

    Jupyter Notebook

  2. ab-tests-experimentation ab-tests-experimentation Public

    Repo dedicated to design tests workflow to evaluate a various range of results with statistical significance.

    HTML

  3. thalis-public-projects thalis-public-projects Public

    Repo created to store some of the public projects I've been working on.

    Jupyter Notebook

  4. cs2-dash cs2-dash Public

    Repo created to build a dashboard that collects and store data from recent CS2 matches results.

    Python

  5. sentiment-analysis sentiment-analysis Public

    Repositório do projeto de análise de sentimentos desenvolvido para o Trabalho de conclusão de curso apresentado ao Centro de Ciências Matemáticas Aplicadas à Indústria do Instituto de Ciências Mate…

    Jupyter Notebook