Jovan Stojanović
  • Jovan Stojanović
  • Talks/Conferences(current)
  • Publications
  • Projects
  • Repositories
  • Teaching
  • About me

Inria Soda-HeKa seminar - "dirty-cat, module for encoding categorical data"

May 9, 2022

2022   ·   talk     ·   software  

Abstract:

A presentation on dirty categorical variable and the existing encoding methods used. An overview of why dirty-cat’s new encoding methods improve results and facilitate machine learning.

Pour quelques exemples d’utilisation de dirty-cat voir:

  • https://dirty-cat.github.io/stable/
  • https://github.com/dirty-cat/dirty_cat/tree/main/examples



    Enjoy Reading This Article?

    Here are some more articles you might like to read next:

  • 2nd Welfare & Policy Conference 2025, Bordeaux - "Politicians’ Social Media Discourse Strategies - Twitter and the French 2022 Legislative Elections"
  • Economics PhD meeting 2024, Paris Saclay University - "Twitter, Topics and the French 2022 legislative election"
  • Euroscipy 2023 poster - "skrub, preparing tables for machine learning"
  • JupyterCon 2023 - "Machine learning with dirty tables encoding, joining and deduplicating"
  • PyConFR 2023 - "Apprentissage statistique adapté aux données sales avec dirty-cat"
  • © Copyright 2025 Jovan Stojanović. Powered by Jekyll with al-folio theme. Hosted by GitHub Pages.