Wage column - Alanmg298/Kaggle-API-to-Clean-Data-in-Python GitHub Wiki

This column was quite a challenge.

  • Let's get rid of the text "K" and any currency symbol such as "$" or "€" (if the column mixes currencies, we'll need to standardize the Wage values to a single one)
  • The following code will extract all text and non-numeric characters (symbols) found in the column
  • That is why we'll be transforming it into a string first
  • The print will tell us whether any further transformation is needed, in case there is more than just Euro in the Wage column
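
The steps above can be sketched roughly like this. It's only an illustration: the sample values in `df` are made up (the real dataset isn't shown on this page), and while the column names "Wage" and "numbers_only" come from the text, the exact code used in the notebook may differ.

```python
import pandas as pd

# Hypothetical sample rows, standing in for the real Kaggle data.
df = pd.DataFrame({"Wage": ["€565K", "€220K", "€500", "€0"]})

# Cast to string so the .str accessor works whatever the original dtype was.
wage = df["Wage"].astype(str)

# Drop every digit, join what is left, and keep the distinct characters.
symbols = set("".join(wage.str.replace(r"[0-9]", "", regex=True)))
print(sorted(symbols))  # ['K', '€']

# Keep only the digits in a new column (still stored as strings here).
df["numbers_only"] = wage.str.replace(r"[^0-9]", "", regex=True)
print(df)
```

If the printed set contained "$" or another currency symbol, that would be the signal that the wages need converting to a single currency. Also note that simply dropping "K" leaves those rows expressed in thousands; whether to rescale them is a separate decision not covered here.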

Since there's only Euro (€), we are not performing any further transformation :)

But just to be sure, the code below will perform an extra confirmation that the column "numbers_only" does in fact contain only numbers 😆
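
One possible way to run that confirmation is sketched below. Again the sample values are invented for illustration, and `only_digits` and `offenders` are just hypothetical helper names, not necessarily what the original notebook uses.

```python
import pandas as pd

# Hypothetical cleaned values, as they might look after stripping the symbols.
df = pd.DataFrame({"numbers_only": ["565", "220", "500", "0"]})

# True for each row whose whole value is made of the digits 0-9.
only_digits = df["numbers_only"].astype(str).str.fullmatch(r"[0-9]+")
print(only_digits.all())  # True

# List any rows that failed the check (empty when everything is clean).
offenders = df.loc[~only_digits, "numbers_only"]
print(offenders.tolist())  # []

# Once the check passes, convert to a numeric dtype for later analysis.
df["numbers_only"] = pd.to_numeric(df["numbers_only"])
```

`Series.str.fullmatch` needs pandas 1.1 or newer; on older versions, `str.match(r"^[0-9]+$")` should give the same result.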

Looks like it's running correctly :) And now the data can be manipulated to perform basic statistical analysis.