These exercises are mainly to get your laptop (or possibly Google Colab) environment set-up.
You are free to use python or R to complete any exercise of this workshop.
Load into your favorite data frame (R, pandas, polars, ...) the data ../notebooks/data/big-claim-events.csv, and for each of the four sub-populations defined by
Load into some data frame ../notebooks/data/aggregate-claim.csv. Identify at least one potential data quality issue.
Hint: There are some convenient data profiling tools for both python (e.g. fg-data-profiling, formerly ydata-profiling, formerly-er pandas-profiling) and R (e.g. DataExplorer).