Consumer Complaints
Description of the Data
The data set contains information about consumer complaints and the responses from companies.
The downloaded data set has two versions:
complaints.csv and complaints.json
In July 2023, Mark added a transformed version called processed.csv in which each line has 18 fields (see below).
Transformations to the original data source
Kevin originally downloaded this data set in June 2021. He created three transformations of the original data set:
processed.csv
processed.feather
processed.parquet
Kevin removed all commas from the data set. For instance, if a consumer complaint had commas in the text data, he removed them. That way, every line contained 19 fields (a line number and the original 18 fields).
In July 2023, Mark downloaded the updated consumer complaint data. Mark transformed the complaints.csv into a file called processed.csv using R by: removing all commas; replacing each "\\n\\n" with "\\n"; and then replacing "\\n" with " ". In particular, Mark did not add a line number, so Mark’s version of processed.csv has only the original 18 fields per line, and no added line numbers.
Mark did not (yet) make a feather or parquet version of the data set.