use one column: replace each category with the frequency that category appears - e.g. if red appears 3/11 times, replace all instance of red with 3/11