For the purpose of this exercise, I created a simple data set of three types of plants: vegetables, fruits and flowers. I classified text (taken from Wikipedia) based on the three categories. It looked like this:
I loaded my input csv file into LightSIDE and extracted basic features like unigrams and bigrams first. Then I checked different basic features and extracted their feature sets.
I saved all the feature sets for building models later using alternative feature spaces.