I would start with a simple approach: extract all (customerID, itemID)
tuples from the orders table and use them as your input data. How many
of those do you have? The size of the data will dictate whether you
need a distributed approach to recommendation mining or not.
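
To make the extraction step concrete, here is a minimal sketch in Python using the standard sqlite3 module. The table layout (an "orders" table with customerID and itemID columns) and the helper names extract_pairs and build_demo_db are assumptions for illustration only; adjust them to your actual schema and database driver.

```python
import sqlite3


def extract_pairs(conn):
    """Return the distinct (customerID, itemID) pairs, sorted.

    Assumes a hypothetical 'orders' table with customerID and itemID columns.
    """
    cur = conn.execute(
        "SELECT DISTINCT customerID, itemID FROM orders "
        "ORDER BY customerID, itemID"
    )
    return cur.fetchall()


def build_demo_db():
    """Create a tiny in-memory database so the sketch can be run as-is."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE orders ("
        "orderID INTEGER PRIMARY KEY, customerID INTEGER, itemID INTEGER)"
    )
    # Customer 1 bought item 10 twice; DISTINCT collapses the repeat.
    conn.executemany(
        "INSERT INTO orders (customerID, itemID) VALUES (?, ?)",
        [(1, 10), (1, 11), (2, 10), (1, 10)],
    )
    conn.commit()
    return conn


if __name__ == "__main__":
    conn = build_demo_db()
    pairs = extract_pairs(conn)
    # The pair count is the number to look at when deciding whether a
    # single machine is enough.
    print(len(pairs), "distinct (customerID, itemID) pairs")
```

The number of pairs this returns on your real data is the figure asked about above. Whether to keep repeat purchases (drop the DISTINCT) depends on whether you want purchase counts as a preference signal.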


