Netflix interview question

How do you approach working in ambiguity with large data sets?