Larger data set analysis requires different solutions than smaller data set analysis. That might seem like a simple argument to make, but it is very important to understand where the tipping point occurs. A long time ago in a faraway basement, I learned that my dataset had expanded beyond the confines of a single computer. I needed something better. I needed a solution that was scalable and inexpensive. That is when I started studying the Apache project Hadoop.