"Finding Similar Items with Amazon Elastic MapReduce, Python, and Hadoop Streaming" - worth a look; it sounds quite interesting.