homercles337
Diamond Member
- Dec 29, 2004
- 6,340
- 3
- 71
Just to be my usual contentious self:
A recent ISCA paper: Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU
(from Intel)
http://portal.acm.org/citation.cfm?...&dl=GUIDE&CFID=11111111&CFTOKEN=2222222&ret=1
Thanks for that reference--just printed it. In my experience there is a lot of variability. FFTs see about 1.5x speed up, Euclidean distances are up to 7x, and as i said before Jaccard distances are up to 22x speed up. This is max speedup and occurs when the dataset sizes are as big as the card will handle. Which, in my case is 4096^2. In reality, the datasets i deal with are often in the millions. I should also note that that my code for Euclidean and Jaccard distances is highly optimized.