Re: [firedrake] PDESoft 2014 slides

12 Jul 2014

      Those look like interesting results.

Do we have any idea why we are slow on CUDA on the RHS?

Do we have any indication of actual speed compared with peak flops or
bandwidth?

Regards,

David

On Friday, July 11, 2014, Rathgeber, Florian <f.rathgeber10@imperial.ac.uk>
wrote:
...
I have now added performance results for advection assembly (matrix +
RHS). We can still claim (performance) portability to some degree across
sequential, OpenMP and CUDA.
On 10/07/14 11:23, David Ham wrote:
...
I'm concerned that there are no performance results at all. Do we not
even have CPU results?
On Wednesday, July 9, 2014, Rathgeber, Florian
<f.rathgeber10@imperial.ac.uk <javascript:;> <mailto:
f.rathgeber10@imperial.ac.uk <javascript:;>>> wrote:
Draft slides for my 15min PDESoft talk on PyOP2 next week are at
    http://kynan.github.io/pdesoft2014
Any comments and suggestions much appreciated.
Florian
-- 
Dr David Ham
Departments of Mathematics and Computing
Imperial College London

http://www.imperial.ac.uk/people/david.ham