ADPs in structure factor calculation, continued by apeck12 · Pull Request #36 · tjlane/thor

apeck12 · 2017-12-06T01:48:20Z

Thanks for the feedback! Benchmark (1000 q-vectors, 1000 atoms, 1000 rotations) for the original cpuscatter function prior to including ADPs:

$ time ./cputest
CPP OUTPUT:
0.000000
0.000000

real 0m11.691s
user 0m11.682s
sys 0m0.008s

For the original ADP implementation, it clocked in at:

$ time ./cputest
CPP OUTPUT:
0.000000
0.000000

real 0m18.371s
user 0m18.366s
sys 0m0.004s

I revised this function so that Debye-Waller factors are pre-computed in the first nested loop (which loops over q-vectors) rather than in the third nested loop. However, the improvement in speed is marginal:

$ time ./cputest
CPP OUTPUT:
0.000000
0.000000

real 0m16.515s
user 0m16.511s
sys 0m0.003s

I only revised the CPU code, as a comment in cpp_scatter.cu indicates that caching pre-computed Debye-Waller factors could pose memory problems for the GPU version.

Ariana Peck and others added 6 commits December 4, 2017 21:05

modified adp calculation for slight speed improvement on CPU

43d1e24

modified adp calculation, slight speed-up on CPU

f8010a2

Revert to 3b83ba2

bbbc5fb

moved adp calculation, slight speed-up on CPU

ae7ef6d

Merge branch 'diffuse' of https://github.com/apeck12/thor into diffuse

a9a5aa2

Merge branch 'apeck12-diffuse' into diffuse

211c0a6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ADPs in structure factor calculation, continued#36

ADPs in structure factor calculation, continued#36
apeck12 wants to merge 6 commits intotjlane:apeck12-diffusefrom
apeck12:diffuse

apeck12 commented Dec 6, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

apeck12 commented Dec 6, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant