Skip to content

ACRC Improvements #12

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 7 commits into
base: master
Choose a base branch
from
Open

Conversation

harrywaugh
Copy link

@harrywaugh harrywaugh commented Jun 17, 2019

Summary of Changes:

  • Added some restrict and const qualifiers so that the compiler is better at auto-vectorizing the propagate loop.
  • Edited the Makefile so that pb_mpi can be build with Intel compiler, this can be done using make INTEL=1. (Faster than GNU currently). To compile on Intel, some missing return statements were also inserted to preserve the correctness of the output trace file.
  • Removed explicit MPI_Barriers, which then idenitifes MPI_Bcast as a major bottleneck with more cores.
  • Added benchmark report.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant