Run Regent on a distributed system #69

@crl123

Good afternoon,
I am running Regent on a cluster of 9 nodes with the following command:
mpirun -np 9 -ppn 1 ./TaskBench/task-bench/regent/main.shard14 -steps 10 -type fft -kernel compute_bound -iter 1000000
It aborts with the following assertion failure:
main.shard14: core.cc:588: void TaskGraph::execute_point(long int, long int, char*, size_t, const char**, const size_t*, size_t, char*, size_t) const: Assertion `input[i].second == dep' failed.

= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= EXIT CODE: 6
= CLEANING UP REMAINING PROCESSES
= YOU CAN IGNORE THE BELOW CLEANUP MESSAGES

APPLICATION TERMINATED WITH THE EXIT STRING: Aborted (signal 6)
Sometimes it fails with a different assertion instead:
main.shard14: core.cc:565: void TaskGraph::execute_point(long int, long int, char*, size_t, const char**, const size_t*, size_t, char*, size_t) const: Assertion `offset <= point && point < offset+width' failed.

= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= EXIT CODE: 6
= CLEANING UP REMAINING PROCESSES
= YOU CAN IGNORE THE BELOW CLEANUP MESSAGES

APPLICATION TERMINATED WITH THE EXIT STRING: Aborted (signal 6)
The same failures occur with -type tree, but not with -type stencil_1d.
I compile Regent as follows:
DEFAULT_FEATURES=0 USE_REGENT=1 ./get_deps.sh
export CXX=mpicxx
export CC=mpicc
./build_all.sh
Thank you in advance for your help,
