等级 2 导致所有机架集体中止

Rank 2 caused collective abort of all racks

下面的代码尝试使用 mpi 查找数组的最大数量。但是我不断收到以下错误:

Rank 2 in job 47 caused collective abort of all ranks. Exit status of rank 2 : killed by signal 9

谁能告诉我怎么了?

#include <stdio.h>
#include <stdlib.h>
#include "mpi.h"

int main(int argc , char * argv[])
{
    int myRank , numOfProcesses;
    int source , destination;
    int tag = 0;
    int i = 0, j = 0, k = 0;
    int masterArray[] = {5,6,8,10,12,3,9,-1,3,7};
    int max , globalMax = -100000;
    int flag = 0;

    MPI_Init(&argc, &argv);
    MPI_Status status;
    MPI_Comm_rank(MPI_COMM_WORLD , &myRank);
    MPI_Comm_size(MPI_COMM_WORLD , &numOfProcesses);
    printf("Process : %d \n" , myRank);

    int masterSize = sizeof(masterArray)/sizeof(int);
    //printf("%d \n" , masterSize);
    int slaveSize = masterSize/(numOfProcesses-1);
    //printf("%d \n" , slaveSize);
    int slaveArray[slaveSize];

     if (myRank == 0){

        for (i=1; i<numOfProcesses; i++){
            for (j=0; j<slaveSize; j++){
                slaveArray[j] = masterArray[k];
               // printf("%d \n" , masterArray[k]);
                k++;
            }

           MPI_Send(slaveArray, slaveSize, MPI_INT, i, tag, MPI_COMM_WORLD);
        }


       for (i=1; i<numOfProcesses; i++){
            MPI_Recv(max , 1, MPI_INT, i, tag, MPI_COMM_WORLD, &status);

            if (globalMax < max)
                max = globalMax;
        }
        printf("Global Maximum %d \n" , globalMax);

    }

    else{

        MPI_Recv(slaveArray , slaveSize, MPI_INT, 0, tag, MPI_COMM_WORLD, &status);
        max = slaveArray[0];

        for (i=0; i<slaveSize; i++){
                if (slaveArray[i] > max)
                    max = slaveArray[i];
        }
        printf("Max in %d %d \n" , myRank, max);

         MPI_Send(max , 1, MPI_INT, 0, tag, MPI_COMM_WORLD);

    }

 MPI_Finalize();

    return 0;
}

在 MPI 中发送和接收消息总是通过地址进行的。在以下内容中:

MPI_Recv(max , 1, MPI_INT, i, tag, MPI_COMM_WORLD, &status);

...

MPI_Send(max , 1, MPI_INT, 0, tag, MPI_COMM_WORLD);

您使用的值。您必须添加 & 才能获取地址。

你也应该学会使用合适的集体操作:MPI_Scatter and MPI_Reduce.

顺便说一下,这一行的顺序也是错误的:

max = globalMax;

也请学会倾听你的编译器!任何具有合理设置的合理编译器都会警告您将整数作为地址传递。