Unexpected result given by custom operator for buffer object

This issue is reproduced from the question on stack overflow.

When I tried to implement a customer operator that returns a numpy buffer with the smallest second item, it appears reduce gives the correct result, but Reduce does not.

Here is the working code with reduce

import numpy as np
from mpi4py import MPI

def find_min(a1:np.ndarray, a2:np.ndarray, datatype:MPI.Datatype) -> np.ndarray:
    """ Get the one with the smaller 2nd term."""
    print(a1, a2)
    if a1[1] > a2[1]:
        a1 = a2
    return a1

comm = MPI.COMM_WORLD
rank = comm.rank
RS = np.random.RandomState(rank)
a  = np.array([rank, RS.random()], dtype=np.float64)
print(rank, a)

FIND_MIN = MPI.Op.Create(find_min, commute=True)
amin = comm.reduce(a, op=FIND_MIN, root=0)
if rank == 0:
    print(amin)

and its output

0 [0.        0.5488135]
1 [1.        0.417022]
2 [2.        0.4359949]
3 [3.        0.5507979]

# 4 comparisons are conducted
[0.        0.5488135] [1.        0.417022]
[1.        0.417022]  [2.        0.4359949]
[2.        0.4359949] [3.        0.5507979]
[1.        0.417022]  [3.        0.5507979]

[1.       0.417022]  # <- Correct result

The testing code with Reduce is as follows

from typing import Tuple
import numpy as np
from mpi4py import MPI

def find_min(a1:MPI.memory, a2:MPI.memory, datatype:MPI.Datatype) -> MPI.memory:
    """ Get the one with the smaller 2nd term. """
    dim = (2,)          # Hard-coded
    dtype = np.float64  # Hard-coded

    a1p = to_ndarray(a1, dtype, dim)
    a2p = to_ndarray(a2, dtype, dim)
    print(a1p, a2p)

    if a1p[1] < a2p[1]:
        return a1
    return a2

def to_ndarray(a:MPI.memory, dtype:type, dim:Tuple[int,...]) -> np.ndarray:
    """ Convert a mpi4py.MPI.memory object to a numpy ndarray. """
    buf = np.array(a, dtype='B', copy=True)
    return np.ndarray(buffer=buf, dtype=dtype, shape=dim)

comm = MPI.COMM_WORLD
rank = comm.rank
RS = np.random.RandomState(rank)
a  = np.array([rank, RS.random()], dtype=np.float64)  # dim is (2,)
print(rank, a)

FIND_MIN = MPI.Op.Create(find_min, commute=True)
amin = np.zeros(2)
comm.Reduce(a, amin, op=FIND_MIN, root=0)
if rank == 0:
    print(amin)

and it gives incorrect output as

0  [0.       0.5488135]
1  [1.       0.417022]
2  [2.       0.4359949]
3  [3.       0.5507979]

# 3 comparisons are conducted
[0.        0.5488135] [3.        0.5507979]
[1.        0.417022]  [3.        0.5507979]
[2.        0.4359949] [3.        0.5507979]

[3.        0.5507979]  # <- Wrong! Should be [1.       0.417022]

The comparisons made by the code with Reduce are incomplete. My tests show that either returning a MPI.memory object or a numpy.ndarray does not improve the result. The codes are tested with openmpi-4.0.3 and mpi4py-3.0.3.

Comments (7)