Deploy non-trivial Serialization for function pointers

In the 2023-01-31 Serialization meeting, we resolved to change the Serialization status and semantics of pointer-to-function types:

Current Semantics (through release 2022.9.0)

Pointer-to-function types are TriviallySerializable (by virtue of being TriviallyCopyable). However, due to ASLR and randomization of shared library load addresses, pointer-to-function values are generally NOT meaningfully portable across address spaces (i.e. a raw function pointer address constructed by one process cannot reliably be used by another to invoke the function).
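To illustrate why the current classification arises, here is a minimal sketch in standard C++ (no UPC++ calls). The names add_one, unary_fn and byte_copy are hypothetical and exist only for this example. It shows that a function pointer type satisfies the TriviallyCopyable trait, and that a byte-wise copy preserves the value only in a trivial sense: within the same address space.

```cpp
#include <cassert>
#include <cstring>
#include <type_traits>

// Hypothetical example function; any free function would do.
int add_one(int x) { return x + 1; }

using unary_fn = int (*)(int);

// Pointer-to-function types are trivially copyable in standard C++,
// which is the property the current semantics rely on.
static_assert(std::is_trivially_copyable<unary_fn>::value,
              "function pointers are trivially copyable");

// Copies the raw bytes of a function pointer through a buffer, roughly
// what a trivial (byte-wise) serialization would do. The resulting bytes
// are only meaningful in the address space that produced them.
unary_fn byte_copy(unary_fn f) {
  unsigned char buf[sizeof(unary_fn)];
  std::memcpy(buf, &f, sizeof f);
  unary_fn out;
  std::memcpy(&out, buf, sizeof out);
  assert(out == f);  // holds only because source and target are the same process
  return out;
}
```

Calling byte_copy(&add_one) in one process returns a usable pointer. Shipping the same bytes to a process with a different load address would not, which is the problem described above.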

Proposed Semantics

Pointer-to-function types remain Serializable, but become non-TriviallySerializable. The UPC++ library defines the serialization for all such types, with an opaque implementation ensuring that serializing a valid pointer-to-function value on one process and subsequently deserializing it at another will result in a pointer-to-function value referencing "the same function" in the memory space of the target process. As a result, valid pointer-to-function values can be meaningfully transmitted across address spaces via RPC and reliably used for function invocation at another process.

The pointer-to-function translation described above was already being applied to pointer-to-function arguments passed as the func callable argument in rpc(), rpc_ff() and as_rpc(). This work extends the scope of that mechanism to include Serialization of all pointer-to-function values, regardless of where they appear in RPC arguments.
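The UPC++ implementation of this translation is opaque, so the following is only a sketch of one possible approach, not the library's mechanism. It assumes every process runs the same executable image, so that the distance between two functions in that image is identical everywhere even when the load address differs. It does not handle functions in separately loaded shared libraries. The names anchor, square, encode and decode are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical anchor: a function present in every process's copy of
// the executable. Its absolute address may differ between processes.
void anchor() {}

// Hypothetical function whose pointer we want to transmit.
int square(int x) { return x * x; }

using fn_t = int (*)(int);

// Sender side: express the pointer as an offset from the local anchor.
// Unsigned arithmetic wraps, so functions below the anchor also work.
std::uintptr_t encode(fn_t f) {
  return reinterpret_cast<std::uintptr_t>(f) -
         reinterpret_cast<std::uintptr_t>(&anchor);
}

// Receiver side: rebuild a pointer from the offset and the receiver's
// own anchor address (ASSUMES an identical image layout on both sides).
fn_t decode(std::uintptr_t offset) {
  fn_t f = reinterpret_cast<fn_t>(
      reinterpret_cast<std::uintptr_t>(&anchor) + offset);
  assert(f != nullptr);
  return f;
}
```

Within a single process, decode(encode(&square)) yields &square again. The point of the offset form is that the encoded value no longer depends on where the image happens to be loaded.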

Breaking changes:

1. Pointer-to-function types are no longer TriviallySerializable.
2. As a consequence of 1, user types containing non-static pointer-to-function fields may cease to be TriviallySerializable (and lacking serialization declarations, possibly also cease to be Serializable). Such types may need to deploy serialization declarations such as UPCXX_SERIALIZED_FIELDS(...) to remain Serializable.
3. As a consequence of 1 and 2, objects having or containing a pointer-to-function type may no longer be communicated using RMA (rput*(), rget*()) or non-experimental data collectives (broadcast(), reduce_{one,all}()).

Preserving legacy use cases

There are (currently hypothetical?) obscure yet valid use cases where it might make sense to transmit the raw bits of a pointer-to-function value (via trivial serialization), without using those transmitted values for function invocation on a different process (which would fail in general). Several potential workarounds exist to preserve such use cases, such as reinterpreting the raw bits into a TriviallySerializable type of sufficient size (e.g. uintptr_t), or embedding the pointer-to-function object in a struct S and specializing is_trivially_serializable<S> so that its value is true.
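The first workaround can be sketched in standard C++ as follows. This is an illustration under assumptions, not a prescribed pattern: callback_t, record, to_bits, from_bits and noop are hypothetical names, and the sketch assumes a platform where std::uintptr_t is at least as wide as a function pointer (checked at compile time). Since record holds only integer fields, it remains trivially copyable.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>
#include <type_traits>

using callback_t = void (*)(int);

// ASSUMPTION: the integer type is wide enough to hold the pointer bits.
static_assert(sizeof(std::uintptr_t) >= sizeof(callback_t),
              "uintptr_t too small for a function pointer");

// Hypothetical user type: stores the pointer's raw bits, not the pointer.
struct record {
  int id;
  std::uintptr_t callback_bits;
};
static_assert(std::is_trivially_copyable<record>::value,
              "record contains only trivially copyable fields");

// Copy the pointer's bytes into an integer of sufficient size.
std::uintptr_t to_bits(callback_t f) {
  std::uintptr_t bits = 0;
  std::memcpy(&bits, &f, sizeof f);
  return bits;
}

// Recover the pointer. Only valid for invocation in the process that
// originally produced the bits.
callback_t from_bits(std::uintptr_t bits) {
  callback_t f;
  std::memcpy(&f, &bits, sizeof f);
  return f;
}

// Hypothetical callback used to exercise the round trip.
void noop(int) {}
```

A record built this way can travel through any byte-wise transport and be converted back with from_bits once it returns to the originating process.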

Work assignments

@Colin MacLean will implement this as a stand-alone Impl PR.
@Amir Kamil will pursue a corresponding Spec PR.
