[Rcpp-devel] Assertion error in ud_ep.c, when running with MPI

Maddegedara Lalith lalithwij at gmail.com
Tue Jun 14 07:24:19 CEST 2022


Hello,

I want to use RInside in my C++ based MPI application to do time series
forecasting using the auto.arima library of R. The RInside instance in each
MPI rank is expected to do an independent calculation (e.g. time series
forecast).

With one MPI rank, it always completes without producing any error.
However, with more than 1 mpi ranks, it produces the following error.
Depending on the run, different numbers of mpi ranks produce the same
error. On rare occasions, all the ranks successfully complete the
execution. Further, I found that even your example
"rinside_mpi_sample4.cpp" produces the same error.

I am using the Intel MPI library (version 2021.1). I tried compiling with
icpc and g++. Both produced the same error.

Could you please help me to solve this problem.

With best regards
Lal

[ibis:14878:0:14992]       ud_ep.c:565  Assertion `ep->dest_ep_id ==
UCT_UD_EP_NULL_ID || ep->dest_ep_id == ctl->conn_rep.src_ep_id' failed

==== backtrace (tid:  14994) ====
 0 0x000000000004d455 ucs_debug_print_backtrace()  ???:0
 1 0x0000000000042b5f uct_ud_ep_process_rx()  ???:0
 2 0x00000000000471cd uct_ud_mlx5_ep_t_delete()  ???:0
 3 0x000000000003ebdf uct_ud_iface_release_desc()  ???:0
 4 0x0000000000040436 ucs_cpu_get_memcpy_bw()  ???:0
 5 0x000000000004050b ucs_cpu_get_memcpy_bw()  ???:0
 6 0x0000000000041343 ucs_async_dispatch_handlers()  ???:0
 7 0x0000000000041488 ucs_async_dispatch_timerq()  ???:0
 8 0x0000000000043c34 ucs_async_pipe_drain()  ???:0
 9 0x0000000000007ea5 start_thread()  pthread_create.c:0
10 0x00000000000fe96d __clone()  ???:0
=================================
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.r-forge.r-project.org/pipermail/rcpp-devel/attachments/20220614/2c8ec3f3/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: ARIMA_MPI.cpp
Type: text/x-c++src
Size: 3497 bytes
Desc: not available
URL: <http://lists.r-forge.r-project.org/pipermail/rcpp-devel/attachments/20220614/2c8ec3f3/attachment.cpp>


More information about the Rcpp-devel mailing list