[Rcpp-devel] Assertion error in ud_ep.c, when running with MPI

Serguei Sokol serguei.sokol at gmail.com
Tue Jun 14 09:38:38 CEST 2022


Hi,

This issue would probably be better reported to the UCX project, since 
the failing assertion is in UCX's ud_ep.c rather than in Rcpp or RInside:
https://github.com/openucx/ucx/issues

Best,
Serguei.

On 14/06/2022 at 07:24, Maddegedara Lalith wrote:
> Hello,
> 
> I want to use RInside in my C++-based MPI application to do time series 
> forecasting using the auto.arima function from R's forecast package. The 
> RInside instance in each MPI rank is expected to do an independent 
> calculation (e.g. a time series forecast).
> 
> With one MPI rank, it always completes without producing any error. 
> However, with more than one MPI rank, it produces the following error. 
> The number of ranks that fail with this error varies from run to run. 
> On rare occasions, all the ranks complete the execution successfully. 
> Further, I found that even your example "rinside_mpi_sample4.cpp" 
> produces the same error.
> 
> I am using the Intel MPI library (version 2021.1). I tried 
> compiling with icpc and g++. Both produced the same error.
> Could you please help me solve this problem?
> 
> With best regards
> Lal
> 
> [ibis:14878:0:14992]       ud_ep.c:565  Assertion `ep->dest_ep_id == 
> UCT_UD_EP_NULL_ID || ep->dest_ep_id == ctl->conn_rep.src_ep_id' failed
> 
> ==== backtrace (tid:  14994) ====
>   0 0x000000000004d455 ucs_debug_print_backtrace()  ???:0
>   1 0x0000000000042b5f uct_ud_ep_process_rx()  ???:0
>   2 0x00000000000471cd uct_ud_mlx5_ep_t_delete()  ???:0
>   3 0x000000000003ebdf uct_ud_iface_release_desc()  ???:0
>   4 0x0000000000040436 ucs_cpu_get_memcpy_bw()  ???:0
>   5 0x000000000004050b ucs_cpu_get_memcpy_bw()  ???:0
>   6 0x0000000000041343 ucs_async_dispatch_handlers()  ???:0
>   7 0x0000000000041488 ucs_async_dispatch_timerq()  ???:0
>   8 0x0000000000043c34 ucs_async_pipe_drain()  ???:0
>   9 0x0000000000007ea5 start_thread()  pthread_create.c:0
>  10 0x00000000000fe96d __clone()  ???:0
> =================================
> 
> _______________________________________________
> Rcpp-devel mailing list
> Rcpp-devel at lists.r-forge.r-project.org
> https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/rcpp-devel


