One important thing I forgot to mention: please don't use CFXHOSTS anymore; it might be removed in the future. The right variable is R_WLM_CFXHOSTLIST.
Best
Marcus
On 4/18/19 7:23 AM, Marcus Wagner wrote:
Hi Marek,
Sorry, this was my fault. We wanted to generalize our environment for future workload management systems (no, nothing is in sight so far ;) ), so I wanted to get rid of CFXHOSTS and create R_WLM_... variables instead. The failure has to do with the way environment variables are injected into the job's environment by SLURM.
You need to echo the variable together with its export, e.g.
    echo "export R_WLM_CFXHOSTLIST=..."
If you then do something like this (which is what I did):
    echo "export CFXHOSTS=$R_WLM_CFXHOSTLIST"
it looks fine at first sight, but in fact it gives you an empty variable CFXHOSTS, because R_WLM_CFXHOSTLIST was never defined in the shell that runs the echo. I forgot that I had only echoed the export and never actually assigned the variable.
This has been fixed now.
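The pitfall described above can be reproduced in any POSIX shell. The following is only a minimal sketch: the function names, the host list value "hostA,hostB", and the use of eval as a stand-in for whatever consumes the echoed lines are all assumptions for illustration, not the actual site scripts.

```shell
#!/bin/sh
# Hypothetical stand-ins for a script whose stdout ("export NAME=value" lines)
# is later turned into the job environment. eval is used here only to
# simulate that consumer; the real mechanism may differ.

emit_broken() {
    # The variable is only printed here, never assigned in this shell ...
    echo "export R_WLM_CFXHOSTLIST=hostA,hostB"
    # ... so $R_WLM_CFXHOSTLIST expands to an empty string at echo time.
    echo "export CFXHOSTS=$R_WLM_CFXHOSTLIST"
}

emit_fixed() {
    # Assign the value locally first, then reuse it in both echo lines.
    hostlist="hostA,hostB"   # hypothetical host list value
    echo "export R_WLM_CFXHOSTLIST=$hostlist"
    echo "export CFXHOSTS=$hostlist"
}

unset R_WLM_CFXHOSTLIST CFXHOSTS
eval "$(emit_broken)"
broken_cfxhosts=$CFXHOSTS

unset R_WLM_CFXHOSTLIST CFXHOSTS
eval "$(emit_fixed)"
fixed_cfxhosts=$CFXHOSTS

printf 'broken: [%s]\n' "$broken_cfxhosts"
printf 'fixed:  [%s]\n' "$fixed_cfxhosts"
```

In the broken variant CFXHOSTS ends up empty, which matches the empty host name in the cfx5solve error; in the fixed variant both variables carry the same value.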
Best Marcus
On 4/17/19 5:11 PM, simon@isf.rwth-aachen.de wrote:
Hello Marcus,
The issue was fixed, thank you very much!
However, today I am getting a similar error again:
--------------------------------------------------------------------------------------------
::::::::::::::
slurm_ansys.sh
::::::::::::::
#!/usr/local_rwth/bin/zsh

### Job name
#SBATCH -J CFX_OpenMPI

### File / path where output will be written, the %J is the job id
#SBATCH -o CFX_OpenMPI.%J

### Request the time you need for execution in minutes
### The format for the parameter is: [hour:]minute,
### that means for 80 minutes you could also use this: 1:20
#SBATCH -t 24:00:00

### Request virtual memory you need for your job in MB
#SBATCH --mem-per-cpu 1G

#SBATCH --nodes=1

### Request the number of compute slots you want to use
#SBATCH -n 12

### Enable Verbose Output
#SBATCH -V

#SBATCH --mail-type=end
#SBATCH --mail-user=sim@isf.rwth-aachen.de
#SBATCH --account=rwth0398

### Change to the work directory
cd $WORK/Ansys_Lichtbogencluster/IIW2019

more slurm_ansys.sh

### Purge default MPI and Compiler to use the CFX MPI implementation
module purge
### load modules and execute
module load TECHNICS
module load ansys

### start non-interactive batch job
cfx5solve -def 2019-04-17_IIW_13500K20V_withDrop.def -par-dist "$CFXHOSTS" -start-method "Intel MPI Distributed Parallel" -ccl R.ccl
###
(OK) Unloading intelmpi 2018.4.274
(OK) Unloading Intel Suite 19.0.1.144
(WW) ATTENTION!!! Unloading this module can result in inoperable software!!!
(OK) Loading TECHNICS environment
(!!) Please notice: Using ansys requires payment.
(!!) If in doubt, please contact your institute's IT-administrator or servicedesk@itc.rwth-aachen.de.
(!!) Run the workbench GUI using runwb2
(OK) Loading ansys 19.3
An error has occurred in cfx5solve:
Errors in CCL processing:
invalid object name """": leading character must be alphabetic or '_'
Warning!
Host name lookup failed for host ""
An error has occurred in cfx5solve:
Unable to find the master host ncm0914.hpc.itc.rwth-aachen.de (ncm0914.hpc.itc.rwth-aachen.de) in the host list: at least one partition must be assigned to the master host.
------------------------------------------------------------------------------
It seems like $CFXHOSTS is an empty string. Or am I mistaken?
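An empty host list like this can be caught before cfx5solve starts. The sketch below assumes a small helper, pick_hostlist, which is hypothetical and not part of the cluster environment: it prefers R_WLM_CFXHOSTLIST, falls back to CFXHOSTS, and fails with a message if both are empty.

```shell
#!/bin/sh
# pick_hostlist is a hypothetical helper for illustration only.
# It prints the first non-empty host list variable, or fails.
pick_hostlist() {
    if [ -n "${R_WLM_CFXHOSTLIST:-}" ]; then
        printf '%s\n' "$R_WLM_CFXHOSTLIST"
    elif [ -n "${CFXHOSTS:-}" ]; then
        printf '%s\n' "$CFXHOSTS"
    else
        echo "error: neither R_WLM_CFXHOSTLIST nor CFXHOSTS is set" >&2
        return 1
    fi
}

# Intended use inside the job script (commented out so the sketch runs
# on machines without ANSYS installed):
# hosts=$(pick_hostlist) || exit 1
# cfx5solve -def 2019-04-17_IIW_13500K20V_withDrop.def -par-dist "$hosts" \
#     -start-method "Intel MPI Distributed Parallel" -ccl R.ccl
```

With such a guard the job would stop with a clear message instead of passing an empty string to -par-dist.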
Best Regards,
Marek
_______________________________________________
claix18-slurm-pilot mailing list -- claix18-slurm-pilot@lists.rwth-aachen.de
To unsubscribe send an email to claix18-slurm-pilot-leave@lists.rwth-aachen.de
--
Marcus Wagner, Dipl.-Inf.
IT Center
Abteilung: Systeme und Betrieb
RWTH Aachen University
Seffenter Weg 23
52074 Aachen
Tel: +49 241 80-24383
Fax: +49 241 80-624383
wagner@itc.rwth-aachen.de
www.itc.rwth-aachen.de