Dear Marek, sorry for the inconvenience. Yes, the variable has been changed, but only because we saw, that the mechanism we used before, was in many times wrong. So I wrote a parser for the job details, to find out, how many cores on which host had been scheduled. The parser worked perfect, but in the TaskProlog script for SLURM I set a wrong environment variable. This led to an empty list in CFXHOSTS :/ It has been fixed now. Best Marcus On 4/8/19 2:22 PM, simon@isf.rwth-aachen.de wrote:
Dear all,
i attach my jobsubmission file with the error:
:::::::::::::: slurm_ansys.sh :::::::::::::: #!/usr/local_rwth/bin/zsh
### Job name #SBATCH -J CFX_OpenMPI
### File / path where output will be written, the %J is the job id #SBATCH -o CFX_OpenMPI.%J
### Request the time you need for execution in minutes ### The format for the parameter is: [hour:]minute, ### that means for 80 minutes you could also use this: 1:20 #SBATCH -t 24:00:00
### Request virtual memory you need for your job in MB #SBATCH --mem-per-cpu 2G
#SBATCH --nodes=1
### Request the number of compute slots you want to use #SBATCH --cpus-per-task=8
### Enable Verbose Output #SBATCH -V
#SBATCH --mail-type=end #SBATCH --mail-user=sim@isf.rwth-aachen.de #SBATCH --account=rwth0398
### Change to the work directory cd $WORK/Droplet CFX/
more slurm_ansys.sh
### Purge default MPI and Compiler to use the CFX MPI implementation module purge
### load modules and execute module load TECHNICS module load ansys
### start non-interactive batch job cfx5solve -def droplets_scaled2_sim.def -par-dist "$CFXHOSTS" -start-method "Intel MPI Distributed Parallel" -ccl R.ccl
### (OK) Unloading intelmpi 2018.4.274 (!!) neither LSF nor SLURM environment present, environment broken? (OK) Unloading Intel Suite 19.0.1.144 (WW) ATTENTION!!! Unloading this module can result in inoperable software!!! (OK) Loading TECHNICS environment (!!) Please notice: Using ansys requires payment. (!!) If in doubt, please contact your institute's IT-administrator or servicedesk@itc.rwth-aachen.de. (!!) Run the workbench GUI using runwb2 (OK) Loading ansys 19.3 Warning!
The parallel start method specified in the input file: IBM MPI Local Parallel, is not a default parallel start method for this platform and has therefore been preserved. A different start method may be specified using the -start-method option.
An error has occurred in cfx5solve:
Unable to find the master host ncm0067.hpc.itc.rwthaachen.de (ncm0067.hpc.itc.rwth-aachen.de) in the host list: at least one partition must be assigned to the master host.
I dont understand why im receiving this error. Did somehthing change in $CFXHOSTS??
Best Regards, Marek _______________________________________________ claix18-slurm-pilot mailing list -- claix18-slurm-pilot@lists.rwth-aachen.de To unsubscribe send an email to claix18-slurm-pilot-leave@lists.rwth-aachen.de
-- Marcus Wagner, Dipl.-Inf. IT Center Abteilung: Systeme und Betrieb RWTH Aachen University Seffenter Weg 23 52074 Aachen Tel: +49 241 80-24383 Fax: +49 241 80-624383 wagner@itc.rwth-aachen.de www.itc.rwth-aachen.de