In Intel platform and Intel SDK compiler clBuildProgram returns CL_BUILD_PROGRAM_FAILURE clGetProgramBuildInfo shows status as 0. But the AMD and NVIDIA platforms will not produce any error for the same code when building with their SDK and compiler.
The program build log is shown below
OPENCL ERROR : CL_BUILD_PROGRAM_FAILURE
Build failed; error=-11, status=0, programLog:
Compilation started
Compilation done
Linking started
Linking done
Kernel <Kernel1> was successfully vectorized
The available platforms and devices are
1. FULL_PROFILE - OpenCL 1.2 - Intel(R) OpenCL - Intel(R) Corporation
1. Intel(R) Core(TM) i5-4670 CPU # 3.40GHz - OpenCL 1.2 (Build 76413)
2. Intel(R) HD Graphics 4600 - OpenCL 1.2
Operating System is Windows 7 64 bit.
I found the problem. It is because of
#pragma OPENCL EXTENSION cl_khr_fp64: enable
if I comment this and made the double to float then clBuildProgram will return success. But by enabling this gives the output.
Related
I need to use uMat and user subroutines in Abaqus.
I installed Abaqus 2020, Visual Studio 2019, Intel oneApi Base Toolkit and Intel oneApi HPC Toolkit (in this order). After successfully linking the Fortran Compiler (Intel Fortran Compiler 2021.4) with VS19 (setting all the path variables and editing the abaqus2020.bat and the abaqus_v6.env) I started Abaqus Command (as admin) and used the command abaqus verify -user .. it PASSED right away.
My main problem is: if I plug in "abaqus info=system" everything is fine except the C++ Compiler.
C++ Compiler: Unable to locate or determine the version of a C++ compiler on this
system. If a C++ compiler is installed on this system, please load vcvars64.bat
file before running Abaqus
I tried:
different versions of VS
different OS
different Processor
calling the vcvars64.bat in abaqus2020.bat
Installing third party c++ compilers (MinGW)
My current setup:
Processor: AMD Ryzen 5 3600
RAM: 32 GB DDR4 3200
Graphics: MSI NVidea Geforce GTX 1660 Ti
OS: Windows 11
Linker Version: Microsoft Incremental Linker Version 14.29.30137.0
Fortran Compiler: Intel Fortran Compiler 2021.4 MPI MS-MPI 9.0.12497.11
Error Message
I try build OpenSSl for Intel Xeon Phi coprocessor.
So far as I understand this problem reduced to building OpenSSL for
"linux-generic64" as target (this document).
I have Windows10(x64) and VC2015.
So I use next string for configure:
/perl ./Configure --prefix=D:\openssl\bin shared linux-generic64/
nmake
But "nmake" command lead to failure. Here is a screen capture of the log:
Please, help me build OpenSSL for Intel Xeon Phi.
Best regards.
I am trying to install CUDA on my Mac Pro (15-inch, Mid 2009 with GPU GeForce 9400M).
I have installed from https://developer.nvidia.com/cuda-downloads the toolkit.
nvcc --version returns: ... Cuda compilation tools, release 7.5, V7.5.19
I have installed directly the driver from the package, then from http://www.nvidia.com/object/mac-driver-archive.html. Then from System Preferences > CUDA Preferences, I can see CUDA Driver Version: 7.5.25 (though the update 6.5.51 is proposed (?)).
I can compile a sample (0_Simple/asyncAPI). When I launch it, I get:
[./asyncAPI] - Starting...
CUDA error at ../../common/inc/helper_cuda.h:1111
code=35(cudaErrorInsufficientDriver)
"cudaGetDeviceCount(&device_count)"
Why this error??
The CUDA 7 release cycle removed support for compute capability 1.x devices on all platforms. This includes your Geforce 9400M.
The last version with support of those devices was CUDA 6.5. You will need to work out what XCode version will work with that CUDA toolkit and your OS version and install that instead.
[This answer assembled from comments as a community wiki entry to get this question off the unanswered queue for the CUDA tag].
I'm building PCL library master in Windows 8.1 using CMAKE:
All other modules are successfully built except the gpu/cuda modules!
Here is the error log
Observation:
-ccbin $(VCInstallDir)bin -> Environment Variable not set successfully. Due to that this error is Generated: '$' is not recognized as an internal or external command, operable program or batch file? I'm i right? What else could be the problem?
Note that only pcl_gpu_containers module was successfully built.
Can someone please help me fix this?
Version Details:
Microsoft Visual Studio Verison: 11 (VS Prof 2012)
cuda toolkit: 7.5
boost version: boost-1_57
eigen: 3.3
VTK Version: 6.2
PC Info:
OS Name Microsoft Windows 8.1 Pro N
Version 6.3.9600 Build 9600
System Type x64-based PC
Processor AMD FX(tm)-9590 Eight-Core Processor, 4700 Mhz, 4 Core(s), 8 Logical Processor(s)
Installed Physical Memory (RAM) 8.00 GB
Name NVIDIA GeForce GT 610
Adapter Type GeForce GT 610, NVIDIA compatible
Adapter RAM (2,147,483,648) bytes
Name NVIDIA GeForce GT 730
Adapter Type GeForce GT 730, NVIDIA compatible
Adapter RAM (2,147,483,648) bytes
Here is my CMakeCache.txt
IIRC, that issue was related to a missing environment variable setting.
On my system, this setting was missing after installing CUDA as admin, then working as non-admin user.
After fixing this, now the variables are set as follows:
CUDA_PATH =
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v7.5
and
PATH =
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v7.5\bin;
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v7.5\libnvvp;
...
my code finally compiles fine on my arm cluster.
now i want to run it.
it does not run on the arm itself, as there is no screen attached
1 OpenCL Platforms found
Platform 0: (EMBEDDED_PROFILE OpenCL 1.1 ) Vivante Corporation Vivante OpenCL Platform
1 OpenCL devices found for this platform
Device 0: Vivante Corporation Vivante OpenCL Device
Initializing GLUT...
freeglut (./prognonmpi): failed to open display ''
when I access the cluster with ssh -Y name i get the following erro message
1 OpenCL Platforms found
Platform 0: (EMBEDDED_PROFILE OpenCL 1.1 ) Vivante Corporation Vivante OpenCL Platform
1 OpenCL devices found for this platform
Device 0: Vivante Corporation Vivante OpenCL Device
Initializing GLUT...
init 160 x 100
Loading extensions: Missing GL version
Error: failed to get minimal extensions for demo
This sample requires:
OpenGL version 1.5
GL_ARB_vertex_buffer_object
GL_ARB_pixel_buffer_object
glxinfo glxgears and so on are running fine and show on my screen, when run on the cluster
which package is missing to get the program running?