- $ ls /opt/amdgpupro-opencl/lib
- libamdocl12cl64.so libamdocl64.so libdrm_amdpro.so.1 libdrm_amdpro.so.1.0.0 libdrm.so.2 libdrm.so.2.4.0
- $ ls /opt/intel/opencl/
- clbltfne9_img_cbk.o clbltfnh8_img_cbk.o clbltfnl9_img_cbk.o clbltfnshared.rtl kernel-4.7.patch libcommon_clang.so libigdbcl.so libigdmcl.so libIntelOpenCL.so libOpenCL.so libtbbmalloc.so libtbb.so.2 __ocl_svml_e9.so OpenCL.pc
- clbltfne9_img_cbk.rtl clbltfnh8_img_cbk.rtl clbltfnl9_img_cbk.rtl igdclbif.bin libclang_compiler.so libcpu_device.so libigdccl.so libigdrcl.so libmd.so libOpenCL.so.1 libtbbmalloc.so.2 LICENSE __ocl_svml_h8.so
- clbltfne9.rtl clbltfnh8.rtl clbltfnl9.rtl include libcl_logger.so libiga64.so libigdfcl.so libintelocl.so libOclCpuBackEnd.so libtask_executor.so libtbb.so NOTICES __ocl_svml_l9.so
- $ ls /usr/lib/nvidia-375/
- alt_ld.so.conf libEGL.so.1 libGLESv2_nvidia.so.2 libGL.so.1.0.0 libnvcuvid.so libnvidia-compiler.so libnvidia-encode.so.1 libnvidia-glsi.so.375.26 libnvidia-ml.so.375.26 libnvidia-wfb.so.375.26
- bin libGLdispatch.so.0 libGLESv2_nvidia.so.375.26 libGLX_indirect.so.0 libnvcuvid.so.1 libnvidia-compiler.so.1 libnvidia-encode.so.375.26 libnvidia-ifr.so libnvidia-opencl.so.1 libOpenGL.so
- ld.so.conf libGLESv1_CM_nvidia.so.1 libGLESv2.so libGLX_nvidia.so.0 libnvcuvid.so.375.26 libnvidia-compiler.so.375.26 libnvidia-fatbinaryloader.so.375.26 libnvidia-ifr.so.1 libnvidia-opencl.so.375.26 libOpenGL.so.0
- libEGL_nvidia.so.0 libGLESv1_CM_nvidia.so.375.26 libGLESv2.so.2 libGLX_nvidia.so.375.26 libnvidia-cfg.so libnvidia-eglcore.so.375.26 libnvidia-fbc.so.1 libnvidia-ifr.so.375.26 libnvidia-ptxjitcompiler.so.375.26 tls
- libEGL_nvidia.so.375.26 libGLESv1_CM.so libGL.so libGLX.so libnvidia-cfg.so.1 libnvidia-egl-wayland.so.375.26 libnvidia-fbc.so.375.26 libnvidia-ml.so libnvidia-tls.so.375.26 vdpau
- libEGL.so libGLESv1_CM.so.1 libGL.so.1 libGLX.so.0 libnvidia-cfg.so.375.26 libnvidia-encode.so libnvidia-glcore.so.375.26 libnvidia-ml.so.1 libnvidia-wfb.so.1 xorg
- $ cat .local/bin/amdprocl
- #! /bin/sh
- export LD_LIBRARY_PATH="${LD_LIBRARY_PATH}:/opt/amdgpupro-opencl/lib/"
- exec "$@"
- #EOF
- $ cat .local/bin/intelcl
- #! /bin/sh
- export LD_LIBRARY_PATH="${LD_LIBRARY_PATH}:/opt/intel/opencl"
- exec "$@"
- #EOF
- $ cat .local/bin/nvidiacl
- #! /bin/sh
- export LD_LIBRARY_PATH="${LD_LIBRARY_PATH}:/usr/lib/nvidia-375/"
- exec "$@"
- #EOF
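The three wrappers differ only in the directory they append to LD_LIBRARY_PATH. When the variable starts out unset, appending as written leaves a leading colon, and the dynamic linker reads an empty component as the current directory. Below is a minimal sketch of a shared helper that avoids this. The name `append_path` is hypothetical and not part of the original scripts.

```shell
#!/bin/sh
# Hypothetical helper (not in the original scripts): append a directory to a
# colon-separated search path without creating an empty component.
append_path() {
    # $1 = current list (may be empty), $2 = directory to append
    if [ -z "$1" ]; then
        printf '%s\n' "$2"
    else
        printf '%s:%s\n' "$1" "$2"
    fi
}
```

A wrapper could then set `LD_LIBRARY_PATH=$(append_path "$LD_LIBRARY_PATH" /opt/intel/opencl)`, export it, and finish with `exec "$@"`.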
- $ amdprocl intelcl nvidiacl clinfo | grep 'Platform Name\|Device Name' | tail -n +4
- Platform Name Intel(R) OpenCL
- Device Name Intel(R) HD Graphics
- Device Name Intel(R) Core(TM) i7-4810MQ CPU @ 2.80GHz
- Platform Name NVIDIA CUDA
- Device Name Quadro K1100M
- Platform Name AMD Accelerated Parallel Processing
- Device Name Intel(R) Core(TM) i7-4810MQ CPU @ 2.80GHz
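In the pipeline above, `tail -n +4` skips the first three matches, which are the "Platform Name" lines of the platform summary at the top of the clinfo output. As a rough sketch of an alternative, the filtered lines could be folded into one "platform: device" pair per device with awk. The function name `pair_devices` is an assumption made for illustration, and the sketch assumes the field labels appear exactly as in the listing.

```shell
#!/bin/sh
# Hypothetical helper: read "Platform Name" / "Device Name" lines on stdin
# and print one "platform: device" line for every device seen.
pair_devices() {
    awk '
        /Platform Name/ {
            # Strip the label and remember the platform for later devices.
            sub(/^.*Platform Name[ \t]+/, "")
            platform = $0
            next
        }
        /Device Name/ {
            # Strip the label and emit the pair.
            sub(/^.*Device Name[ \t]+/, "")
            print platform ": " $0
        }
    '
}
```

It would be fed the same filtered stream, for example `clinfo | grep 'Platform Name\|Device Name' | tail -n +4 | pair_devices`.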
- $ amdprocl intelcl nvidiacl clinfo
- Number of platforms 3
- Platform Name Intel(R) OpenCL
- Platform Vendor Intel(R) Corporation
- Platform Version OpenCL 1.2
- Platform Profile FULL_PROFILE
- Platform Extensions cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_depth_images cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_spir
- Platform Extensions function suffix INTEL
- Platform Name NVIDIA CUDA
- Platform Vendor NVIDIA Corporation
- Platform Version OpenCL 1.2 CUDA 8.0.0
- Platform Profile FULL_PROFILE
- Platform Extensions cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_copy_opts cl_khr_gl_event
- Platform Extensions function suffix NV
- Platform Name AMD Accelerated Parallel Processing
- Platform Vendor Advanced Micro Devices, Inc.
- Platform Version OpenCL 2.0 AMD-APP (2236.5)
- Platform Profile FULL_PROFILE
- Platform Extensions cl_khr_icd cl_amd_event_callback cl_amd_offline_devices
- Platform Extensions function suffix AMD
- Platform Name Intel(R) OpenCL
- Number of devices 2
- Device Name Intel(R) HD Graphics
- Device Vendor Intel(R) Corporation
- Device Vendor ID 0x8086
- Device Version OpenCL 1.2
- Driver Version r3.1.58620
- Device OpenCL C Version OpenCL C 1.2
- Device Type GPU
- Device Profile FULL_PROFILE
- Max compute units 20
- Max clock frequency 0MHz
- Device Partition (core)
- Max number of sub-devices 0
- Supported partition types by <unknown> (0x7F6700000000)
- Max work item dimensions 3
- Max work item sizes 256x256x256
- Max work group size 256
- Preferred work group size multiple 32
- Preferred / native vector sizes
- char 16 / 16
- short 8 / 8
- int 4 / 4
- long 1 / 1
- half 0 / 0 (n/a)
- float 1 / 1
- double 0 / 0 (n/a)
- Half-precision Floating-point support (n/a)
- Single-precision Floating-point support (core)
- Denormals No
- Infinity and NANs Yes
- Round to nearest Yes
- Round to zero Yes
- Round to infinity Yes
- IEEE754-2008 fused multiply-add Yes
- Support is emulated in software No
- Correctly-rounded divide and sqrt operations Yes
- Double-precision Floating-point support (n/a)
- Address bits 64, Little-Endian
- Global memory size 1709598311 (1.592GiB)
- Error Correction support No
- Max memory allocation 854799155 (815.2MiB)
- Unified memory for Host and Device Yes
- Minimum alignment for any data type 128 bytes
- Alignment of base address 1024 bits (128 bytes)
- Global Memory cache type Read/Write
- Global Memory cache size 524288
- Global Memory cache line 64 bytes
- Image support Yes
- Max number of samplers per kernel 16
- Max size for 1D images from buffer 53424947 pixels
- Max 1D or 2D image array size 2048 images
- Max 2D image size 16384x16384 pixels
- Max 3D image size 2048x2048x2048 pixels
- Max number of read image args 128
- Max number of write image args 128
- Local memory type Local
- Local memory size 65536 (64KiB)
- Max constant buffer size 854799155 (815.2MiB)
- Max number of constant args 8
- Max size of kernel argument 1024
- Queue properties
- Out-of-order execution No
- Profiling Yes
- Prefer user sync for interop Yes
- Profiling timer resolution 80ns
- Execution capabilities
- Run OpenCL kernels Yes
- Run native kernels No
- SPIR versions 1.2
- printf() buffer size 4194304 (4MiB)
- Built-in kernels block_motion_estimate_intel;block_advanced_motion_estimate_check_intel;block_advanced_motion_estimate_bidirectional_check_intel
- Motion Estimation accelerator version (Intel) 2
- Device Available Yes
- Compiler Available Yes
- Linker Available Yes
- Device Extensions cl_intel_accelerator cl_intel_advanced_motion_estimation cl_intel_driver_diagnostics cl_intel_motion_estimation cl_intel_packed_yuv cl_intel_required_subgroup_size cl_intel_subgroups cl_intel_va_api_media_sharing cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_depth_images cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_spir
- Device Name Intel(R) Core(TM) i7-4810MQ CPU @ 2.80GHz
- Device Vendor Intel(R) Corporation
- Device Vendor ID 0x8086
- Device Version OpenCL 1.2 (Build 330)
- Driver Version 1.2.0.330
- Device OpenCL C Version OpenCL C 1.2
- Device Type CPU
- Device Profile FULL_PROFILE
- Max compute units 8
- Max clock frequency 2800MHz
- Device Partition (core)
- Max number of sub-devices 8
- Supported partition types by counts, equally, by names (Intel)
- Max work item dimensions 3
- Max work item sizes 8192x8192x8192
- Max work group size 8192
- Preferred work group size multiple 128
- Preferred / native vector sizes
- char 1 / 32
- short 1 / 16
- int 1 / 8
- long 1 / 4
- half 0 / 0 (n/a)
- float 1 / 8
- double 1 / 4 (cl_khr_fp64)
- Half-precision Floating-point support (n/a)
- Single-precision Floating-point support (core)
- Denormals Yes
- Infinity and NANs Yes
- Round to nearest Yes
- Round to zero No
- Round to infinity No
- IEEE754-2008 fused multiply-add No
- Support is emulated in software No
- Correctly-rounded divide and sqrt operations No
- Double-precision Floating-point support (cl_khr_fp64)
- Denormals Yes
- Infinity and NANs Yes
- Round to nearest Yes
- Round to zero Yes
- Round to infinity Yes
- IEEE754-2008 fused multiply-add Yes
- Support is emulated in software No
- Correctly-rounded divide and sqrt operations No
- Address bits 64, Little-Endian
- Global memory size 16426655744 (15.3GiB)
- Error Correction support No
- Max memory allocation 4106663936 (3.825GiB)
- Unified memory for Host and Device Yes
- Minimum alignment for any data type 128 bytes
- Alignment of base address 1024 bits (128 bytes)
- Global Memory cache type Read/Write
- Global Memory cache size 262144
- Global Memory cache line 64 bytes
- Image support Yes
- Max number of samplers per kernel 480
- Max size for 1D images from buffer 256666496 pixels
- Max 1D or 2D image array size 2048 images
- Max 2D image size 16384x16384 pixels
- Max 3D image size 2048x2048x2048 pixels
- Max number of read image args 480
- Max number of write image args 480
- Local memory type Global
- Local memory size 32768 (32KiB)
- Max constant buffer size 131072 (128KiB)
- Max number of constant args 480
- Max size of kernel argument 3840 (3.75KiB)
- Queue properties
- Out-of-order execution Yes
- Profiling Yes
- Local thread execution (Intel) Yes
- Prefer user sync for interop No
- Profiling timer resolution 1ns
- Execution capabilities
- Run OpenCL kernels Yes
- Run native kernels Yes
- SPIR versions 1.2
- printf() buffer size 1048576 (1024KiB)
- Built-in kernels
- Device Available Yes
- Compiler Available Yes
- Linker Available Yes
- Device Extensions cl_khr_icd cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_depth_images cl_khr_3d_image_writes cl_intel_exec_by_local_thread cl_khr_spir cl_khr_fp64
- Platform Name NVIDIA CUDA
- Number of devices 1
- Device Name Quadro K1100M
- Device Vendor NVIDIA Corporation
- Device Vendor ID 0x10de
- Device Version OpenCL 1.2 CUDA
- Driver Version 375.26
- Device OpenCL C Version OpenCL C 1.2
- Device Type GPU
- Device Profile FULL_PROFILE
- Device Topology (NV) PCI-E, 01:00.0
- Max compute units 2
- Max clock frequency 705MHz
- Compute Capability (NV) 3.0
- Device Partition (core)
- Max number of sub-devices 1
- Supported partition types None
- Max work item dimensions 3
- Max work item sizes 1024x1024x64
- Max work group size 1024
- Preferred work group size multiple 32
- Warp size (NV) 32
- Preferred / native vector sizes
- char 1 / 1
- short 1 / 1
- int 1 / 1
- long 1 / 1
- half 0 / 0 (n/a)
- float 1 / 1
- double 1 / 1 (cl_khr_fp64)
- Half-precision Floating-point support (n/a)
- Single-precision Floating-point support (core)
- Denormals Yes
- Infinity and NANs Yes
- Round to nearest Yes
- Round to zero Yes
- Round to infinity Yes
- IEEE754-2008 fused multiply-add Yes
- Support is emulated in software No
- Correctly-rounded divide and sqrt operations Yes
- Double-precision Floating-point support (cl_khr_fp64)
- Denormals Yes
- Infinity and NANs Yes
- Round to nearest Yes
- Round to zero Yes
- Round to infinity Yes
- IEEE754-2008 fused multiply-add Yes
- Support is emulated in software No
- Correctly-rounded divide and sqrt operations No
- Address bits 64, Little-Endian
- Global memory size 2095251456 (1.951GiB)
- Error Correction support No
- Max memory allocation 523812864 (499.5MiB)
- Unified memory for Host and Device No
- Integrated memory (NV) No
- Minimum alignment for any data type 128 bytes
- Alignment of base address 4096 bits (512 bytes)
- Global Memory cache type Read/Write
- Global Memory cache size 32768
- Global Memory cache line 128 bytes
- Image support Yes
- Max number of samplers per kernel 32
- Max size for 1D images from buffer 134217728 pixels
- Max 1D or 2D image array size 2048 images
- Max 2D image size 16384x16384 pixels
- Max 3D image size 4096x4096x4096 pixels
- Max number of read image args 256
- Max number of write image args 16
- Local memory type Local
- Local memory size 49152 (48KiB)
- Registers per block (NV) 65536
- Max constant buffer size 65536 (64KiB)
- Max number of constant args 9
- Max size of kernel argument 4352 (4.25KiB)
- Queue properties
- Out-of-order execution Yes
- Profiling Yes
- Prefer user sync for interop No
- Profiling timer resolution 1000ns
- Execution capabilities
- Run OpenCL kernels Yes
- Run native kernels No
- Kernel execution timeout (NV) No
- Concurrent copy and kernel execution (NV) Yes
- Number of async copy engines 1
- printf() buffer size 1048576 (1024KiB)
- Built-in kernels
- Device Available Yes
- Compiler Available Yes
- Linker Available Yes
- Device Extensions cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_copy_opts cl_khr_gl_event
- Platform Name AMD Accelerated Parallel Processing
- Number of devices 1
- Device Name Intel(R) Core(TM) i7-4810MQ CPU @ 2.80GHz
- Device Vendor GenuineIntel
- Device Vendor ID 0x1002
- Device Version OpenCL 1.2 AMD-APP (2236.5)
- Driver Version 2236.5 (sse2,avx)
- Device OpenCL C Version OpenCL C 1.2
- Device Type CPU
- Device Profile FULL_PROFILE
- Device Board Name (AMD)
- Device Topology (AMD) (n/a)
- Max compute units 8
- Max clock frequency 3700MHz
- Device Partition (core, cl_ext_device_fission)
- Max number of sub-devices 8
- Supported partition types equally, by counts, by affinity domain
- Supported affinity domains L3 cache, L2 cache, L1 cache, next partitionable
- Supported partition types (ext) equally, by counts, by affinity domain
- Supported affinity domains (ext) L3 cache, L2 cache, L1 cache, next fissionable
- Max work item dimensions 3
- Max work item sizes 1024x1024x1024
- Max work group size 1024
- Preferred work group size multiple 1
- Preferred / native vector sizes
- char 16 / 16
- short 8 / 8
- int 4 / 4
- long 2 / 2
- half 4 / 4 (n/a)
- float 8 / 8
- double 4 / 4 (cl_khr_fp64)
- Half-precision Floating-point support (n/a)
- Single-precision Floating-point support (core)
- Denormals Yes
- Infinity and NANs Yes
- Round to nearest Yes
- Round to zero Yes
- Round to infinity Yes
- IEEE754-2008 fused multiply-add Yes
- Support is emulated in software No
- Correctly-rounded divide and sqrt operations Yes
- Double-precision Floating-point support (cl_khr_fp64)
- Denormals Yes
- Infinity and NANs Yes
- Round to nearest Yes
- Round to zero Yes
- Round to infinity Yes
- IEEE754-2008 fused multiply-add Yes
- Support is emulated in software No
- Correctly-rounded divide and sqrt operations No
- Address bits 64, Little-Endian
- Global memory size 16426655744 (15.3GiB)
- Error Correction support No
- Max memory allocation 4106663936 (3.825GiB)
- Unified memory for Host and Device Yes
- Minimum alignment for any data type 128 bytes
- Alignment of base address 1024 bits (128 bytes)
- Global Memory cache type Read/Write
- Global Memory cache size 32768
- Global Memory cache line 64 bytes
- Image support Yes
- Max number of samplers per kernel 16
- Max size for 1D images from buffer 65536 pixels
- Max 1D or 2D image array size 2048 images
- Max 2D image size 8192x8192 pixels
- Max 3D image size 2048x2048x2048 pixels
- Max number of read image args 128
- Max number of write image args 64
- Local memory type Global
- Local memory size 32768 (32KiB)
- Max constant buffer size 65536 (64KiB)
- Max number of constant args 8
- Max size of kernel argument 4096 (4KiB)
- Queue properties
- Out-of-order execution No
- Profiling Yes
- Prefer user sync for interop Yes
- Profiling timer resolution 1ns
- Profiling timer offset since Epoch (AMD) 1497203026521170561ns (Sun Jun 11 19:43:46 2017)
- Execution capabilities
- Run OpenCL kernels Yes
- Run native kernels Yes
- SPIR versions 1.2
- printf() buffer size 65536 (64KiB)
- Built-in kernels
- Device Available Yes
- Compiler Available Yes
- Linker Available Yes
- Device Extensions cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_device_fission cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_spir cl_khr_gl_event
- NULL platform behavior
- clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...) No platform
- clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...) No platform
- clCreateContext(NULL, ...) [default] No platform
- clCreateContext(NULL, ...) [other] Success [INTEL]
- clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU) No platform
- clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU) No platform
- clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR) No platform
- clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM) No platform
- clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL) No platform