Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

reduce_over_sub_group.cpp failing on native_cpu #16415

Open
lbushi25 opened this issue Dec 18, 2024 · 0 comments
Open

reduce_over_sub_group.cpp failing on native_cpu #16415

lbushi25 opened this issue Dec 18, 2024 · 0 comments
Labels
bug Something isn't working confirmed native-cpu

Comments

@lbushi25
Copy link
Contributor

Describe the bug

The performance test SYCL :: PerformanceTests/Reduction/reduce_over_sub_group.cpp is failing on native_cpu. Upon resolution, please remove XFAIL from test source.

To reproduce

FAIL: SYCL :: PerformanceTests/Reduction/reduce_over_sub_group.cpp (8 of 8)
******************** TEST 'SYCL :: PerformanceTests/Reduction/reduce_over_sub_group.cpp' FAILED ********************
Exit Code: 1

Command Output (stdout):
--
# RUN: at line 1
/__w/llvm/llvm/toolchain/bin//clang++  -Werror  -fsycl -fsycl-targets=spir64,native_cpu  /__w/llvm/llvm/llvm/sycl/test-e2e/PerformanceTests/Reduction/reduce_over_sub_group.cpp -o /__w/llvm/llvm/build-e2e/PerformanceTests/Reduction/Output/reduce_over_sub_group.cpp.tmp.out
# executed command: /__w/llvm/llvm/toolchain/bin//clang++ -Werror -fsycl -fsycl-targets=spir64,native_cpu /__w/llvm/llvm/llvm/sycl/test-e2e/PerformanceTests/Reduction/reduce_over_sub_group.cpp -o /__w/llvm/llvm/build-e2e/PerformanceTests/Reduction/Output/reduce_over_sub_group.cpp.tmp.out
# .---command stderr------------
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIhEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0xbb): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0xd6): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testItEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x208): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x226): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIjEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x358): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x373): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testImEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x4a8): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x4c3): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIaEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x5fb): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x616): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIsEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x748): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x766): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIiEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x898): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x8b3): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIlEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x9e8): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0xa03): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIN4sycl3_V16detail9half_impl4halfEEvvENKUlRNS1_7handlerEE_clES6_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0xb46): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0xb6b): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIfEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0xe67): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0xe83): undefined reference to `__mux_sub_group_broadcast_f32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIdEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0xfe7): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x1003): undefined reference to `__mux_sub_group_broadcast_f64'
# | clang++: error: linker command failed with exit code 1 (use -v to see invocation)
# `-----------------------------
# error: command failed with exit status: 1

Environment

  • OS: Any
  • Target device and vendor: native_cpu
  • DPC++ version: da317d0
  • Dependencies version:
- 2024-12-17T21:51:33.5779673Z �[36;1msycl-ls --verbose�[0m
2024-12-17T21:51:33.5780436Z shell: sh -e {0}
2024-12-17T21:51:33.5780864Z env:
2024-12-17T21:51:33.5781299Z   LIT_FILTER: PerformanceTests/
2024-12-17T21:51:33.5781958Z   CMAKE_PREFIX_PATH: /runtimes/oneapi-tbb/env/..
2024-12-17T21:51:33.5782676Z   CPATH: /runtimes/oneapi-tbb/env/../include
2024-12-17T21:51:33.5783693Z   LD_LIBRARY_PATH: /__w/llvm/llvm/toolchain/lib/:/runtimes/oneapi-tbb/env/../lib/intel64/gcc4.8
2024-12-17T21:51:33.5784819Z   LIBRARY_PATH: /runtimes/oneapi-tbb/env/../lib/intel64/gcc4.8
2024-12-17T21:51:33.5785683Z   PKG_CONFIG_PATH: /runtimes/oneapi-tbb/env/../lib/pkgconfig
2024-12-17T21:51:33.5786445Z   TBBROOT: /runtimes/oneapi-tbb/env/..
2024-12-17T21:51:33.5787440Z   PATH: /__w/llvm/llvm/toolchain/bin/:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin
2024-12-17T21:51:33.5788417Z ##[endgroup]
2024-12-17T21:51:33.6363228Z [level_zero:gpu][level_zero:0] Intel(R) oneAPI Unified Runtime over Level-Zero, Intel(R) Iris(R) Xe Graphics 12.0.0 [1.6.31294.120000]
2024-12-17T21:51:33.6363940Z [opencl:gpu][opencl:0] Intel(R) OpenCL Graphics, Intel(R) Iris(R) Xe Graphics OpenCL 3.0 NEO  [24.39.31294.12]
2024-12-17T21:51:33.6364412Z [opencl:cpu][opencl:1] Intel(R) OpenCL, 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80GHz OpenCL 3.0 (Build 0) [2024.18.10.0.08_160000]
2024-12-17T21:51:33.6364945Z [opencl:fpga][opencl:2] Intel(R) FPGA Emulation Platform for OpenCL(TM), Intel(R) FPGA Emulation Device OpenCL 1.2  [2024.18.10.0.08_160000]
2024-12-17T21:51:33.6365391Z [native_cpu:cpu][native_cpu:0] SYCL_NATIVE_CPU, SYCL Native CPU 0.1 [0.0.0]
2024-12-17T21:51:33.6365584Z 
2024-12-17T21:51:33.6366425Z Platforms: 5
2024-12-17T21:51:33.6366566Z Platform [#1]:
2024-12-17T21:51:33.6366703Z     Version  : 1.6
2024-12-17T21:51:33.6366924Z     Name     : Intel(R) oneAPI Unified Runtime over Level-Zero
2024-12-17T21:51:33.6367155Z     Vendor   : Intel(R) Corporation
2024-12-17T21:51:33.6367338Z     Devices  : 1
2024-12-17T21:51:33.6367477Z         Device [#0]:
2024-12-17T21:51:33.6367627Z         Type              : gpu
2024-12-17T21:51:33.6367795Z         Version           : 12.0.0
2024-12-17T21:51:33.6367990Z         Name              : Intel(R) Iris(R) Xe Graphics
2024-12-17T21:51:33.6368208Z         Vendor            : Intel(R) Corporation
2024-12-17T21:51:33.6368406Z         Driver            : 1.6.31294.120000
2024-12-17T21:51:33.6368594Z         UUID              : 13412873154100002000000
2024-12-17T21:51:33.6368785Z         DeviceID          : 39497
2024-12-17T21:51:33.6368958Z         Num SubDevices    : 0
2024-12-17T21:51:33.6369124Z         Num SubSubDevices : 0
2024-12-17T21:51:33.6370909Z         Aspects           : gpu fp16 online_compiler online_linker queue_profiling usm_device_allocations usm_host_allocations usm_shared_allocations ext_intel_pci_address ext_intel_gpu_eu_count ext_intel_gpu_eu_simd_width ext_intel_gpu_slices ext_intel_gpu_subslices_per_slice ext_intel_gpu_eu_count_per_subslice atomic64 ext_intel_device_info_uuid ext_intel_gpu_hw_threads_per_eu ext_intel_device_id ext_intel_memory_clock_rate ext_intel_memory_bus_width ext_intel_legacy_image ext_intel_esimd ext_oneapi_ballot_group ext_oneapi_fixed_size_group ext_oneapi_opportunistic_group ext_oneapi_tangle_group ext_oneapi_limited_graph ext_oneapi_private_alloca ext_oneapi_queue_profiling_tag ext_oneapi_virtual_mem ext_oneapi_virtual_functions
2024-12-17T21:51:33.6374960Z         info::device::sub_group_sizes: 8 16 32
2024-12-17T21:51:33.6375204Z         Architecture: intel_gpu_tgllp
2024-12-17T21:51:33.6375396Z Platform [#2]:
2024-12-17T21:51:33.6375542Z     Version  : OpenCL 3.0 
2024-12-17T21:51:33.6375717Z     Name     : Intel(R) OpenCL Graphics
2024-12-17T21:51:33.6375928Z     Vendor   : Intel(R) Corporation
2024-12-17T21:51:33.6376101Z     Devices  : 1
2024-12-17T21:51:33.6376228Z         Device [#0]:
2024-12-17T21:51:33.6376363Z         Type              : gpu
2024-12-17T21:51:33.6376679Z         Version           : OpenCL 3.0 NEO 
2024-12-17T21:51:33.6376879Z         Name              : Intel(R) Iris(R) Xe Graphics
2024-12-17T21:51:33.6377079Z         Vendor            : Intel(R) Corporation
2024-12-17T21:51:33.6377257Z         Driver            : 24.39.31294.12
2024-12-17T21:51:33.6377432Z         UUID              : 13412873154100002000000
2024-12-17T21:51:33.6377605Z         DeviceID          : 39497
2024-12-17T21:51:33.6377771Z         Num SubDevices    : 0
2024-12-17T21:51:33.6377928Z         Num SubSubDevices : 0
2024-12-17T21:51:33.6379165Z         Aspects           : gpu fp16 online_compiler online_linker queue_profiling usm_device_allocations usm_host_allocations usm_shared_allocations ext_intel_pci_address ext_intel_gpu_eu_count ext_intel_gpu_slices ext_intel_gpu_subslices_per_slice ext_intel_gpu_eu_count_per_subslice atomic64 ext_intel_device_info_uuid ext_oneapi_srgb ext_intel_gpu_hw_threads_per_eu ext_intel_device_id ext_intel_legacy_image ext_intel_esimd ext_oneapi_ballot_group ext_oneapi_fixed_size_group ext_oneapi_opportunistic_group ext_oneapi_tangle_group ext_oneapi_private_alloca ext_oneapi_atomic16 ext_oneapi_virtual_functions
2024-12-17T21:51:33.6380418Z         info::device::sub_group_sizes: 8 16 32
2024-12-17T21:51:33.6380610Z         Architecture: intel_gpu_tgllp
2024-12-17T21:51:33.6380773Z Platform [#3]:
2024-12-17T21:51:33.6380902Z     Version  : OpenCL 3.0 LINUX
2024-12-17T21:51:33.6381057Z     Name     : Intel(R) OpenCL
2024-12-17T21:51:33.6381215Z     Vendor   : Intel(R) Corporation
2024-12-17T21:51:33.6381375Z     Devices  : 1
2024-12-17T21:51:33.6381500Z         Device [#1]:
2024-12-17T21:51:33.6381631Z         Type              : cpu
2024-12-17T21:51:33.6381791Z         Version           : OpenCL 3.0 (Build 0)
2024-12-17T21:51:33.6382078Z         Name              : 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80GHz
2024-12-17T21:51:33.6382287Z         Vendor            : Intel(R) Corporation
2024-12-17T21:51:33.6382469Z         Driver            : 2024.18.10.0.08_160000
2024-12-17T21:51:33.6382649Z         DeviceID          : 526017
2024-12-17T21:51:33.6543853Z         Num SubDevices    : 0
2024-12-17T21:51:33.6544081Z         Num SubSubDevices : 0
2024-12-17T21:51:33.6545480Z         Aspects           : cpu fp16 fp64 online_compiler online_linker queue_profiling usm_device_allocations usm_host_allocations usm_shared_allocations usm_system_allocations ext_intel_gpu_slices ext_intel_gpu_subslices_per_slice ext_intel_gpu_eu_count_per_subslice usm_atomic_host_allocations usm_atomic_shared_allocations atomic64 ext_oneapi_srgb ext_oneapi_native_assert ext_intel_gpu_hw_threads_per_eu ext_intel_device_id ext_intel_legacy_image ext_oneapi_ballot_group ext_oneapi_fixed_size_group ext_oneapi_opportunistic_group ext_oneapi_tangle_group ext_oneapi_private_alloca ext_oneapi_virtual_functions
2024-12-17T21:51:33.6546895Z         info::device::sub_group_sizes: 4 8 16 32 64
2024-12-17T21:51:33.6547098Z         Architecture: x86_64
2024-12-17T21:51:33.6547261Z Platform [#4]:
2024-12-17T21:51:33.6547473Z     Version  : OpenCL 1.2 Intel(R) FPGA SDK for OpenCL(TM), Version 20.3
2024-12-17T21:51:33.6547755Z     Name     : Intel(R) FPGA Emulation Platform for OpenCL(TM)
2024-12-17T21:51:33.6547975Z     Vendor   : Intel(R) Corporation
2024-12-17T21:51:33.6548146Z     Devices  : 1
2024-12-17T21:51:33.6548284Z         Device [#2]:
2024-12-17T21:51:33.6548429Z         Type              : fpga
2024-12-17T21:51:33.6548599Z         Version           : OpenCL 1.2 
2024-12-17T21:51:33.6548800Z         Name              : Intel(R) FPGA Emulation Device
2024-12-17T21:51:33.6549007Z         Vendor            : Intel(R) Corporation
2024-12-17T21:51:33.6549204Z         Driver            : 2024.18.10.0.08_160000
2024-12-17T21:51:33.6549390Z         DeviceID          : 526017
2024-12-17T21:51:33.6752831Z         Num SubDevices    : 0
2024-12-17T21:51:33.6754179Z         Num SubSubDevices : 0
2024-12-17T21:51:33.6762258Z         Aspects           : accelerator fp64 online_compiler online_linker queue_profiling usm_device_allocations usm_host_allocations usm_shared_allocations ext_intel_gpu_slices ext_intel_gpu_subslices_per_slice ext_intel_gpu_eu_count_per_subslice usm_atomic_host_allocations usm_atomic_shared_allocations ext_oneapi_srgb ext_intel_gpu_hw_threads_per_eu ext_intel_device_id ext_oneapi_ballot_group ext_oneapi_fixed_size_group ext_oneapi_opportunistic_group ext_oneapi_tangle_group ext_intel_fpga_task_sequence ext_oneapi_private_alloca
2024-12-17T21:51:33.6769624Z         info::device::sub_group_sizes: 4 8 16 32 64
2024-12-17T21:51:33.6770811Z         Architecture: unknown
2024-12-17T21:51:33.6771610Z Platform [#5]:
2024-12-17T21:51:33.6772245Z     Version  : 0.1
2024-12-17T21:51:33.6772914Z     Name     : SYCL_NATIVE_CPU
2024-12-17T21:51:33.6773714Z     Vendor   : tbd
2024-12-17T21:51:33.6774373Z     Devices  : 1
2024-12-17T21:51:33.6774999Z         Device [#0]:
2024-12-17T21:51:33.6775737Z         Type              : cpu
2024-12-17T21:51:33.6776593Z         Version           : 0.1
2024-12-17T21:51:33.6777421Z         Name              : SYCL Native CPU
2024-12-17T21:51:33.6778340Z         Vendor            : Intel(R) Corporation
2024-12-17T21:51:33.6779241Z         Driver            : 0.0.0
2024-12-17T21:51:33.6780130Z         DeviceID          : UNKNOWN
2024-12-17T21:51:33.6781039Z         Num SubDevices    : 0
2024-12-17T21:51:33.6781927Z         Num SubSubDevices : 0
2024-12-17T21:51:33.6784998Z         Aspects           : cpu fp16 fp64 online_compiler online_linker queue_profiling usm_device_allocations usm_host_allocations usm_shared_allocations usm_system_allocations usm_atomic_host_allocations usm_atomic_shared_allocations atomic64
2024-12-17T21:51:33.6788095Z         info::device::sub_group_sizes: 1
2024-12-17T21:51:33.6789016Z         Architecture: unknown
2024-12-17T21:51:33.6791320Z default_selector()      : gpu, Intel(R) oneAPI Unified Runtime over Level-Zero, Intel(R) Iris(R) Xe Graphics 12.0.0 [1.6.31294.120000]
2024-12-17T21:51:33.6794207Z accelerator_selector()  : fpga, Intel(R) FPGA Emulation Platform for OpenCL(TM), Intel(R) FPGA Emulation Device OpenCL 1.2  [2024.18.10.0.08_160000]
2024-12-17T21:51:33.6796865Z cpu_selector()          : cpu, Intel(R) OpenCL, 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80GHz OpenCL 3.0 (Build 0) [2024.18.10.0.08_160000]
2024-12-17T21:51:33.6799087Z gpu_selector()          : gpu, Intel(R) oneAPI Unified Runtime over Level-Zero, Intel(R) Iris(R) Xe Graphics 12.0.0 [1.6.31294.120000]
2024-12-17T21:51:33.6801120Z custom_selector(gpu)    : gpu, Intel(R) oneAPI Unified Runtime over Level-Zero, Intel(R) Iris(R) Xe Graphics 12.0.0 [1.6.31294.120000]
2024-12-17T21:51:33.6803352Z custom_selector(cpu)    : cpu, Intel(R) OpenCL, 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80GHz OpenCL 3.0 (Build 0) [2024.18.10.0.08_160000]
2024-12-17T21:51:33.6805946Z custom_selector(acc)    : fpga, Intel(R) FPGA Emulation Platform for OpenCL(TM), Intel(R) FPGA Emulation Device OpenCL 1.2  [2024.18.10.0.08_160000]

Additional context

No response

@lbushi25 lbushi25 added bug Something isn't working native-cpu confirmed labels Dec 18, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working confirmed native-cpu
Projects
None yet
Development

No branches or pull requests

1 participant